Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmc1_2413 |
Symbol | |
ID | 4480977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Magnetococcus sp. MC-1 |
Kingdom | Bacteria |
Replicon accession | NC_008576 |
Strand | - |
Start bp | 3079886 |
End bp | 3080965 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639723161 |
Product | cellulose biosynthesis protein CelD |
Protein accession | YP_866319 |
Protein GI | 117925702 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.92081 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0825886 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACCC TAACCCCCCA ACAGGTTTTT AGCGTGGAGA TCGTGCGCGA GGTAAATAGC TTTGTGCAGA TTACCCATAG CGTGGTCGCG CAGCAGCCTT GGCAGCACAA TGCCACAGCC TATTATAACA ACCAACCGGT ATGGGTTGCG CTGGCTTGGC AATATATGGA ACAAGCGCGG GGTTCCCGCC CCTTTATCGT GCTGGTGCGC GATGGGGCGG GTCAGATCGC CGGTTGGTGG CCCTTGCTGT TGAGTAAACG CTCCCTGGGG TGGCGTTTAC AAGGGTTTGG TCAAGAGTAT TCGGATTATG TGGCCCCCTA TATTCATACC CCTTACCTAG CAGTACAACC CGCAATCCAG AGCGCCCTGG CTGCGGCGCT GTGCGAACAC CGCGCCGCCT TTAGCATGGC CTTTTTTCCA TCATTGCTGT GGCAGGGTGG GGTAGAGAGT CTGCTGCGTG ATCGCTGTGG CTGGCAGCAC CAGCGGACAT CCCTTAACCT CTATCTGCAC TGGCAGGCCG AGGAGGGTTT GGATGCGCTC ATGGAGCGCC TGCACTCCAA CAAATACCGT AAATTGCTGC GTTATGAGCA GCGCCGCTTG GAAAGCATGG GGGATTTGCG TTGCGAAAGC ATTACCACAC AAGCGCCCCT GGAAGAGGCC GAACGCTTTT TCCGGGCCAA CTATAGCCAC GGGGCGGAGC AGGCGAAGGT GGATCTTTGG TTTGCTTATA TTAGGGCGAC ATTGGGGGGG CAGAGCCGCC TCTCGCTGCT TAGCTTGGAT GGCGAGATTA TCAACATGAT CCTCTGGTTT CCACGGGGTG GGCAGGTGGA CTTTTTTAGT ACCGTCTATG CGCAAAAGCT GGCCAAATTG AGCCCAGGTA AAACCCATCT CTACCTCTTT ATCGCCCAAC TCTTCAATCA GGGTGAGGGG GGTATGCTTA ACTTTCTCTC TGGGGACGAA CCCTACAAAC AGCGCTGGGC GACCGGACAT TTTGAGAGCT ATCGGCTGTT TTTGCACCAT TTTCGCACCC CGACGGCGGC TATGCTGGCA TTAAAGCCGC TGCTTAAACG GCTGTTTTAA
|
Protein sequence | MQTLTPQQVF SVEIVREVNS FVQITHSVVA QQPWQHNATA YYNNQPVWVA LAWQYMEQAR GSRPFIVLVR DGAGQIAGWW PLLLSKRSLG WRLQGFGQEY SDYVAPYIHT PYLAVQPAIQ SALAAALCEH RAAFSMAFFP SLLWQGGVES LLRDRCGWQH QRTSLNLYLH WQAEEGLDAL MERLHSNKYR KLLRYEQRRL ESMGDLRCES ITTQAPLEEA ERFFRANYSH GAEQAKVDLW FAYIRATLGG QSRLSLLSLD GEIINMILWF PRGGQVDFFS TVYAQKLAKL SPGKTHLYLF IAQLFNQGEG GMLNFLSGDE PYKQRWATGH FESYRLFLHH FRTPTAAMLA LKPLLKRLF
|
| |