Gene Information |
Locus tag | GM21_0849 |
Symbol | |
ID | 8136165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1006288 |
End bp | 1007298 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644868460 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003020674 |
Protein GI | 253699485 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 107 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTAACC ACATTCCGGC CATTTCCATC CTGATGCCGG TGAGAAACGA GGAGCGGTTC CTGCCGGCGG CGCTCCGCTC GCTCGCGGCC CAGACCTTTG CCGATTGGGA GCTTTTGGCT GTGGATGACG GCTCGACCGA CGGGACCCCC CGCGTCCTGG CCGAGGCGGC GAAAAACGAC CCGCGCATCC GGGTGCTTCA CTGCGGAAAG GGGCTGGTCC CCGCCTTGAA CCTGGGGCTG AAAGAGTGCC GGGCCCAGCT TGTCGCCCGG ATGGACGGCG ACGATATCGC GCACCCGCAA AGACTCGCGG CGCAGGTGGC TTTCCTGGCC GCCCGCCCCG GGACAGGGCT CGTTGCCTGC TCTTTCAAGC ACTTCCCGCG GCAGCAGGTA GGCCTCGGGA TGGCGGGGTA CGAAAAGTGG CAGAACCGGC TCATCAGCCA TGAGGAGATA GCCGCAGACC TCTTCGTCGA GTCCCCTTTC GTGCACCCGA GCGTTATGTA CCGCAGGTCG GATGTAGAGC AGTTGGGCGG CTACCGCGAC AAAGGATGGC CGGAGGATTA CGACCTGTGG CTGCGGCTTG CCGCCGCGCA AGTAAAGTTC GCACGGCTCC CCGAGACTCT GTTCTTCTGG CGAGAGCGCC CCGAGCGGAC CACGCGCACC AATCCGGCCT ATGCGCCCGA CGCCTTTAGG CGCTGTAAGC TGCACCACCT GATGAACGGG TTTCTGAAAG GGGAAAGCGA GGTCATCCTG GCCGGAGCGG GTCTGGAGGG GCGGGCGTGG TATCGCCTGC TGCGGGAGGA GGGAATCAGG GTCTCCACCT GGCTCGACGT CGATCCCCGC AAGATCGGGC GGGAGCTGCA CGGTGCCCCG GTACTTGCCA CCGGCCAGGT GAGGGCATCC GGGGTCAAGA TGCTGATGAC GGTAGGCGCT CGGGGGGCTC GGGCGCTGGT GCGGGCATCC TCCTCGAAAG CGGGGTTCGT CGAAGGAATC GACGCCGTCT GCGTCGCTTG A
|
Protein sequence | MLNHIPAISI LMPVRNEERF LPAALRSLAA QTFADWELLA VDDGSTDGTP RVLAEAAKND PRIRVLHCGK GLVPALNLGL KECRAQLVAR MDGDDIAHPQ RLAAQVAFLA ARPGTGLVAC SFKHFPRQQV GLGMAGYEKW QNRLISHEEI AADLFVESPF VHPSVMYRRS DVEQLGGYRD KGWPEDYDLW LRLAAAQVKF ARLPETLFFW RERPERTTRT NPAYAPDAFR RCKLHHLMNG FLKGESEVIL AGAGLEGRAW YRLLREEGIR VSTWLDVDPR KIGRELHGAP VLATGQVRAS GVKMLMTVGA RGARALVRAS SSKAGFVEGI DAVCVA
|
| |
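The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the stop codon; both assumptions agree with the values listed above but are not stated by the record. The function names are illustrative and do not belong to any database API.

```python
# Sketch: cross-check the length fields of a gene record.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the gene length includes the terminal stop codon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length in bp of a gene spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1

def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1

def gc_percent(seq: str) -> float:
    """GC content in percent, ignoring the spaces between 10-base blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

length = gene_length_bp(1006288, 1007298)
print(length)                     # 1011
print(protein_length_aa(length))  # 336
# First two blocks of the gene sequence only; paste the full sequence
# to compare against the GC content field.
print(gc_percent("ATGCTTAACC ACATTCCGGC"))  # 50.0
```

Passing the full Gene sequence string to gc_percent gives a value that can be compared with the rounded GC content field.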