Gene GM21_0849 details

Gene Information

Locus tag: GM21_0849
Symbol:
ID: 8136165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1006288
End bp: 1007298
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 66%
IMG OID: 644868460
Product: glycosyl transferase family 2
Protein accession: YP_003020674
Protein GI: 253699485
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
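
The coordinates and lengths in this record are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a terminal stop codon that counts toward the gene length but not the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(1006288, 1007298)
print(length)                  # 1011
print(protein_length(length))  # 336
```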


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 107
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTAACC ACATTCCGGC CATTTCCATC CTGATGCCGG TGAGAAACGA GGAGCGGTTC 
CTGCCGGCGG CGCTCCGCTC GCTCGCGGCC CAGACCTTTG CCGATTGGGA GCTTTTGGCT
GTGGATGACG GCTCGACCGA CGGGACCCCC CGCGTCCTGG CCGAGGCGGC GAAAAACGAC
CCGCGCATCC GGGTGCTTCA CTGCGGAAAG GGGCTGGTCC CCGCCTTGAA CCTGGGGCTG
AAAGAGTGCC GGGCCCAGCT TGTCGCCCGG ATGGACGGCG ACGATATCGC GCACCCGCAA
AGACTCGCGG CGCAGGTGGC TTTCCTGGCC GCCCGCCCCG GGACAGGGCT CGTTGCCTGC
TCTTTCAAGC ACTTCCCGCG GCAGCAGGTA GGCCTCGGGA TGGCGGGGTA CGAAAAGTGG
CAGAACCGGC TCATCAGCCA TGAGGAGATA GCCGCAGACC TCTTCGTCGA GTCCCCTTTC
GTGCACCCGA GCGTTATGTA CCGCAGGTCG GATGTAGAGC AGTTGGGCGG CTACCGCGAC
AAAGGATGGC CGGAGGATTA CGACCTGTGG CTGCGGCTTG CCGCCGCGCA AGTAAAGTTC
GCACGGCTCC CCGAGACTCT GTTCTTCTGG CGAGAGCGCC CCGAGCGGAC CACGCGCACC
AATCCGGCCT ATGCGCCCGA CGCCTTTAGG CGCTGTAAGC TGCACCACCT GATGAACGGG
TTTCTGAAAG GGGAAAGCGA GGTCATCCTG GCCGGAGCGG GTCTGGAGGG GCGGGCGTGG
TATCGCCTGC TGCGGGAGGA GGGAATCAGG GTCTCCACCT GGCTCGACGT CGATCCCCGC
AAGATCGGGC GGGAGCTGCA CGGTGCCCCG GTACTTGCCA CCGGCCAGGT GAGGGCATCC
GGGGTCAAGA TGCTGATGAC GGTAGGCGCT CGGGGGGCTC GGGCGCTGGT GCGGGCATCC
TCCTCGAAAG CGGGGTTCGT CGAAGGAATC GACGCCGTCT GCGTCGCTTG A
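
The GC content field can be recomputed from the gene sequence. The sketch below is one way to do it, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases; how the source database rounds its reported percentage is not stated, and the function name is illustrative.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (the block-of-ten spacing and line breaks) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


# First block of the gene sequence only; pass the full sequence for the gene-wide value.
print(gc_content("ATGCTTAACC"))  # 0.4
```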
 
Protein sequence
MLNHIPAISI LMPVRNEERF LPAALRSLAA QTFADWELLA VDDGSTDGTP RVLAEAAKND 
PRIRVLHCGK GLVPALNLGL KECRAQLVAR MDGDDIAHPQ RLAAQVAFLA ARPGTGLVAC
SFKHFPRQQV GLGMAGYEKW QNRLISHEEI AADLFVESPF VHPSVMYRRS DVEQLGGYRD
KGWPEDYDLW LRLAAAQVKF ARLPETLFFW RERPERTTRT NPAYAPDAFR RCKLHHLMNG
FLKGESEVIL AGAGLEGRAW YRLLREEGIR VSTWLDVDPR KIGRELHGAP VLATGQVRAS
GVKMLMTVGA RGARALVRAS SSKAGFVEGI DAVCVA
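
The protein sequence follows from the gene sequence by codon-wise translation. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted, so a minimal sketch can use the standard codon assignments. It does not handle alternative start codons or ambiguous bases (an unknown codon raises `KeyError`), and all names are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence, ignoring whitespace in the input."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    # Walk complete codons only; a trailing partial codon is dropped.
    for i in range(0, len(cleaned) - len(cleaned) % 3, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGCTTAACC ACATTCCGGC CATTTCCATC"))  # MLNHIPAISI
```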