Gene GM21_1665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1665 
Symbol 
ID8136996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1936602 
End bp1937807 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content63% 
IMG OID644869278 
Productglycosyl transferase family 2 
Protein accessionYP_003021478 
Protein GI253700289 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones169 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCATTGT GGAGGGGCCC CCCCCATAAC TCCTGTGACA GCGCCCATCT TGCCATGAAC 
ATCCTTTTCC TCATTGCCGC AGCGACACTC ACCCTTACTT GCCTTGCGGG ACTCGAGGTC
GTCACCGGCG CCCGCCGGGT CAAGGCCTTG CGGGAGATGG ATCCTGGCAT GGCGACCAAC
CTGCCGCTTG TCTCCATCGT CGTCGCCGCC AGAAACGAAG AGCGCAACAT CCGCGAGGCG
CTGCAATCGC TTTTCGACCT CGACTACCCC GATTACGAGC TGATCGTGGT GGACGACCGC
TCAGACGACG AAACCGGCTG CATCCTGGAA CGGATGTCCC GGGAACGCCA GCGGCTTCGC
GTAATCCACG TGGAATCCCT CCCGCAGGAT TGGCTCGGCA AGAACCACGC GCTCTGGGTG
GGGAGCCGAC TGGCCCGCGG GGAATACCTC CTTTTCACCG ACGCCGATAT CGTGATGGAG
CCGACGGTGC TGACGAGGGC GGTGATGTTC ATGCGAAAGC ACCGCCTGGA CCACCTGGCA
GCTACCCCGT CGCTGCGGAT GCCGAGCCTC TTCCTCGACA TGTTCGGCGC TTCCTTCATC
ATCTTCTTTT CCCTTTTTGC TCGTCCCTGG AAGGCGCGCG ACCCAAAAAG CCGCTGCCAT
ATCGGCATCG GCGCCTTCAA CATGCTGCGC GCCGACGCCT ACCGCGGCAT CGGCGGACAC
GAGGCGATCC GCTTGAGGCC GGACGACGAC ATCAAGCTCG GCAAGCTGAT CAAAAAGGCG
GGGCTGCGGC AGGAGGCGGC ATACGCGCCG GAATTCCTTT TCGTCGAGTG GTACGCCTCG
GTGGGGGAGG TGGTGCGGGG GTTGGAGAAA AACGCCTTCG CCGGCACCGA CTATAGCGTC
CCGCTCGTGC TCGCCGGTGT TCTAGGGCAG ACCGTCTGCG GCATCTGGCC CTTCCTGGCC
ATCTTCTTAA CCGGCGGCGC CGTCCAGGCG ATGTACCTCG CCACGGTCCT CGTCACGCTA
ATGGTGGTCG CCGACAGCGC CCGGTTCCAC CGCTCCCGCC CCTGGTACGC CATCGCCTAT
CCGCTCACCT CCGCGCTCTT CGTCTACATC CTGATGCGCA CCATGCTGCT GAACCTCTGG
CAGGGGGGGA TCTACTGGCG CGGGACCTTC TATCCCCTTA AGGAACTCAA GAAAAACCGC
GTCTAG
 
Protein sequence
MALWRGPPHN SCDSAHLAMN ILFLIAAATL TLTCLAGLEV VTGARRVKAL REMDPGMATN 
LPLVSIVVAA RNEERNIREA LQSLFDLDYP DYELIVVDDR SDDETGCILE RMSRERQRLR
VIHVESLPQD WLGKNHALWV GSRLARGEYL LFTDADIVME PTVLTRAVMF MRKHRLDHLA
ATPSLRMPSL FLDMFGASFI IFFSLFARPW KARDPKSRCH IGIGAFNMLR ADAYRGIGGH
EAIRLRPDDD IKLGKLIKKA GLRQEAAYAP EFLFVEWYAS VGEVVRGLEK NAFAGTDYSV
PLVLAGVLGQ TVCGIWPFLA IFLTGGAVQA MYLATVLVTL MVVADSARFH RSRPWYAIAY
PLTSALFVYI LMRTMLLNLW QGGIYWRGTF YPLKELKKNR V