Gene EcSMS35_2081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2081 
SymbolmdoG 
ID6143444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2096427 
End bp2097980 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content50% 
IMG OID641616957 
Productglucan biosynthesis protein G 
Protein accessionYP_001744133 
Protein GI170682373 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.242328 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACATA AACTACAAAT GATGAAAATG CGTTGGTTGA GTGCTGCAGT AATGTTAACC 
CTGTATACAT CTTCAAGCTG GGCTTTCAGT ATTGATGATG TCGCAAAGCA AGCTCAATCC
TTAGCCGGGA AAGGCTATGA GGCGCCCAAA AGCAACTTGC CCTCCGTTTT CCGCGACATG
AAATACGCGG ACTATCAGCA GATCCAGTTT AATCATGACA AAGCGTACTG GAACAATCTG
AAGACCCCAT TCAAACTCGA GTTCTACCAT CAGGGTATGT ACTTCGATAC CCCGGTCAAA
ATAAATGAAG TGACTGCCAC CGCAGTCAAA CGAATCAAAT ACAGCCCGGA TTATTTCACT
TTCGGCGATG TTCAGCATGA CAAAGACACG GTAAAAGACC TTGGTTTTGC CGGTTTTAAA
GTGCTTTACC CGATCAACAG CAAAGATAAA AACGATGAAA TCGTCAGCAT GCTCGGGGCC
AGCTATTTCC GCGTGATTGG TGCAGGTCAG GTTTATGGCC TTTCTGCCCG CGGCCTGGCA
ATTGATACCG CCTTGCCATC GGGTGAAGAA TTTCCACGCT TCAAAGAGTT CTGGATCGAG
CGTCCAAAAC CGACTGATAA ACGTTTAACC ATTTATGCAT TGCTTGACTC GCCGCGCGCG
ACAGGTGCTT ACAAATTCGT GGTTATGCCA GGGCGTGACA CGGTTGTGGA TGTGCAGTCG
AAAATCTATC TGCGCGATAA AGTCGGCAAA CTGGGGGTTG CACCGTTAAC CAGTATGTTC
CTGTTTGGGC CGAACCAACC GTCGCCTGCA AATAACTATC GTCCGGAGTT GCACGACTCT
AACGGCCTGT CTATCCATGC TGGTAATGGC GAATGGATCT GGCGTCCGTT GAATAACCCG
AAACATTTAG CGGTCAGCAG CTTCTCGATG GAAAACCCGC AAGGCTTCGG TCTGTTGCAG
CGCGGTCGTG ATTTCTCCCG CTTTGAAGAT CTCGATGATC GTTACGATCT TCGTCCGAGC
GCATGGGTGA CTCCGAAAGG GGAGTGGGGC AAAGGCAGCG TTGAGCTGGT GGAAATTCCA
ACCAACGATG AAACCAACGA TAACATCGTC GCTTACTGGA CGCCGGATCA GCTGCCGGAG
CCGGGTAAAG AGATGAACTT TAAATACACC ATCACCTTCA GCCGTGATGA AGACAAACTG
CATGCACCAG ATAACGCATG GGTGCAACAA ACGCGTCGTT CAACGGGGGA TGTGAAGCAG
TCGAACCTGA TTCGCCAGCC TGACGGTACT ATCGCCTTTG TGGTCGATTT TACCGGCGCA
GAGATGAAAA AACTGCCAGA GGATACCCCG GTCACAGCGC AAACCAGCAT TGGTGATAAT
GGTGAGATAG TTGAAAGCAC GGTGCGCTAT AACCCGGTTA CCAAAGGCTG GCGTCTGGTG
ATGCGTGTGA AAGTGAAAGA TGCCAAGAAA ACCACTGAAA TGCGTGCTGC GCTGGTGAAT
GCCGATCAGA CGTTGAGTGA AACCTGGAGC TACCAGTTAC CTGCCAATGA ATAA
 
Protein sequence
MKHKLQMMKM RWLSAAVMLT LYTSSSWAFS IDDVAKQAQS LAGKGYEAPK SNLPSVFRDM 
KYADYQQIQF NHDKAYWNNL KTPFKLEFYH QGMYFDTPVK INEVTATAVK RIKYSPDYFT
FGDVQHDKDT VKDLGFAGFK VLYPINSKDK NDEIVSMLGA SYFRVIGAGQ VYGLSARGLA
IDTALPSGEE FPRFKEFWIE RPKPTDKRLT IYALLDSPRA TGAYKFVVMP GRDTVVDVQS
KIYLRDKVGK LGVAPLTSMF LFGPNQPSPA NNYRPELHDS NGLSIHAGNG EWIWRPLNNP
KHLAVSSFSM ENPQGFGLLQ RGRDFSRFED LDDRYDLRPS AWVTPKGEWG KGSVELVEIP
TNDETNDNIV AYWTPDQLPE PGKEMNFKYT ITFSRDEDKL HAPDNAWVQQ TRRSTGDVKQ
SNLIRQPDGT IAFVVDFTGA EMKKLPEDTP VTAQTSIGDN GEIVESTVRY NPVTKGWRLV
MRVKVKDAKK TTEMRAALVN ADQTLSETWS YQLPANE