Gene ECH74115_1428 details


Gene Information

Locus tag: ECH74115_1428
Symbol: mdoG
ID: 6971733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1412044
End bp: 1413597
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 49%
IMG OID: 643385402
Product: glucan biosynthesis protein G
Protein accession: YP_002269896
Protein GI: 209398428
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3131] Periplasmic glucans biosynthesis protein
TIGRFAM ID:
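
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but that convention reproduces the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1412044
end_bp = 1413597
gene_length_bp = 1554
protein_length_aa = 517

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and contributes no residue.
coding_codons = codons - 1

print(span_bp, codons, remainder, coding_codons)  # 1554 518 0 517
```

Under those assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.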


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0037891
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.223049
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACATA AACTACAAAT GATGAAAATG CGTTGGTTGA GTGCTGCAGT AATGTTAACC 
CTGTATACAT CTTCAAGCTG GGCTTTCAGT ATTGATGATG TCGCAAAGCA AGCTCAATCC
TTAGCCGGGA AAGGCTATGA GGCGCCCAAA AGCAACTTGC CCTCCGTTTT CCGCGATATG
AAATACGCGG ACTATCAGCA GATCCAGTTT AATCATGACA AAGCGTACTG GAACAATCTG
AAGACCCCAT TCAAACTCGA GTTCTACCAT CAGGGTATGT ACTTCGATAC CCCGGTCAAA
ATAAATGAAG TGACTGCCAC CGCAGTCAAA CGAATCAAAT ACAGCCCGGA TTATTTCACT
TTCGGCGATG TTCAGCATGA CAAAGACACG GTAAAAGACC TTGGTTTTGC CGGTTTTAAA
GTGCTTTACC CGATCAACAG CAAAGATAAA AACGATGAAA TCGTCAGCAT GCTCGGGGCC
AGCTATTTCC GCGTGATTGG TGCAGGTCAG GTTTATGGCC TTTCTGCCCG CGGTCTGGCA
ATTGATACCG CCTTGCCATC GGGTGAAGAA TTTCCACGCT TCAAAGAGTT CTGGATCGAG
CGTCCAAAAC CAACTGATAA ACGTTTAACC ATTTATGCAT TGCTTGACTC GCCGCGCGCG
ACAGGTGCTT ACAAATTCGT AGTTATGCCA GGGCGTGACA CGGTTGTGGA TGTGCAGTCG
AAAATCTATC TGCGCGATAA AGTCGGCAAA CTGGGGGTTG CACCGTTAAC CAGTATGTTC
CTGTTTGGGC CGAACCAACC GTCGCCAGCA AATAACTATC GTCCGGAGTT GCACGACTCT
AACGGTCTCT CTATCCATGC CGGTAATGGC GAATGGATCT GGCGTCCGTT GAATAACCCG
AAACATTTAG CGGTCAGCAG CTTCTCCATG GAAAACCCGC AAGGCTTTGG TCTGTTGCAG
CGCGGTCGTG ATTTCTCCCG CTTTGAAGAT CTCGATGATC GTTACGATCT CCGTCCAAGC
GCATGGGTGA CTCCGAAAGG GGAGTGGGGT AAAGGCAGCG TTGAGCTGGT GGAAATTCCA
ACCAACGATG AAACTAACGA TAACATCGTC GCTTACTGGA CGCCGGATCA GCTGCCGGAG
CCGGGTAAAG AGATGAACTT TAAATACACC ATCACCTTCA GCCGTGATGA AGACAAACTG
CATGCGCCAG ATAACGCATG GGTGCAACAA ACGCGTCGTT CAACGGGGGA TGTGAAGCAG
TCGAACCTGA TTCGCCAGCC TGACGGTACT ATCGCCTTTG TGGTCGATTT TACCGGCGCA
GAGATGAAAA AACTGCCAGA GGATACCCCG GTCACAGCGC AAACCAGCAT TGGTGATAAT
GGTGAGATAG TTGAAAGCAC GGTGCGCTAT AACCCGGTCA CCAAAGGCTG GCGTCTGGTG
ATGCGTGTGA AAGTGAAAGA TGCCAAGAAA ACCACTGAAA TGCGCGCTGC GCTGGTGAAT
GCCGATCAGA CGTTGAGTGA AACCTGGAGC TACCAGTTAC CTGCCAATGA ATAA
 
Protein sequence
MKHKLQMMKM RWLSAAVMLT LYTSSSWAFS IDDVAKQAQS LAGKGYEAPK SNLPSVFRDM 
KYADYQQIQF NHDKAYWNNL KTPFKLEFYH QGMYFDTPVK INEVTATAVK RIKYSPDYFT
FGDVQHDKDT VKDLGFAGFK VLYPINSKDK NDEIVSMLGA SYFRVIGAGQ VYGLSARGLA
IDTALPSGEE FPRFKEFWIE RPKPTDKRLT IYALLDSPRA TGAYKFVVMP GRDTVVDVQS
KIYLRDKVGK LGVAPLTSMF LFGPNQPSPA NNYRPELHDS NGLSIHAGNG EWIWRPLNNP
KHLAVSSFSM ENPQGFGLLQ RGRDFSRFED LDDRYDLRPS AWVTPKGEWG KGSVELVEIP
TNDETNDNIV AYWTPDQLPE PGKEMNFKYT ITFSRDEDKL HAPDNAWVQQ TRRSTGDVKQ
SNLIRQPDGT IAFVVDFTGA EMKKLPEDTP VTAQTSIGDN GEIVESTVRY NPVTKGWRLV
MRVKVKDAKK TTEMRAALVN ADQTLSETWS YQLPANE
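
The sequences above are printed in blocks of ten characters, so they need the whitespace stripped before any analysis. The following is a minimal sketch of how a blocked gene sequence could be cleaned, translated, and checked for GC content. The helper names (clean_sequence, gc_fraction, translate) are hypothetical and not part of any database tool. The codon table uses the standard amino acid assignments, which translation table 11 shares with the standard code; the alternative start codons that table 11 permits are not handled here, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

# Standard amino acid assignments, codons ordered with bases T, C, A, G.
# Translation table 11 uses the same assignments; only the set of
# permitted start codons differs, and that is not modelled here.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Codons containing anything other than A, C, G, T become 'X'.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence listed above (60 bases).
first_line = (
    "ATGAAACATA AACTACAAAT GATGAAAATG CGTTGGTTGA GTGCTGCAGT AATGTTAACC"
)
dna = clean_sequence(first_line)
peptide = translate(dna)
print(len(dna), peptide)  # 60 MKHKLQMMKMRWLSAAVMLT
print(round(gc_fraction(dna), 2))
```

The translated excerpt matches the first 20 residues of the protein sequence above. The GC fraction printed here covers only this 60-base excerpt, so it should not be expected to equal the 49% reported for the whole gene; passing the full gene sequence to the same helpers would give the whole-gene figure.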