Gene ECH74115_2029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2029 
SymbolmdoD 
ID6967079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1928991 
End bp1930610 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content51% 
IMG OID643385945 
Productglucan biosynthesis protein D 
Protein accessionYP_002270434 
Protein GI209396923 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.511288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0000000000014294 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCGCCG TGTGCGGTAC CAGCGGCATT GCTTCTCTTT TTTCTCAGGC GGCATTCGCG 
GCAGATTCTG ATATTGCCGA CGGGCAAACC CAGCGTTTTG ACTTCTCCAT TCTACAGTCA
ATGGCGCACG ACTTAGCGCA AACAGCGTGG CGTGGTGCGC CTCGTCCGTT ACCTGACACG
CTGGCGACAA TGACGCCGCA GGCTTATAAC AGTATTCAAT ACGACGCCGA AAAATCGCTC
TGGCATAACG TTGAGAACCG TCAACTGGAC GCTCAGTTCT TCCATATGGG AATGGGATTC
CGTCGCCGCG TTCGTATGTT TTCTGTAGAT CCAGCAACAC ATCTGGCGCG TGAAATTCAC
TTTCGCCCGG AGTTGTTCAA ATACAACGAT GCAGGTGTTG ATACCAAACA ATTAGAAGGG
CAAAGCGATC TCGGCTTTGC CGGTTTTCGC GTGTTTAAAG CCCCCGAACT GGCGCGCCGT
GATGTAGTAT CATTCCTCGG AGCGAGTTAT TTCCGCGCCG TTGATGACAC ATATCAATAC
GGTTTGTCGG CCCGCGGCCT GGCGATTGAC ACTTACACCG ACAGTAAAGA AGAGTTCCCC
GACTTTACCG CCTTCTGGTT TGATACGGTA AAACCGGGGG CAACTACCTT TACCGTTTAT
GCGTTGCTCG ATAGCGCCAG CATTACTGGT GCCTATAAGT TCACTATCCA TTGTGAGAAA
AGTCAGGTGA TTATGGATGT GGAAAATCAC CTGTATGCGC GCAAAGACAT TAAACAGCTG
GGCATTGCGC CGATGACCAG TATGTTCAGC TGCGGTACTA ATGAACGTCG GATGTGCGAC
ACCATTCATC CGCAAATTCA TGACTCTGAT CGTCTGTCCA TGTGGCGGGG CAACGGCGAG
TGGATTTGCC GTCCGCTGAA TAATCCGCAA AAATTGCAGT TCAATGCTTA CACCGACAAC
AACCCGAAAG GGTTTGGTTT ATTGCAACTG GATCGTGATT TCTCCCATTA TCAGGACATT
ATGGGCTGGT ATAACAAACG CCCAAGTCTG TGGGTGGAAC CGCGTAACAA GTGGGGTAAG
GGCACCATCG GCCTGATGGA AATCCCAACA ACGGGCGAAA CGCTGGATAA CATTGTCTGC
TTCTGGCAGC CAGAAAAAGC TGTAAAGGCG GGTGATGAGT TTGCATTCCA GTATCGTCTG
TACTGGAGTG CGCAACCGCC TGTTCATTGC CCATTAGCGC GCGTTATGGC GACGCGTACC
GGCATGGGCG GTTTCCCGGA AGGTTGGGCA CCAGGTGAAC ACTATCCCGA AAAATGGGCG
CGTCGTTTTG CCGTCGATTT CGTTGGTGGT GATCTGAAAG CTGCCGCACC AAAAGGCATT
GAGCCGGTGA TTACGCTTTC CAGTGGGGAA GCGAAGCAAA TCGAAATTCT CTATATTGAA
CCCATTGATG GTTATCGTAT TCAGTTTGAC TGGTATCCGA CTTCGGACTC CACTGATCCG
GTCGATATGC GGATGTATCT GCGTTGTCAG GGCGACGCTA TCAGTGAAAC ATGGCTGTAT
CAGTATTTCC CGCCAGCGCC GGATAAACGT CAGTATGTTG ACGACCGCGT GATGAGTTAA
 
Protein sequence
MAAVCGTSGI ASLFSQAAFA ADSDIADGQT QRFDFSILQS MAHDLAQTAW RGAPRPLPDT 
LATMTPQAYN SIQYDAEKSL WHNVENRQLD AQFFHMGMGF RRRVRMFSVD PATHLAREIH
FRPELFKYND AGVDTKQLEG QSDLGFAGFR VFKAPELARR DVVSFLGASY FRAVDDTYQY
GLSARGLAID TYTDSKEEFP DFTAFWFDTV KPGATTFTVY ALLDSASITG AYKFTIHCEK
SQVIMDVENH LYARKDIKQL GIAPMTSMFS CGTNERRMCD TIHPQIHDSD RLSMWRGNGE
WICRPLNNPQ KLQFNAYTDN NPKGFGLLQL DRDFSHYQDI MGWYNKRPSL WVEPRNKWGK
GTIGLMEIPT TGETLDNIVC FWQPEKAVKA GDEFAFQYRL YWSAQPPVHC PLARVMATRT
GMGGFPEGWA PGEHYPEKWA RRFAVDFVGG DLKAAAPKGI EPVITLSSGE AKQIEILYIE
PIDGYRIQFD WYPTSDSTDP VDMRMYLRCQ GDAISETWLY QYFPPAPDKR QYVDDRVMS