Gene ECH74115_2968 details


Gene Information

Locus tag: ECH74115_2968
Symbol: gmd
ID: 6968025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2744579
End bp: 2745697
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 39%
IMG OID: 643386808
Product: GDP-mannose 4,6-dehydratase
Protein accession: YP_002271276
Protein GI: 209400350
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1089] GDP-D-mannose dehydratase
TIGRFAM ID: [TIGR01472] GDP-mannose 4,6-dehydratase
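
The start, end, gene length, and protein length fields in this record are tied together by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (both are assumptions that happen to fit the listed values; the function names are illustrative and not part of the database).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2744579, 2745697)
print(length)                  # 1119
print(protein_length(length))  # 372
```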


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.0000191448
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACTAAAG TCGCTCTTAT TACAGGTGTA ACTGGACAAG ATGGATCTTA TCTAGCTGAG 
TTTTTGCTTG ATAAAGGGTA TGAAGTTCAT GGTATCAAAC GCCGAGCCTC ATCTTTTAAT
ACAGAACGCA TAGACCATAT TTATCAAGAT CCACATGGTT CTAACCCAAA TTTTCACTTG
CACTATGGAG ATCTGACTGA TTCATCTAAC CTCACTAGAA TTCTAAAGGA GGTACAGCCA
GATGAAGTAT ATAATTTAGC TGCTATGAGT CACGTAGCAG TTTCTTTTGA GTCTCCAGAA
TATACAGCCG ATGTCGATGC AATTGGTACA TTACGTTTAC TGGAAGCAAT TCGCTTTTTA
GGATTGGAAA ACAAAACGCG TTTCTATCAA GCTTCAACCT CAGAATTATA TGGACTTGTT
CAGGAAATCC CTCAAAAAGA ATCCACCCCT TTTTATCCTC GTTCCCCTTA TGCAGTTGCA
AAACTTTACG CATATTGGAT CACGGTAAAT TATCGAGAGT CATATGGTAT TTATGCATGT
AATGGTATAT TGTTCAATCA TGAATCTCCA CGCCGTGGAG AAACGTTTGT AACAAGGAAA
ATTACTCGAG GACTTGCAAA TATTGCACAA GGCTTGGAAT CATGTTTGTA TTTAGGGAAT
ATGGATTCGT TACGAGATTG GGGACATGCA AAAGATTATG TTAGAATGCA ATGGTTGATG
TTACAACAGG AGCAACCCGA AGATTTTGTG ATTGCAACAG GAGTCCAATA CTCAGTCCGT
CAGTTTGTCG AAATGGCAGC AGCACAACTT GGTATTAAGA TGAGCTTTGT TGGTAAAGGA
ATCGAAGAAA AAGGCATTGT AGATTCGGTT GAAGGACAGG ATGCTCCAGG TGTGAAACCA
GGTGATGTCA TTGTTGCTGT TGATCCTCGT TATTTCCGAC CAGCTGAAGT TGATACTTTG
CTTGGAGATC CGAGCAAAGC TAATCTCAAA CTTGGTTGGA GACCAGAAAT TACTCTTGCT
GAAATGATTT CTGAAATGGT TGCCAAAGAT CTTGAAGCCG CTAAAAAACA TTCTCTTTTA
AAATCGCATG GTTTTTCTGT AAGCTTAGCT CTGGAATGA
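
A GC content figure like the one in the Gene Information table can be recomputed from a sequence laid out in 10-base blocks as above. This is a minimal sketch, not the database's own method; `gc_percent` is a hypothetical helper, and only the first two blocks of the gene sequence are used in the example.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence shown above.
print(gc_percent("ATGACTAAAG TCGCTCTTAT"))  # 35.0
```

Passing the full gene sequence, with its spaces and line breaks left in, gives the value for the whole gene.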
 
Protein sequence
MTKVALITGV TGQDGSYLAE FLLDKGYEVH GIKRRASSFN TERIDHIYQD PHGSNPNFHL 
HYGDLTDSSN LTRILKEVQP DEVYNLAAMS HVAVSFESPE YTADVDAIGT LRLLEAIRFL
GLENKTRFYQ ASTSELYGLV QEIPQKESTP FYPRSPYAVA KLYAYWITVN YRESYGIYAC
NGILFNHESP RRGETFVTRK ITRGLANIAQ GLESCLYLGN MDSLRDWGHA KDYVRMQWLM
LQQEQPEDFV IATGVQYSVR QFVEMAAAQL GIKMSFVGKG IEEKGIVDSV EGQDAPGVKP
GDVIVAVDPR YFRPAEVDTL LGDPSKANLK LGWRPEITLA EMISEMVAKD LEAAKKHSLL
KSHGFSVSLA LE
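
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below builds the codon table from the standard NCBI ordering (bases T, C, A, G); table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons, which does not matter here because this gene starts with ATG. The `translate` helper is illustrative and assumes an uninterrupted forward-strand CDS with no ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGACTAAAG TCGCTCTTAT TACAGGTGTA"))  # MTKVALITGV
```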