Gene ECH74115_2158 details

Gene Information

Locus tag: ECH74115_2158
Symbol:
ID: 6968327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2069950
End bp: 2071410
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 50%
IMG OID: 643386053
Product: mannitol dehydrogenase family protein
Protein accession: YP_002270542
Protein GI: 209399925
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0246] Mannitol-1-phosphate/altronate dehydrogenases
TIGRFAM ID:
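
The coordinate and length fields above are related by simple arithmetic, which the following Python sketch checks. It assumes that Start bp and End bp are 1-based, inclusive coordinates on the replicon and that the gene length includes one terminal stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2069950
end_bp = 2071410
gene_length_bp = 1461
protein_length_aa = 486

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one stop codon that is not translated.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with Gene Length
print(remainder == 0)                            # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop agree with Protein Length
```

Under these assumptions all three checks print True for this record.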


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000417107
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 71
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAATA ATTTGTTATC AGCAAAAGCG ACACTCCCTG TTTATGATCG TAATAACCTG 
GCCCCAAGAA TTGTTCATTT AGGCTTTGGT GCATTTCACC GTGCGCATCA GGGCGTGTAT
GCCGATATTC TTGCTACAGA ACATTTCAGT GACTGGGGAT ATTATGAGGT CAACTTAATC
GGCGGCGAAC AGCAAATTGC CGATTTACAT CAGCAAGATA ATCTTTATAC CGTTGCGGAA
ATGTCGGCCG ATGCGTGGAC GGCTCGCGTC GTTGGCGTCG TTAAAAAAGC CTTGCACGTA
CAGATAGATG GCTTAGAAAC CGTGTTGGCT GCGATGTGTG AACCGCAAAT CGCGATTGTC
TCTCTGACAA TCACCGAAAA AGGGTATTTC CACTCTCCGG CGACCGGACA GTTAATGCTC
GATCACCCGA TGGTCGCTGC CGACGTGCAA AATCCCCACC AGCCGAAAAC AGCAACAGGG
GTGATTGTTG AGGCGCTGGC TCGCCGTAAA GCGGCAGGAC TTCCCGCATT TACCGTCATG
TCATGTGACA ACATGCCAGA AAACGGTCAT GTTATGCGTG ACGTTGTCAC TTCCTACGCA
CAAGCCGTTG ATGTAAAACT AGCACAATGG ATCGAAGATA ACGTGACTTT CCCATCAACA
ATGGTGGACC GTATTGTGCC CGCAGTGACA GAGGATACGC TGGCGAAAAT CGAACAGCTT
ACCGGTGTGC GCGATCCTGC TGGCGTTGCC TGTGAACCTT TCCGCCAGTG GGTAATCGAA
GATAACTTTG TTGCCGGACG TCCGGAATGG GAAAAAGCGG GAGCCGAACT GGTTAGTGAT
GTGCTGCCTT ATGAAGAGAT GAAGTTGCGC ATGCTCAACG GCAGTCATTC ATTCCTGGCG
TATCTGGGTT ATCTTGCCGG ATATCAGCAC ATTAATGACT GTATGGAAGA TGAACATTAT
CGTCATGCGG CGTATGCCTT GATGTTGCAG GAACAAGCGC CGACGCTGAA AGTGCAGGGC
GTTGATTTGC AAGATTACGC TAACCGATTA ATTGCACGCT ATAGCAACCC GGCGTTACGT
CATCGAACCT GGCAGATTGC GATGGATGGC AGCCAGAAAT TGCCACAGCG GATGTTGGAT
TCTGTTCGCT GGCATCTGGC GCATGACAGC AAGTTCGATC TGCTGGCGCT GGGCGTCGCG
GGTTGGATGC GTTATGTCGG TGGTGTTGAT GAACAGGGAA ATCCGATAGA AATCAGTGAC
CCACTGTTAC CTGTTATTCA GAAGGCTGTA CAAAGTAGTG CCGAAGGGAA AGCGCGCGTC
CAGTCATTGC TGGCGATTAA GGCGATTTTT GGTGGTGATT TGCCAGACAA TAGCTTGTTT
ACTGCAAAAG TGACGGAAGC GTACTTGTCT TTATTAGCGC ATGGTGCGAA AGCGACCGTG
GCGAAATATT CCGTGAAGTA A
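
A GC content figure such as the one in the Gene Information fields can be recomputed from a nucleotide sequence like the one above. The sketch below is a minimal illustration, assuming GC content means the fraction of G and C bases over all bases; the function name `gc_content` is hypothetical and not part of the source database. It strips the spaces and line breaks that group the sequence into blocks of 10.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to lay out the sequence.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGGGAAATA ATTTGTTATC AGCAAAAGCG ACACTCCCTG TTTATGATCG TAATAACCTG"
print(f"{gc_content(first_line):.0%}")
```

To check the whole gene, pass the full gene sequence text to `gc_content` instead of the first line.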
 
Protein sequence
MGNNLLSAKA TLPVYDRNNL APRIVHLGFG AFHRAHQGVY ADILATEHFS DWGYYEVNLI 
GGEQQIADLH QQDNLYTVAE MSADAWTARV VGVVKKALHV QIDGLETVLA AMCEPQIAIV
SLTITEKGYF HSPATGQLML DHPMVAADVQ NPHQPKTATG VIVEALARRK AAGLPAFTVM
SCDNMPENGH VMRDVVTSYA QAVDVKLAQW IEDNVTFPST MVDRIVPAVT EDTLAKIEQL
TGVRDPAGVA CEPFRQWVIE DNFVAGRPEW EKAGAELVSD VLPYEEMKLR MLNGSHSFLA
YLGYLAGYQH INDCMEDEHY RHAAYALMLQ EQAPTLKVQG VDLQDYANRL IARYSNPALR
HRTWQIAMDG SQKLPQRMLD SVRWHLAHDS KFDLLALGVA GWMRYVGGVD EQGNPIEISD
PLLPVIQKAV QSSAEGKARV QSLLAIKAIF GGDLPDNSLF TAKVTEAYLS LLAHGAKATV
AKYSVK
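
The protein sequence follows from the gene sequence through the codon assignments of translation table 11. The Python sketch below illustrates this with a hand-built codon table rather than an external library; the names `CODON_TABLE` and `translate` are hypothetical. It assumes the sequence is in frame from its first base and does not handle the alternative start codons that table 11 allows, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (bases T, C, A, G; last base varies fastest).
# Translation table 11 assigns the same amino acids as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to lay out the sequence.
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 21 bases of the gene sequence above.
print(translate("ATGGGAAATA ATTTGTTATC AGCAAAAGCG"))  # MGNNLLSAKA
print(translate("GCGAAATATT CCGTGAAGTA A"))           # AKYSVK
```

The two fragments reproduce the first and last residues of the protein sequence shown above.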