Gene EcHS_A4549 details

Gene Information

Locus tag: EcHS_A4549
Symbol:
ID: 5591794
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4557909
End bp: 4559369
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 58%
IMG OID: 640923645
Product: mannitol dehydrogenase family protein
Protein accession: YP_001461085
Protein GI: 157163767
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0246] Mannitol-1-phosphate/altronate dehydrogenases
TIGRFAM ID:
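
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable names are hypothetical and not part of the database.

```python
# Values copied from the record above.
start_bp = 4557909
end_bp = 4559369
gene_length_bp = 1461
protein_length_aa = 486

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein length.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # prints: 1461 0 486
```

Under these assumptions the span matches the listed gene length, is a multiple of three, and yields the listed protein length.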


Plasmid Coverage information

Num covering plasmid clones: 54
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTACTA TTGTTGACAG CAATCTGCCG GTTGCCCGCC CGTCATGGGA TCATTCTCGT 
CTGGAATCAC GCATTGTGCA TCTCGGTTGC GGGGCGTTTC ACCGCGCGCA CCAGGCGCTG
TATACCCATC ATCTGCTGGA AAGCACCGAC AGCGACTGGG GCATCTGCGA AGTTAACCTG
ATGCCAGGCA ACGACCGCGT GCTGATCGAA AACCTGAAAA AACAGCAACT GCTGTACACC
GTAGCGGAAA AAGGCGCAGA AAGCACCGAG CTGAAAATTA TCGGTTCGAT GAAAGAAGCG
CTGCACCCGG AAATCGACGG CTGCGAAGGT ATTCTCAACG CGATGGCGCG TCCGCAAACG
GCGATTGTCT CTCTGACGGT CACGGAAAAA GGCTACTGCG CTGATGCGGC AAGCGGTCAA
CTGGATCTCA ATAACCCGCT GATCAAGCAC GATCTGGAAA ACCCGACTGC GCCGAAGTCT
GCGATTGGTT ACATCGTCGA AGCGTTGCGT CTGCGTCGTG AGAAAGGGTT GAAAGCGTTT
ACGGTGATGT CCTGCGACAA CGTGCGTGAA AACGGTCATG TGGCGAAGGT CGCGGTTCTG
GGGCTGGCTC AGGCGCGTGA CCCGCAGCTG GCGGCATGGA TTGAAGAGAA CGTCACCTTC
CCGTGCACCA TGGTTGACCG CATCGTTCCG GCGGCGACGC CAGAAACCTT ACAGGAAATT
GCTGACCAGC TGGGTGTTTA CGACCCGTGC GCCATTGCCT GCGAACCGTT CCGTCAGTGG
GTGATTGAAG ATAACTTCGT TAATGGTCGC CCGGACTGGG ATAAAGTGGG CGCACAGTTC
GTTGCAGACG TTGTGCCGTT CGAAATGATG AAGCTGCGTA TGCTGAACGG CAGCCACTCT
TTCCTGGCAT ACCTCGGTTA CCTCGGCGGC TATGAAACTA TTGCCGACAC CATGACTAAC
CCGGATTATC GCAAAGCGGC CTTCTCCTTG ATGATGCAGG AACAAGCGCC AACGCTGTCT
ATGCCGGAGG GTACGGACCT GAAGGCCTAT GCGACGCTGC TGATCGAGCG TTTCAGCAAC
CCGTCTCTGC GTCACCGTAC CTGGCAGATT GCGATGGACG GCAGCCAGAA GTTACCGCAG
CGTCTGTTGG ATCCAGTGCG TCTGCACCTG CAAAACGGCG GCAGCTGGCG TCACCTGGCG
CTGGGCGTGG CTGGCTGGAT GCGTTACACC CAGGGCGTGG ATGAGCAGGG TAATGCCATT
GACGTGGTCG ACCCGATGCT GGCCGAGTTC CAGAAGATTA ACGCGCAGTA TCAGGGCGCA
GACCGCGTGA AAGCGCTGCT GGGCCTGAGC GGTATTTTTG CCGATGATCT GCCGCAGAAT
GCCGACTTTG TTGGCGCGGT GACGGCGGCA TATCAGCAGC TGTGCGAACG CGGTGCGCGC
GAGTGTGTGG CTGCGCTGTA A
 
Protein sequence
MTTIVDSNLP VARPSWDHSR LESRIVHLGC GAFHRAHQAL YTHHLLESTD SDWGICEVNL 
MPGNDRVLIE NLKKQQLLYT VAEKGAESTE LKIIGSMKEA LHPEIDGCEG ILNAMARPQT
AIVSLTVTEK GYCADAASGQ LDLNNPLIKH DLENPTAPKS AIGYIVEALR LRREKGLKAF
TVMSCDNVRE NGHVAKVAVL GLAQARDPQL AAWIEENVTF PCTMVDRIVP AATPETLQEI
ADQLGVYDPC AIACEPFRQW VIEDNFVNGR PDWDKVGAQF VADVVPFEMM KLRMLNGSHS
FLAYLGYLGG YETIADTMTN PDYRKAAFSL MMQEQAPTLS MPEGTDLKAY ATLLIERFSN
PSLRHRTWQI AMDGSQKLPQ RLLDPVRLHL QNGGSWRHLA LGVAGWMRYT QGVDEQGNAI
DVVDPMLAEF QKINAQYQGA DRVKALLGLS GIFADDLPQN ADFVGAVTAA YQQLCERGAR
ECVAAL
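
The gene and protein sequences above are printed in blocks of ten characters, so they need whitespace removed before use. The following is a minimal sketch of how the two listings could be compared. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGACTACTA TTGTTGACAG CAATCTGCCG GTTGCCCGCC CGTCATGGGA TCATTCTCGT"
last_line = "GAGTGTGTGG CTGCGCTGTA A"

print(translate(clean(first_line)))  # prints: MTTIVDSNLPVARPSWDHSR
print(translate(clean(last_line)))   # prints: ECVAAL
```

Both fragments agree with the start and end of the listed protein sequence. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.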