Gene EcHS_A0778 details


Gene Information

Locus tag: EcHS_A0778
Symbol:
ID: 5595236
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 792120
End bp: 794036
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 55%
IMG OID: 640919954
Product: PTS system 2-O-a-mannosyl-D-glycerate specific transporter subunit IIABC
Protein accession: YP_001457528
Protein GI: 157160210
COG category: [G] Carbohydrate transport and metabolism
              [T] Signal transduction mechanisms
COG ID: [COG1299] Phosphotransferase system, fructose-specific IIC component
        [COG1445] Phosphotransferase system fructose-specific component IIB
        [COG1762] Phosphotransferase system mannitol/fructose-specific IIA domain (Ntr-type)
TIGRFAM ID: [TIGR00829] PTS system, fructose-specific, IIB component
            [TIGR00848] PTS system, fructose subfamily, IIA component
            [TIGR01427] PTS system, fructose subfamily, IIC component
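
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the CDS ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the gene record above.
START_BP = 792120
END_BP = 794036
GENE_LENGTH_BP = 1917
PROTEIN_LENGTH_AA = 638


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminating stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

For this record, 794036 - 792120 + 1 = 1917 bp, and 1917 / 3 - 1 = 638 aa, which matches the listed Gene Length and Protein Length.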


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCTGA CGACTCTGAC CCACCGCGAT GCGTTGTGTC TGAATGCGCG CTTTACCAGC 
CGTGAAGAGG CCATCCACGT GTTGACTCAA CGTCTTGCTG CTCTGGGGAA AATTTCCAGT
ACTGAGCAAT TTCTGGAAGA AGTGTATCGC CGTGAAAGCC TTGGCCCGAC CGCCTTAGGT
GAAGGGTTGG CTGTGCCGCA TGGCAAAACT GCTGCGGTAA AAGAAGCGGC GTTTGCGGTC
GCCACACTCA GCGAGCCGCT TCAGTGGGAA GGCGTTGATG GCCCGGAAGC AGTTGATTTA
GTGGTGCTGC TGGCGATTCC CCCCAATGAA GCGGGTACTA CGCATATGCA ACTGCTGACA
GCGCTGACCA CGCGCCTTGC GGATGATGAG ATTCGGGCGC GTATACAGTC GGCGACGACG
CCTGATGAGT TGCTCTCGGC GCTGGATGAC AAGGGAGGCA CGCAACCTTC TGCCTCTTTT
TCCAACGCGC CAACTATCGT CTGCGTAACG GCCTGTCCGG CGGGTATTGC TCACACCTAT
ATGGCTGCGG AATATCTGGA AAAAGCCGGA CGCAAACTCG GCGTAAATGT TTACGTTGAA
AAACAAGGCG CTAACGGCAT TGAAGGGCGT TTAACGGCGG ATCAACTCAA TAGTGCAACC
GCCTGTATTT TTGCGGCTGA AGTCGCCATC AAGGAGAGTG AGCGTTTTAA TGGCATTCCC
GCGCTTTCAG TGCCTGTTGC CGAGCCGATT CGCCATGCAG AAGCGTTGAT CCAACAAGCG
CTTACCCTCA AGCGTAGCGA TGAGACGCGT ACCGTACAGC AAGATACGCA ACCGGTGAAA
AGTGTCAAAA CGGAGCTGAA ACAGGCACTG TTGAGCGGAA TCTCTTTTGC CGTACCGTTG
ATTGTCGCGG GGGGCACGGT GCTGGCGGTC GCGGTATTAC TGTCGCAAAT CTTCGGGCTA
CAAGATCTGT TTAATGAAGA AAACTCCTGG CTGTGGATGT ACCGCAAGCT GGGCGGCGGG
CTGCTCGGAA TTTTGATGGT ACCGGTGCTC GCGGCCTATA CCGCCTATTC TCTGGCAGAT
AAACCGGCGT TAGCGCCAGG CTTTGCGGCT GGACTTGCCG CCAACATGAT CGGCTCCGGG
TTTCTCGGCG CGGTCGTTGG CGGATTGATA GCCGGTTACT TGATGCGCTG GGTGAAAAAT
CACTTGCGTC TTAGCAGTAA ATTCAATGGA TTCCTGACTT TTTATCTCTA CCCGGTGCTC
GGTACGTTGG GAGCGGGCAG TCTGATGCTG TTTGTGGTGG GGGAACCTGT CGCCTGGATC
AATAACTCGC TTACCGCCTG GCTGAACGGT CTGTCAGGAA GTAACGCGCT GTTGCTGGGT
GCCATTCTCG GTTTTATGTG TTCCTTTGAC CTTGGAGGGC CAGTGAATAA AGCCGCTTAT
GCATTCTGCC TGGGCGCAAT GGCGAACGGC GTTTACGGCC CGTATGCCAT TTTCGCCTCC
GTCAAAATGG TTTCGGCATT TACCGTAACC GCTTCCACGA TGCTCGCACC GCGCCTGTTT
AAAGAGTTTG AAATTGAGAC CGGGAAATCC ACCTGGCTGT TAGGGCTGGC AGGTATTACC
GAAGGGGCGA TCCCGATGGC GATTGAAGAT CCGCTGCGGG TTATTGGTTC GTTTGTGCTG
GGCTCTATGG TAACGGGCGC TATTGTCGGT GCGATGAATA TCGGCCTTTC GACACCCGGT
GCCGGCATTT TCTCGCTCTT TTTACTTCAT GATAATGGCG CGGGCGGTGT TATGGCGGCA
ATTGGCTGGT TTGGCGCGGC ATTGGTGGGG GCTGCAATCT CGACTGCAAT TCTCCTGATG
TGGCGGCGTC ACGCGGTTAA GCATGGCAAC TATCTGACTG ATGGCGTAAT GCCATAA
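
The gene sequence above is printed in blocks of ten bases, so the spaces and line breaks need to be stripped before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field in the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its reported value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above, used only as a short demo input.
    first_line = "ATGAATCTGA CGACTCTGAC CCACCGCGAT GCGTTGTGTC TGAATGCGCG CTTTACCAGC"
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_percent` gives a value that can be compared with the listed GC content.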
 
Protein sequence
MNLTTLTHRD ALCLNARFTS REEAIHVLTQ RLAALGKISS TEQFLEEVYR RESLGPTALG 
EGLAVPHGKT AAVKEAAFAV ATLSEPLQWE GVDGPEAVDL VVLLAIPPNE AGTTHMQLLT
ALTTRLADDE IRARIQSATT PDELLSALDD KGGTQPSASF SNAPTIVCVT ACPAGIAHTY
MAAEYLEKAG RKLGVNVYVE KQGANGIEGR LTADQLNSAT ACIFAAEVAI KESERFNGIP
ALSVPVAEPI RHAEALIQQA LTLKRSDETR TVQQDTQPVK SVKTELKQAL LSGISFAVPL
IVAGGTVLAV AVLLSQIFGL QDLFNEENSW LWMYRKLGGG LLGILMVPVL AAYTAYSLAD
KPALAPGFAA GLAANMIGSG FLGAVVGGLI AGYLMRWVKN HLRLSSKFNG FLTFYLYPVL
GTLGAGSLML FVVGEPVAWI NNSLTAWLNG LSGSNALLLG AILGFMCSFD LGGPVNKAAY
AFCLGAMANG VYGPYAIFAS VKMVSAFTVT ASTMLAPRLF KEFEIETGKS TWLLGLAGIT
EGAIPMAIED PLRVIGSFVL GSMVTGAIVG AMNIGLSTPG AGIFSLFLLH DNGAGGVMAA
IGWFGAALVG AAISTAILLM WRRHAVKHGN YLTDGVMP
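
The protein sequence above can be reproduced from the gene sequence using translation table 11, as listed in the record. The following is a minimal sketch that builds the codon table from the NCBI amino acid string for table 11 instead of relying on an external library. The function name `translate` is hypothetical. The sketch translates every codon through the table, so it does not handle alternative start codons (which table 11 allows and which would be read as methionine at the first position), and it raises `KeyError` on ambiguous bases.

```python
from itertools import product

# Amino acids for NCBI translation table 11, with codons enumerated in
# T, C, A, G order at each position (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGAATCTGA CGACTCTGAC CCACCGCGAT"))
```

The first 30 bases translate to MNLTTLTHRD, which matches the first block of the protein sequence, and the final codons GGC GTA ATG CCA TAA give the C-terminal residues GVMP followed by the stop.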