Gene EcHS_A2839 details


Gene Information

Locus tag: EcHS_A2839
Symbol: srlE
ID: 5594525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2843809
End bp: 2844768
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 55%
IMG OID: 640921956
Product: PTS system, glucitol/sorbitol-specific, IIB component
Protein accession: YP_001459467
Protein GI: 157162149
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3732] Phosphotransferase system sorbitol-specific component IIBC
TIGRFAM ID: [TIGR00825] PTS system, glucitol/sorbitol-specific, IIBC component
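
The coordinates and lengths listed above are mutually consistent if Start bp and End bp are read as 1-based, inclusive positions and if the gene length includes the stop codon. Both readings are assumptions; the record does not state its coordinate convention. A minimal Python sketch of that check:

```python
# Values copied from the record above.
start_bp = 2843809
end_bp = 2844768
gene_length_bp = 960
protein_length_aa = 319

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one terminal stop codon,
# which is not counted in the protein length.
codons = span_bp // 3
coding_codons = codons - 1

print(span_bp == gene_length_bp)           # span matches Gene Length
print(span_bp % 3 == 0)                    # span is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop match Protein Length
```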


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCATA TTCGGATCGA AAAAGGAACG GGTGGCTGGG GCGGCCCGCT TGAGCTGAAA 
GCCACGCCGG GCAAAAAAAT CGTCTATATC ACCGCCGGTA CCCGGCCTGC GATTGTTGAC
AAACTGGCAC AGCTTACTGG CTGGCAGGCT ATTGACGGAT TTAAAGAAGG TGAACCCGCG
GAGGCGGAAA TTGGCATTGC GGTAATCGAC TGTGGCGGCA CATTACGCTG CGGCATCTAT
CCGAAACGGC GTATTCCCAC CATTAATATC CACTCGACGG GCAAGTCCGG TCCGCTGGCG
CAGTACATTG TGGAAGATAT TTATGTCTCT GGCGTAAAAG AAGAAAACAT CACTGTAGTG
GGTGATGCGA CACCACACCC CTCTTCCGTG GGCCGTGACT ATGACACCAG CAAGAAAATC
ACCGAACAAA GCGATGGTTT ACTGGCGAAG GTGGGAATGA GTATGGGTTC TGCCGTTGCC
GTGTTGTTTC AATCTGGTCG TGACACCATC GACACTGTAT TAAAAACCAT TCTGCCGTTT
ATGGCATTCG TTTCGGCGCT CATTGGCATC ATTATGGCTT CTGGCCTTGG TGACTGGATT
GCCCACGGTC TTGCTCCGCT GGCGAGCCAT CCACTGGGTC TGGTCATGCT GGCGCTCATC
TGCTCCTTCC CGCTGCTTTC ACCTTTCCTC GGCCCAGGCG CAGTTATCGC ACAGGTTATC
GGCGTATTGA TTGGCGTGCA GATTGGTCTC GGCAATATTC CGCCGCATCT GGCTTTACCG
GCACTGTTTG CCATCAACGC GCAGGCGGCC TGCGACTTCA TCCCGGTCGG TTTGTCGCTG
GCGGAAGCCC GTCAGGACAC GGTTCGCGTC GGTGTCCCTT CTGTACTGGT GAGCCGCTTT
TTAACCGGCG CACCGACTGT ACTAATCGCC TGGTTTGTCT CCGGTTTTAT CTATCAATAG
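
The GC content reported in the record (55%) can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as displayed, in space-separated blocks of ten bases; the function names are illustrative and do not belong to any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Return the percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Only the first line of the gene sequence is used here for brevity;
# paste all lines to evaluate the full gene.
first_line = "ATGACGCATA TTCGGATCGA AAAAGGAACG GGTGGCTGGG GCGGCCCGCT TGAGCTGAAA"
print(len(clean_sequence(first_line)))
print(round(gc_content(first_line), 1))
```

Running `gc_content` on the full pasted sequence and rounding to the nearest integer allows a direct comparison with the value in the record.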
 
Protein sequence
MTHIRIEKGT GGWGGPLELK ATPGKKIVYI TAGTRPAIVD KLAQLTGWQA IDGFKEGEPA 
EAEIGIAVID CGGTLRCGIY PKRRIPTINI HSTGKSGPLA QYIVEDIYVS GVKEENITVV
GDATPHPSSV GRDYDTSKKI TEQSDGLLAK VGMSMGSAVA VLFQSGRDTI DTVLKTILPF
MAFVSALIGI IMASGLGDWI AHGLAPLASH PLGLVMLALI CSFPLLSPFL GPGAVIAQVI
GVLIGVQIGL GNIPPHLALP ALFAINAQAA CDFIPVGLSL AEARQDTVRV GVPSVLVSRF
LTGAPTVLIA WFVSGFIYQ
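
The protein sequence can be cross-checked by translating the gene sequence codon by codon. The sketch below is one way to do that, not the database's own procedure. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which codons may act as start codons), and it builds the codon table from the conventional TCAG base ordering. The function name `translate` is illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = "".join(text.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGACGCATA TTCGGATCGA AAAAGGAACG"))  # MTHIRIEKGT
```

The ten residues produced match the first block of the protein sequence. Passing the full gene sequence and comparing the result with the protein sequence (whitespace removed) extends the check to all 319 residues.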