Gene EcE24377A_2987 details

Gene Information

Locus tag: EcE24377A_2987
Symbol: srlE
ID: 5590868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 2987060
End bp: 2988019
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 55%
IMG OID: 640926635
Product: PTS system, glucitol/sorbitol-specific, IIB component
Protein accession: YP_001464011
Protein GI: 157154733
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3732] Phosphotransferase system sorbitol-specific component IIBC
TIGRFAM ID: [TIGR00825] PTS system, glucitol/sorbitol-specific, IIBC component
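
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2987060, 2988019)
print(length, protein_length(length))  # 960 319
```

Under these assumptions the computed values agree with the listed Gene Length (960 bp) and Protein Length (319 aa).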


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCATA TTCGGATCGA AAAAGGAACG GGTGGCTGGG GCGGCCCGCT TGAGCTGAAA 
GCCACGCCGG GAAAAAAAAT CGTCTATATC ACCGCCGGTA CCCGGCCTGC GATTGTTGAC
AAACTGGCAC AGCTTACTGG CTGGCAGGCT ATTGACGGAT TTAAAGAAGG TGAACCCGCG
GAGGCGGAAA TTGGTGTCGC GGTAATCGAC TGTGGCGGCA CATTACGCTG CGGCATCTAT
CCGAAACGAC GTATTCCCAC CATTAATATC CACTCGACGG GCAAGTCCGG TCCGCTGGCG
CAGTACATTG TGGAAGATAT TTATGTCTCT GGCGTAAAAG AAGAAAACAT CACTGTAGTG
GGTGATGCGA CACCACAACC CTCTTCCGTG GGCCGTGACT ATGACACCAG TAAGAAAATC
ACCGAACAAA GCGATGGTTT ACTGGCGAAG GTGGGAATGG GCATGGGGTC CGCCGTTGCG
GTGCTGTTTC AATCTGGTCG TGACACCATC GACACTGTAT TAAAAACCAT TCTGCCGTTT
ATGGCATTCG TCTCGGCGCT CATTGGCATC ATTATGGCTT CTGGCCTTGG TGACTGGATT
GCCCACGGTC TTGCTCCGCT GGCGAGCCAT CCACTGGGTC TGGTCATGCT GGCGCTCATC
TGCTCCTTCC CACTGCTTTC ACCTTTCCTC GGCCCAGGCG CAGTTATCGC ACAGGTTATC
GGCGTATTGA TTGGCGTGCA GATTGGTCTC GGCAATATTC CGCCGCATCT GGCTTTACCG
GCACTGTTTG CCATCAACGC GCAGGCGGCC TGCGACTTCA TCCCGGTCGG TTTGTCGCTG
GCGGAAGCCC GTCAGGACAC GGTTCGCGTC GGTGTCCCTT CTGTACTGGT GAGCCGCTTT
TTAACCGGCG CACCGACTGT ACTGATCGCC TGGTTTGTCT CCGGTTTTAT CTATCAATAG
 
Protein sequence
MTHIRIEKGT GGWGGPLELK ATPGKKIVYI TAGTRPAIVD KLAQLTGWQA IDGFKEGEPA 
EAEIGVAVID CGGTLRCGIY PKRRIPTINI HSTGKSGPLA QYIVEDIYVS GVKEENITVV
GDATPQPSSV GRDYDTSKKI TEQSDGLLAK VGMGMGSAVA VLFQSGRDTI DTVLKTILPF
MAFVSALIGI IMASGLGDWI AHGLAPLASH PLGLVMLALI CSFPLLSPFL GPGAVIAQVI
GVLIGVQIGL GNIPPHLALP ALFAINAQAA CDFIPVGLSL AEARQDTVRV GVPSVLVSRF
LTGAPTVLIA WFVSGFIYQ
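
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the nucleotide sequence so it can be compared with the listed protein. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ in which start codons are permitted). The helper names `clean`, `gc_content`, and `translate` are illustrative only.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides of the gene sequence, as printed above.
head = "ATGACGCATA TTCGGATCGA AAAAGGAACG"
print(translate(head))  # MTHIRIEKGT
```

The same functions can be applied to the full gene sequence; the translated head matches the first ten residues of the protein sequence shown above.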