Gene EcSMS35_2826 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2826 
SymbolsrlE 
ID6145871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2901618 
End bp2902577 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content55% 
IMG OID641617695 
ProductPTS system, glucitol/sorbitol-specific, IIB component 
Protein accessionYP_001744850 
Protein GI170681652 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3732] Phosphotransferase system sorbitol-specific component IIBC 
TIGRFAM ID[TIGR00825] PTS system, glucitol/sorbitol-specific, IIBC component 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.204454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0000775819 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGCGTA TTCGGATCGA AAAAGGAACG GGTGGCTGGG GCGGCCCGCT TGAGCTGGAA 
GCCAAACCGG GAAAAAAAAT CGTCTATATC ACCGCCGGTA CCCGGCCTGC GATTGTTGAC
AAACTGGCAC AGCTTACTGG CTGGCAGGCT ATTGACGGAT TTAAAGAAGG TGAACCCGCG
GAGGCGGAAA TTGGTGTCGC GGTAATCGAC TGTGGCGGCA CATTACGCTG TGGCATCTAT
CCGAAACGGC GTATTCCCAC CATTAATATC CACTCGACGG GCAAGTCCGG CCCGCTGGCG
CAGTACATTG TGGAAGATAT TTATGTCTCT GGCGTAAAAG AAGAAAACAT CACTGTAGTG
GGTGATGCGA CACCACAACC CTCTTCCGTG GGCCGTGACT ATGACACCAG CAAGAAAATC
ACCGAACAAA GCGATGGTTT ACTGGCGAAG GTGGGAATGG GTATGGGTTC TGCCGTTGCC
GTGTTGTTTC AATCTGGTCG TGACACCATC GACACTGTAT TAAAAACCAT TCTGCCGTTT
ATGGCGTTCG TCTCGGCGCT TATCGGCATC ATTATGGCTT CTGGCCTTGG TGACTGGATT
GCCCACGGTC TTGCTCCGCT GGCGAGCCAT CCACTGGGTC TGGTCATGTT GGCGCTCATC
TGCTCCTTCC CGCTGCTTTC ACCTTTCCTC GGCCCAGGCG CAGTTATCGC GCAGGTTATC
GGCGTATTGA TTGGTGTGCA GATTGGTCTC GGCAATATTC CGCCGCATCT GGCTTTACCT
GCACTGTTTG CCATCAACGC GCAGGCGGCC TGCGACTTCA TCCCGGTCGG TTTGTCGCTG
GCGGAAGCTC GTCAGGACAC GGTTCGCGTC GGTGTCCCTT CTGTTCTGGT GAGCCGCTTT
TTAACCGGCG CGCCAACTGT ACTGATCGCC TGGTTTGTCT CCGGTTTTAT CTATCAATAG
 
Protein sequence
MTRIRIEKGT GGWGGPLELE AKPGKKIVYI TAGTRPAIVD KLAQLTGWQA IDGFKEGEPA 
EAEIGVAVID CGGTLRCGIY PKRRIPTINI HSTGKSGPLA QYIVEDIYVS GVKEENITVV
GDATPQPSSV GRDYDTSKKI TEQSDGLLAK VGMGMGSAVA VLFQSGRDTI DTVLKTILPF
MAFVSALIGI IMASGLGDWI AHGLAPLASH PLGLVMLALI CSFPLLSPFL GPGAVIAQVI
GVLIGVQIGL GNIPPHLALP ALFAINAQAA CDFIPVGLSL AEARQDTVRV GVPSVLVSRF
LTGAPTVLIA WFVSGFIYQ