Gene EcSMS35_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2026 
SymbolptsG 
ID6147185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2047225 
End bp2048658 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content53% 
IMG OID641616902 
ProductPTS system glucose-specific transporter subunits IIBC 
Protein accessionYP_001744078 
Protein GI170681251 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component
[TIGR02002] PTS system, glucose-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000829082 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.00000105555 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTTAAGA ATGCATTTGC TAACCTGCAA AAGGTCGGTA AATCGCTGAT GCTGCCGGTA 
TCCGTACTGC CTATCGCAGG TATTCTGCTG GGCGTCGGTT CCGCGAATTT CAGCTGGCTG
CCCGCCGTTG TATCGCATGT TATGGCAGAA GCAGGCGGTT CCGTCTTTGC AAACATGCCA
CTGATTTTTG CGATCGGTGT CGCCCTCGGC TTTACCAATA ACGATGGCGT ATCCGCGCTG
GCCGCAGTTG TTGCCTATGG CATCATGGTT AAAACCATGG CCGTGGTTGC GCCACTGGTA
CTGCATTTAC CTGCTGAAGA AATTGCCTCT AAACACCTGG CGGATACTGG CGTACTCGGG
GGGATTATCT CCGGTGCGAT CGCAGCGTAC ATGTTTAACC GTTTCTACCG TATTAAGCTG
CCTGAGTATC TTGGCTTCTT TGCCGGTAAA CGCTTTGTGC CGATCATTTC TGGCCTGGCT
GCCATCTTTA CTGGCGTTGT GCTGTCCTTC ATTTGGCCGC CGATTGGTTC TGCAATCCAG
ACCTTCTCTC AGTGGGCTGC TTACCAGAAC CCGGTAGTTG CGTTTGGCAT TTACGGTTTC
ATCGAACGTT GCCTGGTACC GTTTGGTCTG CACCACATCT GGAACGTACC TTTCCAGATG
CAAATTGGTG AATACACCAA CGCAGCAGGT CAGGTTTTCC ACGGCGACAT TCCGCGTTAT
ATGGCGGGTG ACCCGACTGC GGGTAAACTG TCTGGTGGCT TCCTGTTCAA AATGTACGGT
CTGCCAGCTG CCGCAATTGC TATCTGGCAC TCTGCTAAAC CAGAAAACCG CGCGAAAGTG
GGCGGTATTA TGATCTCCGC GGCGCTGACC TCGTTCCTGA CCGGTATCAC CGAGCCGATC
GAGTTCTCCT TCATGTTCGT TGCGCCGATC CTGTACATCA TCCACGCGAT TCTGGCAGGC
CTGGCATTCC CAATCTGTAT TCTTTTGGGG ATGCGTGACG GTACGTCGTT CTCGCACGGT
CTGATCGACT TCATCGTTCT GTCTGGTAAC AGCAGCAAAC TGTGGCTGTT CCCGATCGTC
GGTATCGGTT ATGCGATTGT TTACTACACC ATCTTCCGCG TGCTGATTAA AGCACTGGAT
CTGAAAACGC CGGGTCGTGA AGACGCGACT GAAGACGCAA AAGCGACAGG TACCAGCGAA
ATGGCACCGG CTCTGGTTGC TGCATTTGGT GGTAAAGAAA ACATTACTAA CCTCGACGCA
TGTATTACCC GTCTGCGCGT CAGCGTTGCT GATGTGTCTA AAGTGGATCA GGCTGGCCTG
AAGAAACTGG GCGCAGCGGG CGTAGTGGTT GCTGGTTCTG GTGTTCAGGC GATTTTCGGT
ACTAAATCCG ATAACCTGAA AACCGAGATG GATGAGTACA TCCGTAACCA CTAA
 
Protein sequence
MFKNAFANLQ KVGKSLMLPV SVLPIAGILL GVGSANFSWL PAVVSHVMAE AGGSVFANMP 
LIFAIGVALG FTNNDGVSAL AAVVAYGIMV KTMAVVAPLV LHLPAEEIAS KHLADTGVLG
GIISGAIAAY MFNRFYRIKL PEYLGFFAGK RFVPIISGLA AIFTGVVLSF IWPPIGSAIQ
TFSQWAAYQN PVVAFGIYGF IERCLVPFGL HHIWNVPFQM QIGEYTNAAG QVFHGDIPRY
MAGDPTAGKL SGGFLFKMYG LPAAAIAIWH SAKPENRAKV GGIMISAALT SFLTGITEPI
EFSFMFVAPI LYIIHAILAG LAFPICILLG MRDGTSFSHG LIDFIVLSGN SSKLWLFPIV
GIGYAIVYYT IFRVLIKALD LKTPGREDAT EDAKATGTSE MAPALVAAFG GKENITNLDA
CITRLRVSVA DVSKVDQAGL KKLGAAGVVV AGSGVQAIFG TKSDNLKTEM DEYIRNH