Gene ECH74115_1480 details

Gene Information

Locus tag: ECH74115_1480
Symbol: ptsG
ID: 6969847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1460549
End bp: 1461982
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 52%
IMG OID: 643385451
Product: PTS system glucose-specific transporter subunits IIBC
Protein accession: YP_002269945
Protein GI: 209398205
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
        [COG1264] Phosphotransferase system IIB components
TIGRFAM ID: [TIGR00826] PTS system, glucose-like IIB component
            [TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component
            [TIGR02002] PTS system, glucose-specific IIBC component
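
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1460549, 1461982)
print(length)                  # 1434
print(protein_length(length))  # 477
```

Both results agree with the Gene Length (1434 bp) and Protein Length (477 aa) fields.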


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000807598
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0000000861418
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTTAAGA ATGCATTTGC TAACCTGCAA AAGGTCGGTA AATCGCTGAT GCTGCCGGTA 
TCCGTACTGC CTATCGCAGG TATTCTGCTG GGCGTCGGTT CCGCGAATTT CAGCTGGCTG
CCCGCCGTTG TATCGCATGT TATGGCAGAA GCAGGCGGTT CCGTCTTTGC AAACATGCCA
CTGATTTTTG CGATCGGTGT CGCCCTCGGC TTTACCAATA ACGATGGCGT ATCCGCGCTG
GCTGCAGTTG TTGCCTATGG CATCATGGTT AAAACCATGG CCGTGGTTGC GCCACTGGTA
CTGCATTTAC CTGCTGAAGA AATCGCCTCT AAACACCTGG CGGATACTGG CGTACTCGGA
GGGATTATCT CCGGTGCGAT CGCAGCGTAC ATGTTTAACC GTTTCTACCG TATTAAGCTG
CCTGAGTATC TTGGCTTCTT TGCCGGTAAA CGCTTTGTGC CGATCATTTC TGGCCTGGCT
GCCATCTTTA CTGGCGTTGT GCTGTCCTTC ATTTGGCCGC CGATTGGTTC TGCAATCCAG
ACCTTCTCTC AGTGGGCTGC TTACCAGAAC CCGGTAGTTG CGTTTGGCAT TTACGGTTTC
ATCGAACGTT GCCTGGTACC GTTTGGTCTG CACCACATCT GGAACGTACC TTTCCAGATG
CAGATTGGTG AATACACCAA CGCAGCAGGT CAGGTTTTCC ACGGCGACAT TCCGCGTTAT
ATGGCGGGTG ACCCGACTGC GGGTAAACTG TCTGGTGGCT TCCTGTTCAA AATGTACGGT
CTGCCAGCTG CCGCAATTGC TATCTGGCAC TCTGCTAAAC CAGAAAACCG CGCGAAAGTG
GGCGGTATTA TGATCTCCGC GGCGCTGACC TCGTTCCTGA CCGGTATCAC CGAGCCGATC
GAGTTCTCCT TCATGTTCGT TGCGCCGATC CTGTACATCA TCCACGCGAT TCTGGCAGGC
CTGGCATTCC CAATCTGTAT TCTTCTGGGG ATGCGTGACG GTACGTCGTT TTCGCACGGT
CTGATCGACT TCATCGTTCT GTCTGGTAAC AGCAGCAAAC TGTGGCTGTT CCCGATCGTC
GGTATCGGTT ATGCGATTGT TTACTACACC ATCTTCCGCG TGCTGATTAA AGCACTGGAT
CTGAAAACGC CGGGTCGTGA AGACGCGACT GAAGATGCAA AAGCGACAGG TACCAGCGAA
ATGGCACCGG CTCTGGTTGC TGCATTTGGT GGTAAAGAAA ACATTACTAA CCTCGACGCA
TGTATTACCC GTCTGCGCGT CAGCGTTGCT GATGTGTCTA AAGTGGATCA GGCTGGCCTG
AAGAAACTGG GCGCAGCGGG CGTAGTGGTT GCTGGTTCTG GTGTTCAGGC GATTTTCGGT
ACTAAATCCG ATAACCTGAA AACCGAGATG GATGAGTACA TCCGTAACCA CTAA
 
Protein sequence
MFKNAFANLQ KVGKSLMLPV SVLPIAGILL GVGSANFSWL PAVVSHVMAE AGGSVFANMP 
LIFAIGVALG FTNNDGVSAL AAVVAYGIMV KTMAVVAPLV LHLPAEEIAS KHLADTGVLG
GIISGAIAAY MFNRFYRIKL PEYLGFFAGK RFVPIISGLA AIFTGVVLSF IWPPIGSAIQ
TFSQWAAYQN PVVAFGIYGF IERCLVPFGL HHIWNVPFQM QIGEYTNAAG QVFHGDIPRY
MAGDPTAGKL SGGFLFKMYG LPAAAIAIWH SAKPENRAKV GGIMISAALT SFLTGITEPI
EFSFMFVAPI LYIIHAILAG LAFPICILLG MRDGTSFSHG LIDFIVLSGN SSKLWLFPIV
GIGYAIVYYT IFRVLIKALD LKTPGREDAT EDAKATGTSE MAPALVAAFG GKENITNLDA
CITRLRVSVA DVSKVDQAGL KKLGAAGVVV AGSGVQAIFG TKSDNLKTEM DEYIRNH
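
The gene and protein sequences above can be cross-checked against each other, and the GC content recomputed, with a short script. This is a minimal sketch, not the database's own method: it assumes the standard amino-acid assignments (which translation table 11 shares with table 1; the two differ in which start codons are permitted), and the helper names clean, translate and gc_content are illustrative. Only the first and last lines of the gene sequence are used here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and final line of the gene sequence in this record.
head = clean("ATGTTTAAGA ATGCATTTGC TAACCTGCAA")
tail = clean("ACTAAATCCG ATAACCTGAA AACCGAGATG GATGAGTACA TCCGTAACCA CTAA")

print(translate(head))  # MFKNAFANLQ
print(translate(tail))  # TKSDNLKTEMDEYIRNH
```

The two outputs match the start and end of the protein sequence listed above. Calling gc_content on the cleaned full gene sequence gives a fraction that can be compared with the GC content field in this record.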