Gene ECH74115_5114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5114 
Symbol 
ID6967792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4756075 
End bp4757691 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content52% 
IMG OID643388786 
ProductPTS system, alpha-glucoside-specific IIBC component 
Protein accessionYP_002273212 
Protein GI209400134 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component
[TIGR02005] PTS system, alpha-glucoside-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.424783 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAGTC AAATTCAACG CTTTGGCGGC GCGATGTTCA CGCCAGTGCT GCTGTTTCCC 
TTCGCCGGGA TTGTGGTGGG TCTTGCCATC TTGCTGCAAA ACCCGATGTT TGTCGGGGAA
TCACTGACCG ATCCGAACAG TTTATTCGCG CAAATCGTAC ACATTATTGA AGAGGGCGGT
TGGACGGTAT TCCGTAATAT GCCGCTGATT TTTGCTGTCG GTTTACCCAT TGGCCTTGCT
AAGCAAGCGC AGGGGCGTGC TTGTCTGGCG GTGATGGTGA GTTTCCTGAC CTGGAACTAT
TTCATCAATG CGATGGGAAT GACCTGGGGA AGCTACTTCG GCGTCGATTT CACTCAGGAC
GCGGTGGCAG GTAGCGGTCT GACAATGATG GCCGGGATTA AAACCCTCGA TACCAGCATT
ATCGGCGCAA TTATCATTTC CGGCATTGTG ACGGCGCTGC ATAACCGTCT GTTCGATAAA
AAACTGCCGG TTTTTCTCGG CATTTTCCAG GGGACGTCTT ATGTGGTGAT TATCGCCTTC
CTGGTGATGA TCCCCTGTGC CTGGCTGACA TTGCTCGGCT GGCCAAAAGT ACAAATGGGG
ATTGAATCTC TGCAAGCGTT CCTGCGTTCG GCGGGTGCGC TTGGGGTGTG GGTTTACACC
TTCCTCGAAC GTATTCTGAT CCCAACCGGT TTACACCACT TCATCTACGG ACCGTTTATC
TTTGGTCCGG CAGCTGTTGA AGGCGGAATT CAGATGTACT GGGCGCAGCA TCTGCAAGAG
TTCAGTTTGA GCGCCGAGCC GCTGAAATCG TTGTTCCCGG AAGGAGGTTT TGCCCTGCAC
GGTAACTCAA AAATCTTTGG TGCCGTGGGC ATTTCTTTAG CGATGTACTT CACTGCCGCA
CCGGAAAATC GGGTAAAAGT GGCGGGTTTG CTGATTCCCG CAACCTTAAC CGCCATGCTG
GTGGGAATTA CCGAACCGCT GGAATTTACC TTCCTGTTCA TTTCACCGTT GCTGTTTGCG
GTACACGCTG TGCTTGCGGC CTCAATGTCG ACCGTGATGT ATCTCTTTGG TGTGGTGGGC
AACATGGGCG GAGGTCTGAT TGACCAGGTT TTACCGCAAA ACTGGATCCC GATGTTCAGC
AACCACGCGG ATATGATGCT GACCCAAATC GCCATTGGGT TGTGCTTTAC CCTGCTGTAC
TTCGTGGTTT TCCGCACCCT GATTCTGCAA TTCAACATGT GCACGCCGGG ACGTGAAGAT
GCGGAAGTGA AACTCTACTC AAAAGCCGAA TACAAAGCCT CGCGAGGCCA AACCACCGCG
GCAGAGCCAA AAAAAGAGCT GGATCAGGCT GCCGGTATCC TGCAAGCCCT GGGCGGGGTC
GGCAATATCT CCAGCATTAA CAATTGTGCG ACGCGTCTAC GCATTGCACT GCATGACATG
TCACAAACGC TGGATGACGA AGTCTTTAAA AAGCTGGGAG CGCACGGCGT CTTCCGTAGT
GGCGATGCCA TTCAGGTGAT CATTGGTCTG CATGTATCCC AGCTGCGTGA ACAGCTCGAT
AGCTTAATTA ATTCTCATCA ATCAGCAGAA AATGTTGCCA TTACGGAGGC AGTATAA
 
Protein sequence
MLSQIQRFGG AMFTPVLLFP FAGIVVGLAI LLQNPMFVGE SLTDPNSLFA QIVHIIEEGG 
WTVFRNMPLI FAVGLPIGLA KQAQGRACLA VMVSFLTWNY FINAMGMTWG SYFGVDFTQD
AVAGSGLTMM AGIKTLDTSI IGAIIISGIV TALHNRLFDK KLPVFLGIFQ GTSYVVIIAF
LVMIPCAWLT LLGWPKVQMG IESLQAFLRS AGALGVWVYT FLERILIPTG LHHFIYGPFI
FGPAAVEGGI QMYWAQHLQE FSLSAEPLKS LFPEGGFALH GNSKIFGAVG ISLAMYFTAA
PENRVKVAGL LIPATLTAML VGITEPLEFT FLFISPLLFA VHAVLAASMS TVMYLFGVVG
NMGGGLIDQV LPQNWIPMFS NHADMMLTQI AIGLCFTLLY FVVFRTLILQ FNMCTPGRED
AEVKLYSKAE YKASRGQTTA AEPKKELDQA AGILQALGGV GNISSINNCA TRLRIALHDM
SQTLDDEVFK KLGAHGVFRS GDAIQVIIGL HVSQLREQLD SLINSHQSAE NVAITEAV