Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5114 |
Symbol | |
ID | 6967792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4756075 |
End bp | 4757691 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643388786 |
Product | PTS system, alpha-glucoside-specific IIBC component |
Protein accession | YP_002273212 |
Protein GI | 209400134 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific [COG1264] Phosphotransferase system IIB components |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component [TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component [TIGR02005] PTS system, alpha-glucoside-specific IIBC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.424783 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAGTC AAATTCAACG CTTTGGCGGC GCGATGTTCA CGCCAGTGCT GCTGTTTCCC TTCGCCGGGA TTGTGGTGGG TCTTGCCATC TTGCTGCAAA ACCCGATGTT TGTCGGGGAA TCACTGACCG ATCCGAACAG TTTATTCGCG CAAATCGTAC ACATTATTGA AGAGGGCGGT TGGACGGTAT TCCGTAATAT GCCGCTGATT TTTGCTGTCG GTTTACCCAT TGGCCTTGCT AAGCAAGCGC AGGGGCGTGC TTGTCTGGCG GTGATGGTGA GTTTCCTGAC CTGGAACTAT TTCATCAATG CGATGGGAAT GACCTGGGGA AGCTACTTCG GCGTCGATTT CACTCAGGAC GCGGTGGCAG GTAGCGGTCT GACAATGATG GCCGGGATTA AAACCCTCGA TACCAGCATT ATCGGCGCAA TTATCATTTC CGGCATTGTG ACGGCGCTGC ATAACCGTCT GTTCGATAAA AAACTGCCGG TTTTTCTCGG CATTTTCCAG GGGACGTCTT ATGTGGTGAT TATCGCCTTC CTGGTGATGA TCCCCTGTGC CTGGCTGACA TTGCTCGGCT GGCCAAAAGT ACAAATGGGG ATTGAATCTC TGCAAGCGTT CCTGCGTTCG GCGGGTGCGC TTGGGGTGTG GGTTTACACC TTCCTCGAAC GTATTCTGAT CCCAACCGGT TTACACCACT TCATCTACGG ACCGTTTATC TTTGGTCCGG CAGCTGTTGA AGGCGGAATT CAGATGTACT GGGCGCAGCA TCTGCAAGAG TTCAGTTTGA GCGCCGAGCC GCTGAAATCG TTGTTCCCGG AAGGAGGTTT TGCCCTGCAC GGTAACTCAA AAATCTTTGG TGCCGTGGGC ATTTCTTTAG CGATGTACTT CACTGCCGCA CCGGAAAATC GGGTAAAAGT GGCGGGTTTG CTGATTCCCG CAACCTTAAC CGCCATGCTG GTGGGAATTA CCGAACCGCT GGAATTTACC TTCCTGTTCA TTTCACCGTT GCTGTTTGCG GTACACGCTG TGCTTGCGGC CTCAATGTCG ACCGTGATGT ATCTCTTTGG TGTGGTGGGC AACATGGGCG GAGGTCTGAT TGACCAGGTT TTACCGCAAA ACTGGATCCC GATGTTCAGC AACCACGCGG ATATGATGCT GACCCAAATC GCCATTGGGT TGTGCTTTAC CCTGCTGTAC TTCGTGGTTT TCCGCACCCT GATTCTGCAA TTCAACATGT GCACGCCGGG ACGTGAAGAT GCGGAAGTGA AACTCTACTC AAAAGCCGAA TACAAAGCCT CGCGAGGCCA AACCACCGCG GCAGAGCCAA AAAAAGAGCT GGATCAGGCT GCCGGTATCC TGCAAGCCCT GGGCGGGGTC GGCAATATCT CCAGCATTAA CAATTGTGCG ACGCGTCTAC GCATTGCACT GCATGACATG TCACAAACGC TGGATGACGA AGTCTTTAAA AAGCTGGGAG CGCACGGCGT CTTCCGTAGT GGCGATGCCA TTCAGGTGAT CATTGGTCTG CATGTATCCC AGCTGCGTGA ACAGCTCGAT AGCTTAATTA ATTCTCATCA ATCAGCAGAA AATGTTGCCA TTACGGAGGC AGTATAA
|
Protein sequence | MLSQIQRFGG AMFTPVLLFP FAGIVVGLAI LLQNPMFVGE SLTDPNSLFA QIVHIIEEGG WTVFRNMPLI FAVGLPIGLA KQAQGRACLA VMVSFLTWNY FINAMGMTWG SYFGVDFTQD AVAGSGLTMM AGIKTLDTSI IGAIIISGIV TALHNRLFDK KLPVFLGIFQ GTSYVVIIAF LVMIPCAWLT LLGWPKVQMG IESLQAFLRS AGALGVWVYT FLERILIPTG LHHFIYGPFI FGPAAVEGGI QMYWAQHLQE FSLSAEPLKS LFPEGGFALH GNSKIFGAVG ISLAMYFTAA PENRVKVAGL LIPATLTAML VGITEPLEFT FLFISPLLFA VHAVLAASMS TVMYLFGVVG NMGGGLIDQV LPQNWIPMFS NHADMMLTQI AIGLCFTLLY FVVFRTLILQ FNMCTPGRED AEVKLYSKAE YKASRGQTTA AEPKKELDQA AGILQALGGV GNISSINNCA TRLRIALHDM SQTLDDEVFK KLGAHGVFRS GDAIQVIIGL HVSQLREQLD SLINSHQSAE NVAITEAV
|
| |