Gene Information |
Locus tag | ECH74115_4825 |
Symbol | |
ID | 6970663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4458469 |
End bp | 4459827 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643388517 |
Product | PEP-dependent sugar transporting PTS family, IIC component |
Protein accession | YP_002272945 |
Protein GI | 209397224 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3775] Phosphotransferase system, galactitol-specific IIC component |
TIGRFAM ID | [TIGR00827] PTS system, galactitol-specific IIC component |
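
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the stop codon while the protein length does not; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4458469, 4459827)
    print(length)                  # 1359, matching the Gene Length field
    print(protein_length(length))  # 452, matching the Protein Length field
```

With the values in this record, both results agree with the listed Gene Length (1359 bp) and Protein Length (452 aa).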
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATA TCGCGCATAC CCTCTATAAT ATTGTGCAAT ATATATTGGG ATTTGGCCCG ACGGTAATGT TGCCGTTGGT GTTATTTATT CTTGCTCTCT GTTTTAAAGT AAAACCCGCT AAAGCCTTAC GTTCGTCATT AACAGTCGGC ATTGGTTTTG TCGGTATTTA TGCCATTTTC GATATTCTGA CCAGCAATGT CGGGCCAGCG GCCCAGGCGA TGGTTGAACG CACCGGAATT AATTTACCGG TGGTAGATTT AGGCTGGCCG CCGCTTTCCG CTATTACATG GGGTTCGCCA ATTGCCCCGT TTGTTATTCC CCTGACCATT CTGATTAACG TGGCGATGCT GGCGTTAAAT AAAACCCGTA CCGTTGACGT AGATATGTGG AACTACTGGC ATTTTGCCCT TGCTGGTACG TTGGTTTATT ACAGCACCGG CAGCCTGTTC TTTGGTTTGC TGGCGGCGGC GATTGCTGCG GTGGTAGTAC TTAAACTCGC CGACTGGTCT GCGCCACTGG TACAAAAATA CTTTGGACTG GAAGGGATCT CATTGCCGAC GCTCTCTTCG GTGGTGTTCT TCCCGGTCGG TCTGCTGGTC GACAAAATCA TCGACCATAT CCCTGGCCTC AATCGTATTC ATATCGACCC GGAAACCGTA CAGAAAAAGT TTGGCATCTT CGGCGAACCG ATGATGGTTG GCACTATTCT GGGCATTCTG CTCGGCGTAA TTGCCGGATA CGATTTCAAA AAAGTATTGC TGCTTGGCAT CAGCATTGGC GGTGTGATGT TCATCCTGCC ACGCATGGTA CGCATCCTGA TGGAAGGTTT ATTACCGCTG TCTGAAGCTA TTAAAAAGTA TCTCAATGCC AAATACCCTG ACCGTGACGA TCTCTATATC GGCCTGGATA TCGCCGTTGC CGTAGGTAAT CCGGCGATTA TCTCCACCGC CCTGCTGCTA ACGCCAATCT CGGTCTTTAT CGCGTTTGTC CTTCCGGGTA ATGAAGTCCT GCCGCTTGGC GACCTTGCCA ACCTGGCGGT AATGGCGTCG ATGATTGCTT TAGCCAGCCG TGGCAATATT TTCCGCACCG TTCTGGCGGC GATCCCGGTG ATTATTGCCG ACCTATGGAT TGCTACCAAA ATCGCGCCGT TTATTACCGG AATGGCGAAA GACGTTAACT TCAAATTTGC CGAAGGCTCC AGCGGCCAGG TTTCCAGTTT TCTTGATGGC GGTAACCCGT TCCGCTTCTG GCTGCTGGAA ATCTTCAACG GCAATCTCAT CGCCATTGGT CTGGTGCCGG TTATCGCCCT GGTACTGTAT GGCATTTTCC GAATGACGCG GAGCACGGTT TATGCCTGA
|
Protein sequence | MNDIAHTLYN IVQYILGFGP TVMLPLVLFI LALCFKVKPA KALRSSLTVG IGFVGIYAIF DILTSNVGPA AQAMVERTGI NLPVVDLGWP PLSAITWGSP IAPFVIPLTI LINVAMLALN KTRTVDVDMW NYWHFALAGT LVYYSTGSLF FGLLAAAIAA VVVLKLADWS APLVQKYFGL EGISLPTLSS VVFFPVGLLV DKIIDHIPGL NRIHIDPETV QKKFGIFGEP MMVGTILGIL LGVIAGYDFK KVLLLGISIG GVMFILPRMV RILMEGLLPL SEAIKKYLNA KYPDRDDLYI GLDIAVAVGN PAIISTALLL TPISVFIAFV LPGNEVLPLG DLANLAVMAS MIALASRGNI FRTVLAAIPV IIADLWIATK IAPFITGMAK DVNFKFAEGS SGQVSSFLDG GNPFRFWLLE IFNGNLIAIG LVPVIALVLY GIFRMTRSTV YA
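
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, plus a GC percentage and a rough completeness check. The helper names are hypothetical. The start-codon check accepts only ATG, which is narrower than translation table 11 (it also permits alternative start codons such as GTG and TTG).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace between the printed blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, ATG start (simplification), recognized stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Paste the full "Gene sequence" field here; only its first two blocks are shown.
    raw = "ATGAACGATA TCGCGCATAC"
    seq = clean_sequence(raw)
    print(len(seq), round(gc_percent(seq), 1), looks_like_complete_cds(seq))
```

Applied to the full gene sequence, `len(seq)` should equal the Gene Length field, and the rounded GC percentage can be compared with the GC content field.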