Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2647 |
Symbol | tyrP |
ID | 6970736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2497127 |
End bp | 2498338 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643386510 |
Product | tyrosine-specific transport protein |
Protein accession | YP_002270992 |
Protein GI | 209395924 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | [TIGR00837] aromatic amino acid transport protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.588852 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0000000318595 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAAAAACA GAACCCTGGG AAGTGTTTTT ATCGTGGCGG GAACCACAAT TGGCGCAGGC ATGCTGGCAA TGCCGCTGGC TGCGGCCGGT GTTGGTTTTA GCGTTACGTT AATCTTGTTG ATTGGGCTTT GGGCGTTGAT GTGCTACACG GCGCTATTAC TGCTGGAGGT GTACCAGCAT GTTCCGGCAG ATACCGGTCT GGGCACGCTG GCAAAACGCT ATCTGGGACG CTACGGTCAA TGGCTGACGG GCTTCAGTAT GATGTTCTTA ATGTATGCTC TGACTGCGGC ATACATCAGC GGTGCCGGTG AATTGTTGGC CTCCAGCATC AGCGACTGGA CAGGTATTTC AATGTCGGCA ACCGCTGGCG TGTTGTTGTT CACTTTTGTT GCCGGTGGCG TGGTTTGTGT CGGGACATCG CTGGTCGATT TATTTAACCG TTTTCTGTTC AGCGCCAAAA TTATTTTTCT GGTGGTGATG CTGGTATTGC TGCTACCGCA TATTCACAAA GTGAATCTTT TAACCCTGCC GTTGCAACAG GGGCTGGCAC TGTCTGCAAT CCCGGTGATT TTTACGTCGT TTGGTTTTCA CGGTAGCGTG CCGAGTATTG TCAGCTATAT GGATGGCAAC ATTCGTAAGC TACGCTGGGT GTTTATAACC GGTAGTGCGA TCCCCCTGGT GGCATATATT TTCTGGCAGG TGGCGACGCT TGGCAGCATT GATTCAACAA CCTTTATGGG ATTGCTGGCT AATCATGCCG GATTAAACGG GCTGTTACAG GCGTTACGCG AAATGGTGGC CTCTCCGCAT GTTGAGCTGG CAGTGCATTT ATTTGCTGAT TTAGCCCTTG CCACGTCATT TCTCGGCGTT GCGTTAGGCT TATTTGATTA TCTGGCAGAT TTGTTTCAGC GTTCAAATAC CGTTGGTGGA CGATTGCAAA CTGGAGCAAT TACCTTTCTG CCACCGTTGG CGTTTGCACT GTTTTATCCA CGAGGATTTG TGATGGCGCT GGGTTACGCC GGTGTGGCGC TGGCGGTACT GGCATTGATT ATCCCTTCGC TGTTGACCTG GCAAAGCAGA AAGCACAATC CTCAAGCGGG TTACCGGGTC AAAGGTGGTC GTCCGGCGCT GGTGGTGGTG TTTCTCTGTG GTATTGCTGT GATTGGCGTG CAATTTTTGA TTGCGGCAGG GTTGTTACCA GAAGTGGGGT GA
|
Protein sequence | MKNRTLGSVF IVAGTTIGAG MLAMPLAAAG VGFSVTLILL IGLWALMCYT ALLLLEVYQH VPADTGLGTL AKRYLGRYGQ WLTGFSMMFL MYALTAAYIS GAGELLASSI SDWTGISMSA TAGVLLFTFV AGGVVCVGTS LVDLFNRFLF SAKIIFLVVM LVLLLPHIHK VNLLTLPLQQ GLALSAIPVI FTSFGFHGSV PSIVSYMDGN IRKLRWVFIT GSAIPLVAYI FWQVATLGSI DSTTFMGLLA NHAGLNGLLQ ALREMVASPH VELAVHLFAD LALATSFLGV ALGLFDYLAD LFQRSNTVGG RLQTGAITFL PPLAFALFYP RGFVMALGYA GVALAVLALI IPSLLTWQSR KHNPQAGYRV KGGRPALVVV FLCGIAVIGV QFLIAAGLLP EVG
|
| |