Gene Information |
Locus tag | B21_01864 |
Symbol | tyrP |
ID | 8113737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 1934137 |
End bp | 1935348 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644848083 |
Product | hypothetical protein |
Protein accession | YP_002999656 |
Protein GI | 251785352 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | [TIGR00837] aromatic amino acid transport protein |
|
|
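The coordinates and lengths listed under Gene Information are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the reported gene length includes the stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1934137, 1935348)
print(length)                  # 1212
print(protein_length(length))  # 403
```

Both values match the Gene Length (1212 bp) and Protein Length (403 aa) fields of the record.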
Plasmid Coverage Information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00693993 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAACA GAACTCTGGG AAGTGTTTTT ATCGTGGCGG GAACCACAAT TGGCGCAGGC ATGCTGGCAA TGCCGCTGGC TGCGGCCGGT GTTGGTTTTA GCGTTACGTT AATCTTGTTG ATTGGGCTTT GGGCGTTGAT GTGCTACACG GCGCTATTAC TGCTGGAGGT GTACCAGCAT GTTCCGGCAG ATACCGGTCT TGGCACGCTG GCAAAACGCT ATCTGGGACG CTACGGTCAG TGGCTGACGG GCTTCAGTAT GATGTTCTTA ATGTATGCTC TGACTGCGGC ATACATCAGC GGTGCCGGTG AATTGTTGGC CTCCAGCATC AGCGACTGGA CAGGTATTTC AATGTCGGCA ACCGCTGGCG TGCTGTTGTT CACTTTTGTT GCTGGTGGCG TGGTTTGTGT CGGGACATCG CTGGTCGATT TGTTTAACCG TTTTCTGTTC AGCGCCAAAA TTATTTTTCT GGTGGTGATG CTGGTATTGC TGCTACCGCA TATTCACAAA GTGAATCTTT TAACCCTGCC GTTGCAACAG GGGCTGGCTC TGTCCGCAAT CCCGGTGATT TTTACCTCGT TTGGTTTTCA CGGTAGCGTG CCGAGTATTG TCAGCTATAT GGATGGCAAC GTTCGTAAGC TACGCTGGGT GTTTATAACC GGTAGTGCTA TCCCCCTGGT GGCATATATT TTCTGGCAGG TGGCAACGCT TGGCAGCATT GATTCAACAA CCTTTATGGG ATTGCTGGCT AATCATGCTG GATTAAACGG GCTGTTACAG GCGTTACGCG AAATGGTGGC CTCTCCGCAT GTTGAGCTGG CAGTGCATTT ATTTGCTGAT TTAGCCCTCG CCACGTCATT TCTCGGCGTT GCGTTAGGCT TATTTGATTA TCTGGCTGAT TTGTTTCAGC GTTCAAATAC CGTTGGTGGA CGATTGCAAA CTGGTGCAAT TACCTTTCTG CCGCCGTTGG CGTTTGCACT GTTTTATCCA CGAGGATTTG TGATGGCGCT GGGTTACGCC GGTGTGGCGC TGGCGGTACT GGCATTGATT ATCCCTTCAC TATTGACCTG GCAAAGCAGA AAGCACAATC CTCAGGCGGG TTACCGGGTG AAAGGTGGTC GTCCGGCGCT GGTGGTGGTG TTTCTCTGTG GTATTGCTGT GATTGGCGTG CAATTTTTGA TTGCGGCAGG GTTGTTACCA GAAGTGGGGT GA
|
Protein sequence | MKNRTLGSVF IVAGTTIGAG MLAMPLAAAG VGFSVTLILL IGLWALMCYT ALLLLEVYQH VPADTGLGTL AKRYLGRYGQ WLTGFSMMFL MYALTAAYIS GAGELLASSI SDWTGISMSA TAGVLLFTFV AGGVVCVGTS LVDLFNRFLF SAKIIFLVVM LVLLLPHIHK VNLLTLPLQQ GLALSAIPVI FTSFGFHGSV PSIVSYMDGN VRKLRWVFIT GSAIPLVAYI FWQVATLGSI DSTTFMGLLA NHAGLNGLLQ ALREMVASPH VELAVHLFAD LALATSFLGV ALGLFDYLAD LFQRSNTVGG RLQTGAITFL PPLAFALFYP RGFVMALGYA GVALAVLALI IPSLLTWQSR KHNPQAGYRV KGGRPALVVV FLCGIAVIGV QFLIAAGLLP EVG
|
| |
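The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11 (the bacterial code), GTG is one of the permitted alternative start codons, and an initiating codon is read as methionine even though GTG encodes valine at internal positions. The sketch below illustrates this on the first ten codons only; the codon dictionary is deliberately partial and the function name is hypothetical.

```python
# Partial codon table: only the codons needed for this example.
CODONS = {
    "GTG": "V", "AAA": "K", "AAC": "N", "AGA": "R", "ACT": "T",
    "CTG": "L", "GGA": "G", "AGT": "S", "GTT": "V", "TTT": "F",
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds_prefix(dna: str) -> str:
    """Translate the start of a CDS, reading a valid first codon as M."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate_cds_prefix("GTGAAAAACA GAACTCTGGG AAGTGTTTTT"))  # MKNRTLGSVF
```

The result matches the first ten residues of the protein sequence in the record. A full translation would need the complete codon table, which a library such as Biopython provides.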