Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1034 |
Symbol | tyrP |
ID | 6272289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 951951 |
End bp | 953162 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641725176 |
Product | tyrosine-specific transport protein |
Protein accession | YP_001879698 |
Protein GI | 187734006 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | [TIGR00837] aromatic amino acid transport protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000000311327 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAACA GAACTCTGGG AAGTGTTTTT ATCGTGGCGG GAACCACAAT TGGCGCAGGC ATGCTGGCAA TGCCGCTGGC TGCGGCAGGT GTTGGTTTTA GCGTCACGTT AATCTTGTTG ATTGGGCTTT GGGCGTTGAT GTGCTACACG GCGCTATTAC TGCTGGAGGT GTACCAGCAT GTTCCGGCAG ATACCGGTCT GGGCACGCTG GCAAAACGCT ATCTGGGACG CTACGGTCAA TGGCTGACGG GCTTCAGTAT GATGTTCTTA ATGTATGCTC TGACTGCGGC ATACATCAGC GGTGCCGGTG AATTGTTGGC CTCCAGCATC AGCGACTGGA CAGGTATTTC AATGTCGGCA ACCGCTGGCG TGCTGTTGTT CACTTTTGTT GCCGGTGGCG TGGTTTGTGT TGGAACATCG CTGGTCGATT TATTTAACCG TTTTCTGTTC AGCGCCAAAA TTATTTTTCT GGTGGTGATG CTGGTATTGC TGCTACCGCA TATTCACAAA GTGAATCTTT TAACCCTGCC GTTGCAACAG GGGCTGGCTC TGTCAGCAAT CCCGGTGATT TTTACCTCAT TTGGTTTTCA CGGTAGCGTG CCGAGTATTG TCAGCTATAT GGATGGCAAC GTTCGTAAGC TACGCAGGGT GTTTATAACC GGTAGTGCGA TCCCCCTGGT GGCATATATT TTCTGGCAGG TGGCAACGCT TGGCAGCATT GATTCAACAA CCTTTATGGG ATTGCTGGCT AATCATGCTG GATTAAACGG GCTGTTACAG GCGTTACGCG AAATGGTGGC CTCTCCGCAT GTTGAGCTGG CAGTGCATTT ATTTGCTGAT TTAGCCCTCG CCACGTCATT TCTCGGCGTT GCGTTAGGCT TATTTGATTA TCTGGCTGAT TTATTTCAGC GTTCAAATAC CGTTGGTGGA CGGTTGCAAA CTGGTGCAAT TACCTTTCTG CCGCCGTTGG CGTTTGCACT GTTTTATCCA CGAGGATTTG TGATGGCGCT GGGTTACGCC GGTGTGGCGC TGGCGGTACT GGCATTGATT ATCCCTTCGC TGTTGACCTG GCAAAGCAGA AAGCACAATC CTCAGGCGGG TTACCGGGTC AAAGGTGGTC GTCCGGCGCT GGTGGTGGTG TTTCTCTGTG GTATTGCTGT GATTGGCGTG CAATTTTTGA TTGCGGCAGG GTTGTTACCA GAAGTGGGGT GA
|
Protein sequence | MKNRTLGSVF IVAGTTIGAG MLAMPLAAAG VGFSVTLILL IGLWALMCYT ALLLLEVYQH VPADTGLGTL AKRYLGRYGQ WLTGFSMMFL MYALTAAYIS GAGELLASSI SDWTGISMSA TAGVLLFTFV AGGVVCVGTS LVDLFNRFLF SAKIIFLVVM LVLLLPHIHK VNLLTLPLQQ GLALSAIPVI FTSFGFHGSV PSIVSYMDGN VRKLRRVFIT GSAIPLVAYI FWQVATLGSI DSTTFMGLLA NHAGLNGLLQ ALREMVASPH VELAVHLFAD LALATSFLGV ALGLFDYLAD LFQRSNTVGG RLQTGAITFL PPLAFALFYP RGFVMALGYA GVALAVLALI IPSLLTWQSR KHNPQAGYRV KGGRPALVVV FLCGIAVIGV QFLIAAGLLP EVG
|
| |