Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1166 |
Symbol | tyrP |
ID | 4240667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1319140 |
End bp | 1320357 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638104729 |
Product | tyrosine-specific transport protein |
Protein accession | YP_719378 |
Protein GI | 113461309 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | [TIGR00837] aromatic amino acid transport protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATA AAATTCTCGG CAGTGCATTA ATGATCGCCG GCACAACTAT AGGTGCGGGC ATGTTGGCTA TGCCATTGAC CTCAGCCGGC ATGGGGTTTT CCATGACGTT AGTGCTACTT GCAGGATTAT GGTTATTATT GACCTATACC GGCTTGTTGT TTATGGAAGT TTATCAAACT GCAAAAGAAA AAGATGTAGG CGTTGCAACT CTTGCAGAAC AGTATTTCGG TTTAACCGGA CGGTTTTTGG CGACTATTAG CTTGCTCGTT CTACTCTATG CCTTACTTGC CGCTTATATT ACCGGTGGCG GCTCTCTTTT ATCAGGCCTT TTAACGGGTA TTGGTGATGA AAAACAAACT TCTCAAATCT CAATATTACT CTTTACCCTA ATTTTAGGTG CATTTGTTGT GATTGGTATT AAAGGGGTTG ACGGCTTAAC CCGTCTCTTA TTTATCGGAA AAATCATTGG CTTTATTGCG GTCTTATTAA TGATGTTACC AAAGGCAAAA TTAGAAAACC TAGGGGCAAT TCCTTTAGAT AATCTATTGG TTATTTCTGC AATTCCAATT TTCTTTACCT CTTTTGGCTT CCACGTCATT ATGGGGAGTA TCAATAGCTA CCTTGATGCA GATATTCGTA AAATCCGTTT AGCCATTATT ATTGGAACGT TAATTCCATT ATTTGCCTAT TTATTATGGC AATTCGCAAC ACATGGCGTA TTAAGTCAAA CTCAATTTGT TAGCCTATTA AAACAAGATC CGACCCTAAA TGGATTAGTA AAAGCGACTA GTCAAATTAC TGGAAGTAGC CTATTAGGTG AGGTAGTTCG CCTATTCTCA TCTCTTGCAT TAATTACTTC TTTCTTAGGT GTTGCAATGG GAATTTTTGA AGGGGTTGGT GATCTATTAA AACGTATTAA TCTGCCGACA AATCGCCCGA TACTGGCACT TTTAACTTTT ATTCCGCCAC TAGCCTTTTC TCTATTCTAT CCAAACGGTT TTATCACCGC ACTAGGTTAT GCCGGCATTT TATTTGCCTT CTATGGGCTA CTTCTTCCTG TTGCTTTAGC TTGGAAAGCT CGTCAGTTGC ATCCCAACCT ACCTTATAGA GTCGCTGGCG GTAATCTTGC ATTAGTCATC GCATTTATTA TGGGCATCGT TATTATCGTT ATCCCGTTCT TAATGCAAGA AGGTATCCTT CCTACTGTTG CCGGATAA
|
Protein sequence | MKNKILGSAL MIAGTTIGAG MLAMPLTSAG MGFSMTLVLL AGLWLLLTYT GLLFMEVYQT AKEKDVGVAT LAEQYFGLTG RFLATISLLV LLYALLAAYI TGGGSLLSGL LTGIGDEKQT SQISILLFTL ILGAFVVIGI KGVDGLTRLL FIGKIIGFIA VLLMMLPKAK LENLGAIPLD NLLVISAIPI FFTSFGFHVI MGSINSYLDA DIRKIRLAII IGTLIPLFAY LLWQFATHGV LSQTQFVSLL KQDPTLNGLV KATSQITGSS LLGEVVRLFS SLALITSFLG VAMGIFEGVG DLLKRINLPT NRPILALLTF IPPLAFSLFY PNGFITALGY AGILFAFYGL LLPVALAWKA RQLHPNLPYR VAGGNLALVI AFIMGIVIIV IPFLMQEGIL PTVAG
|
| |