Gene Information |
Locus tag | HS_0780 |
Symbol | tyrP |
ID | 4240271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 833899 |
End bp | 835110 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638104334 |
Product | tyrosine-specific transport protein |
Protein accession | YP_718990 |
Protein GI | 113460923 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | [TIGR00837] aromatic amino acid transport protein |
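
The Start bp, End bp, Gene Length, and Protein Length fields above are tied together by simple arithmetic. The following is a minimal sketch of that relationship. It assumes the coordinates are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag HS_0780.
length = gene_length(833899, 835110)
print(length, protein_length(length))
```

With the values from this record, the sketch yields the listed gene length of 1212 bp and protein length of 403 aa.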
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000892482 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAA CGCTTGGAAG TACGCTTATT GTAGCGGGAA CAACAATTGG CGGTGGCATG CTCGCTATGC CATTAACCTC AGCCGGAATC GGTTTTGGCT TTACACTTAC TTTATTAATC GGGCTTTGGC TATTACTCTC TTTTGCCGCC CTACTTTTCG TTGAGCTATA TCAGACTGTA GACAGTGATG CTGGAATTGG GACTTTGGCT GAAAAATATT ACGGATCTTT CGGACGTATT CTTTCTACTG TTGTACTGTT AATTTTTCTT TACGCTTTAG TTTCAGCTTA CATTTCCGGC GGAAGCTCTT TACTCGCAGG ACTGTTGCCC ACCATTCACG ATGAACAAAC CACATCTCGG ATTGCAGGTG TATTATTTAC TGTACTTTTC GGTATTTTCA TTATTTCAGG CACAAATAGC GTAGATAAGA TTAATCGCAT CATTTTTTTC AGTAAAATTG CTTTCTTTGT TGTAGTGCTT TTCTTATTAC TGCCTAAAGT CTCTTTAGAA AACTTACTCG CACTTCCAAT TGATAATGCC TTAATCATCT CAGCAAGCCC TATTTTCTTT ACCTCTTTCG GCTTTCACGG TTCTATTCCT AGCCTGAATA AATACCTTGA TGGCGATGTA AAGGCATTAC GCGTTTCTAT CTTAGTCGGA ACCTTTATTC CACTTGTTGC TTATATTCTA TGGCAATTAG CCACTCATGG CGTACTAAGC CAAACAGAAT TTTTGGCAAT CTTACAACAA GATCCAACCT TAAATGGATT GGTGACTGCA ACAATTACAA TTACAGGAAG TGAAATTATT GGCGGTGCAG TTCGTCTATT CTCTGCATTT GCATTAATTA CTTCATTCTT AGGTGTTGCC TTAGGTTTGT TCGAAGCGAT TGAAGATCTG CTAAAACGAG TAAATATTTC AGCTAATCGT GTTTCAGTTG GTCTGCTCAC TTTCTTACCG CCACTTGCTT TTGCACTATT CTATCCTCAA GGGTTTATCC TAGCTTTAGG ATATGCAGGA CAAATGTTTG CATTCTATGC AATCGTTTTA CCTATTGCAA TGGTATGGAA ATTACGTAAA ATTTACCCGC ACTTACCGTA TCGAGTAAAA GGTGGCTCAG CCTCCCTTAT CGCTGTATTA GTTATTGGTG TATTTATCGT GATTATTCCA TTCCTAGTTC AAGCAGGTTT ATTACCGAGT GTGGTAGGTT AA
|
Protein sequence | MNKTLGSTLI VAGTTIGGGM LAMPLTSAGI GFGFTLTLLI GLWLLLSFAA LLFVELYQTV DSDAGIGTLA EKYYGSFGRI LSTVVLLIFL YALVSAYISG GSSLLAGLLP TIHDEQTTSR IAGVLFTVLF GIFIISGTNS VDKINRIIFF SKIAFFVVVL FLLLPKVSLE NLLALPIDNA LIISASPIFF TSFGFHGSIP SLNKYLDGDV KALRVSILVG TFIPLVAYIL WQLATHGVLS QTEFLAILQQ DPTLNGLVTA TITITGSEII GGAVRLFSAF ALITSFLGVA LGLFEAIEDL LKRVNISANR VSVGLLTFLP PLAFALFYPQ GFILALGYAG QMFAFYAIVL PIAMVWKLRK IYPHLPYRVK GGSASLIAVL VIGVFIVIIP FLVQAGLLPS VVG
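
The sequences above are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before any downstream use. The following is a minimal sketch of that cleanup, with a simple GC-percentage helper. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, so its GC value is for that fragment and not for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three ten-base blocks of the gene sequence shown in the record.
fragment = clean_sequence("ATGAACAAAA CGCTTGGAAG TACGCTTATT")
print(len(fragment), round(gc_percent(fragment), 1))
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_percent` would allow a comparison against the GC content field listed in the record.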