Gene HS_1166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1166 
SymboltyrP 
ID4240667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1319140 
End bp1320357 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content40% 
IMG OID638104729 
Producttyrosine-specific transport protein 
Protein accessionYP_719378 
Protein GI113461309 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID[TIGR00837] aromatic amino acid transport protein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATA AAATTCTCGG CAGTGCATTA ATGATCGCCG GCACAACTAT AGGTGCGGGC 
ATGTTGGCTA TGCCATTGAC CTCAGCCGGC ATGGGGTTTT CCATGACGTT AGTGCTACTT
GCAGGATTAT GGTTATTATT GACCTATACC GGCTTGTTGT TTATGGAAGT TTATCAAACT
GCAAAAGAAA AAGATGTAGG CGTTGCAACT CTTGCAGAAC AGTATTTCGG TTTAACCGGA
CGGTTTTTGG CGACTATTAG CTTGCTCGTT CTACTCTATG CCTTACTTGC CGCTTATATT
ACCGGTGGCG GCTCTCTTTT ATCAGGCCTT TTAACGGGTA TTGGTGATGA AAAACAAACT
TCTCAAATCT CAATATTACT CTTTACCCTA ATTTTAGGTG CATTTGTTGT GATTGGTATT
AAAGGGGTTG ACGGCTTAAC CCGTCTCTTA TTTATCGGAA AAATCATTGG CTTTATTGCG
GTCTTATTAA TGATGTTACC AAAGGCAAAA TTAGAAAACC TAGGGGCAAT TCCTTTAGAT
AATCTATTGG TTATTTCTGC AATTCCAATT TTCTTTACCT CTTTTGGCTT CCACGTCATT
ATGGGGAGTA TCAATAGCTA CCTTGATGCA GATATTCGTA AAATCCGTTT AGCCATTATT
ATTGGAACGT TAATTCCATT ATTTGCCTAT TTATTATGGC AATTCGCAAC ACATGGCGTA
TTAAGTCAAA CTCAATTTGT TAGCCTATTA AAACAAGATC CGACCCTAAA TGGATTAGTA
AAAGCGACTA GTCAAATTAC TGGAAGTAGC CTATTAGGTG AGGTAGTTCG CCTATTCTCA
TCTCTTGCAT TAATTACTTC TTTCTTAGGT GTTGCAATGG GAATTTTTGA AGGGGTTGGT
GATCTATTAA AACGTATTAA TCTGCCGACA AATCGCCCGA TACTGGCACT TTTAACTTTT
ATTCCGCCAC TAGCCTTTTC TCTATTCTAT CCAAACGGTT TTATCACCGC ACTAGGTTAT
GCCGGCATTT TATTTGCCTT CTATGGGCTA CTTCTTCCTG TTGCTTTAGC TTGGAAAGCT
CGTCAGTTGC ATCCCAACCT ACCTTATAGA GTCGCTGGCG GTAATCTTGC ATTAGTCATC
GCATTTATTA TGGGCATCGT TATTATCGTT ATCCCGTTCT TAATGCAAGA AGGTATCCTT
CCTACTGTTG CCGGATAA
 
Protein sequence
MKNKILGSAL MIAGTTIGAG MLAMPLTSAG MGFSMTLVLL AGLWLLLTYT GLLFMEVYQT 
AKEKDVGVAT LAEQYFGLTG RFLATISLLV LLYALLAAYI TGGGSLLSGL LTGIGDEKQT
SQISILLFTL ILGAFVVIGI KGVDGLTRLL FIGKIIGFIA VLLMMLPKAK LENLGAIPLD
NLLVISAIPI FFTSFGFHVI MGSINSYLDA DIRKIRLAII IGTLIPLFAY LLWQFATHGV
LSQTQFVSLL KQDPTLNGLV KATSQITGSS LLGEVVRLFS SLALITSFLG VAMGIFEGVG
DLLKRINLPT NRPILALLTF IPPLAFSLFY PNGFITALGY AGILFAFYGL LLPVALAWKA
RQLHPNLPYR VAGGNLALVI AFIMGIVIIV IPFLMQEGIL PTVAG