Gene HS_0801 details

Gene Information

Locus tag: HS_0801
Symbol: tnaA
ID: 4240292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 872174
End bp: 873589
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 38%
IMG OID: 638104355
Product: tryptophanase
Protein accession: YP_719011
Protein GI: 113460944
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3033] Tryptophanase
TIGRFAM ID: [TIGR02617] tryptophanase, leader peptide-associated
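
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 872174
end_bp = 873589

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in exactly one stop codon, which encodes no residue.
stop_codons = 1
protein_length_aa = gene_length_bp // 3 - stop_codons

print(gene_length_bp)     # compare with the "Gene Length" field
print(protein_length_aa)  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the Gene Length (1416 bp) and Protein Length (471 aa) fields.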


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000525745
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAATT TCAAACATTT ACCAGAACCG TTCCGTATTC GTGTTATTGA ACCTGTCAAA 
CGTACAACTC GTGCCTATCG TGATGAAGCT ATCTTGAAAG CAGGTATGAA TCTATTTTTA
TTAGACAGTG AAGATATTTT TATTGATCTA TTAACTGATA GTGGTACAGG TGCCGTTACT
CAAGATATGC AGGCTGCCAT GCTAAGAGGT GATGAAGCTT ATAGCGGAAG TCGCAGTTAC
TATGCTCTTG CCAATGCTGT GAAAGAAATC TTTGGTTATG AATACACAAT TCCAACCCAT
CAAGGTCGTG GTGCAGAACA AATTTATATT CCTGTCTTGA TCAAAAAACG TGAGCAAGAA
AAAGGGTTGG ATCGAAATAA AATGGTTGTT TTCTCTAACT ATTTTTTTGA CACTACTCAA
GGTCATAGCC AATTAAATGG TGCAACTGTA CGTAATGTCT ATATAAAAGA AGCTTTTGAC
ACAGATGTAG ATCATGATTT CAAAGGTAAT TTTGATTTAG AAAAACTGGA ACAAGGTATT
TTAGAAGTTG GAGCAAATAA TGTTCCTTAC ATTGTATGTA CTATTACCTG TAATTCTGCC
GGTGGACAAC CGGTATCTCT TGCCAATATG AAAGCCATGT ACCAAATTGC ACGTAAATAT
GATATTCCTG TGATTATGGA TTCGGCTCGC TTCGCTGAAA ATGCCTACTT TATTCAACAA
CGTGAAGCAG AATACAAAGA TTGGACTATT GAACAAATTA CTTACGAAAG CTATAAATAT
GCAGATGCCT TGGCTATGTC TGCAAAAAAA GATGCAATGG TACCTATGGG GGGACTACTC
TGCTTCAAAG ATAATTCAAT GGAAGATGTT TACAACGAGT GTCGTACACT TTGTGTTGTA
CAAGAAGGTT TCCCTACCTA TGGTGGTCTA GAAGGTGGTG CAATGGAACG CCTAGCTGTA
GGTTTACGTG ATGGTATGCG TCAAGATTGG TTAGCTTATC GTATTAGCCA AATTGAATAC
CTTGTACAAG GTTTAGAAAA GATCGGTGTT GTTTGTCAAC AACCTGGGGG ACATGCCGCC
TTTGTGGATG CAGGCAAATT ATTACCACAT ATCCCAGCAG AACAATTCCC TGCTCAGGCT
CTTGCTTGTG AATTATATAA AGTAGCAGGT ATTCGATCCG TAGAAATCGG TTCTCTCCTA
TTAGGACGAG ATCCGAAAAC AGGTCAACAA TTACCTTGCC CGGCTGAATT GTTACGTTTA
ACTATTCCTC GTGCGACCTA CACTCAAACA CATATGGACT TCATTATTGA AGCATTCAAA
CGAGTGAAAG AGAATGCAAA AAATATCAAA GGTTTAGATT TCACTTATGA ACCTAAAGTA
CTGCGTCATT TCACCGCTCG ATTAAAAGAG ATTTAG
 
Protein sequence
MENFKHLPEP FRIRVIEPVK RTTRAYRDEA ILKAGMNLFL LDSEDIFIDL LTDSGTGAVT 
QDMQAAMLRG DEAYSGSRSY YALANAVKEI FGYEYTIPTH QGRGAEQIYI PVLIKKREQE
KGLDRNKMVV FSNYFFDTTQ GHSQLNGATV RNVYIKEAFD TDVDHDFKGN FDLEKLEQGI
LEVGANNVPY IVCTITCNSA GGQPVSLANM KAMYQIARKY DIPVIMDSAR FAENAYFIQQ
REAEYKDWTI EQITYESYKY ADALAMSAKK DAMVPMGGLL CFKDNSMEDV YNECRTLCVV
QEGFPTYGGL EGGAMERLAV GLRDGMRQDW LAYRISQIEY LVQGLEKIGV VCQQPGGHAA
FVDAGKLLPH IPAEQFPAQA LACELYKVAG IRSVEIGSLL LGRDPKTGQQ LPCPAELLRL
TIPRATYTQT HMDFIIEAFK RVKENAKNIK GLDFTYEPKV LRHFTARLKE I