Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0801 |
Symbol | tnaA |
ID | 4240292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 872174 |
End bp | 873589 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638104355 |
Product | tryptophanase |
Protein accession | YP_719011 |
Protein GI | 113460944 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3033] Tryptophanase |
TIGRFAM ID | [TIGR02617] tryptophanase, leader peptide-associated |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000525745 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAATT TCAAACATTT ACCAGAACCG TTCCGTATTC GTGTTATTGA ACCTGTCAAA CGTACAACTC GTGCCTATCG TGATGAAGCT ATCTTGAAAG CAGGTATGAA TCTATTTTTA TTAGACAGTG AAGATATTTT TATTGATCTA TTAACTGATA GTGGTACAGG TGCCGTTACT CAAGATATGC AGGCTGCCAT GCTAAGAGGT GATGAAGCTT ATAGCGGAAG TCGCAGTTAC TATGCTCTTG CCAATGCTGT GAAAGAAATC TTTGGTTATG AATACACAAT TCCAACCCAT CAAGGTCGTG GTGCAGAACA AATTTATATT CCTGTCTTGA TCAAAAAACG TGAGCAAGAA AAAGGGTTGG ATCGAAATAA AATGGTTGTT TTCTCTAACT ATTTTTTTGA CACTACTCAA GGTCATAGCC AATTAAATGG TGCAACTGTA CGTAATGTCT ATATAAAAGA AGCTTTTGAC ACAGATGTAG ATCATGATTT CAAAGGTAAT TTTGATTTAG AAAAACTGGA ACAAGGTATT TTAGAAGTTG GAGCAAATAA TGTTCCTTAC ATTGTATGTA CTATTACCTG TAATTCTGCC GGTGGACAAC CGGTATCTCT TGCCAATATG AAAGCCATGT ACCAAATTGC ACGTAAATAT GATATTCCTG TGATTATGGA TTCGGCTCGC TTCGCTGAAA ATGCCTACTT TATTCAACAA CGTGAAGCAG AATACAAAGA TTGGACTATT GAACAAATTA CTTACGAAAG CTATAAATAT GCAGATGCCT TGGCTATGTC TGCAAAAAAA GATGCAATGG TACCTATGGG GGGACTACTC TGCTTCAAAG ATAATTCAAT GGAAGATGTT TACAACGAGT GTCGTACACT TTGTGTTGTA CAAGAAGGTT TCCCTACCTA TGGTGGTCTA GAAGGTGGTG CAATGGAACG CCTAGCTGTA GGTTTACGTG ATGGTATGCG TCAAGATTGG TTAGCTTATC GTATTAGCCA AATTGAATAC CTTGTACAAG GTTTAGAAAA GATCGGTGTT GTTTGTCAAC AACCTGGGGG ACATGCCGCC TTTGTGGATG CAGGCAAATT ATTACCACAT ATCCCAGCAG AACAATTCCC TGCTCAGGCT CTTGCTTGTG AATTATATAA AGTAGCAGGT ATTCGATCCG TAGAAATCGG TTCTCTCCTA TTAGGACGAG ATCCGAAAAC AGGTCAACAA TTACCTTGCC CGGCTGAATT GTTACGTTTA ACTATTCCTC GTGCGACCTA CACTCAAACA CATATGGACT TCATTATTGA AGCATTCAAA CGAGTGAAAG AGAATGCAAA AAATATCAAA GGTTTAGATT TCACTTATGA ACCTAAAGTA CTGCGTCATT TCACCGCTCG ATTAAAAGAG ATTTAG
|
Protein sequence | MENFKHLPEP FRIRVIEPVK RTTRAYRDEA ILKAGMNLFL LDSEDIFIDL LTDSGTGAVT QDMQAAMLRG DEAYSGSRSY YALANAVKEI FGYEYTIPTH QGRGAEQIYI PVLIKKREQE KGLDRNKMVV FSNYFFDTTQ GHSQLNGATV RNVYIKEAFD TDVDHDFKGN FDLEKLEQGI LEVGANNVPY IVCTITCNSA GGQPVSLANM KAMYQIARKY DIPVIMDSAR FAENAYFIQQ REAEYKDWTI EQITYESYKY ADALAMSAKK DAMVPMGGLL CFKDNSMEDV YNECRTLCVV QEGFPTYGGL EGGAMERLAV GLRDGMRQDW LAYRISQIEY LVQGLEKIGV VCQQPGGHAA FVDAGKLLPH IPAEQFPAQA LACELYKVAG IRSVEIGSLL LGRDPKTGQQ LPCPAELLRL TIPRATYTQT HMDFIIEAFK RVKENAKNIK GLDFTYEPKV LRHFTARLKE I
|
| |