Gene Information |
Locus tag | Nther_2049 |
Symbol | |
ID | 6315567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 2165290 |
End bp | 2166411 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642644437 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_001918204 |
Protein GI | 188586659 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
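The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration and assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 2165290
END_BP = 2166411


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1122 373
```

Under these assumptions the results agree with the Gene Length (1122 bp) and Protein Length (373 aa) rows.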
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.679361 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.000000000231729 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGAACCA AAAAAACACA ATTTCCTCAA GATTCTGCCA GAAAGGTTTT AAATGAGTTT TCACCATATA TACCCGGGAA AAGTTTAGAG GAAATTAAAG AAAAATACGG TTTGGATAAG GTGATTAAAT TAGCCAGCAA CGAAAACCCA CACGGACCAT CACCAAAAGC AGTAAAAAAA CTAACCGATA ACAAAGATAT TCACTTGTAT CCCCAGAAAT CATATCAAAA TTTACAGTCC AAGATATCAC AAAAGCTTGG TACAAATCCT GGACAAGTAA TTATTGGTAA TGGTTCGGAT GAAATTATTA AACTACTGGC TGCAGCTTTT ATTAACCCTG GTGAAGAGGG GCTTATGGCT GATATTACTT TCCCTATATA TAAAATGGCA GTGAAAGAAC TTGATGGTAA AGTAACTCAT ATCCCCTTAA AAAAATATAC CCACGATATT GATCAGTTTA TTGCCCAAAT AACAGATAAC ACAAAATTAA TATTTATATG TAACCCAAAT AACCCTACTG GTTCCATCAT AACCCATGAA GAGGCCGAAA AATTATTAAG TAGTGTCAGT AAAGACACTA TAGTAGTCTT TGATGAAGCA TATCGAGAAT ATGTTACAAA TCCTGAATTT CCAAAAACAG AAATGTTAGT AGATAAATAT CCTAATTTAA TTGCTTTAAG AACTTTTTCT AAAATTTACG GTTTAGCTGC TCTAAGAGTT GGTTACGGAA TAGGTAGTGA GAAATTAATT GAAGTCCTTC ACAAGGTTAA ATTACCCTTT AATGTCAACG AACTAGGGTT AAGAGCTGCC CAAGAAGCAC TAGATGATAC AGAACATCTG AATTATAGTA AAGAACAAAA TGATCAGGGT AAAAAATGGC TAGAATCCAA ATTAAAGTCC AGTAAATTTT TCTCCCCAGT ACCAAGTCAG GCCAATTTTT TACTTGTAAA GACTGAATTC GATGCAGAAA AGCTGGCCGG TGAATTATTG AAACAAGGTG TTATAATAAG GGAAGGAACT TCCTTTGGAA TGCCGGACCA TTTTCGGATT ACAATAGGTT CAAAATCAGA TAATGAGTTT TTCATAGAAA AATTAAGTAA TTGCGAGGTG AATTTGAAAT GA
|
Protein sequence | MGTKKTQFPQ DSARKVLNEF SPYIPGKSLE EIKEKYGLDK VIKLASNENP HGPSPKAVKK LTDNKDIHLY PQKSYQNLQS KISQKLGTNP GQVIIGNGSD EIIKLLAAAF INPGEEGLMA DITFPIYKMA VKELDGKVTH IPLKKYTHDI DQFIAQITDN TKLIFICNPN NPTGSIITHE EAEKLLSSVS KDTIVVFDEA YREYVTNPEF PKTEMLVDKY PNLIALRTFS KIYGLAALRV GYGIGSEKLI EVLHKVKLPF NVNELGLRAA QEALDDTEHL NYSKEQNDQG KKWLESKLKS SKFFSPVPSQ ANFLLVKTEF DAEKLAGELL KQGVIIREGT SFGMPDHFRI TIGSKSDNEF FIEKLSNCEV NLK
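The relationship between the two sequence rows can be sketched in code. The example below is a hedged illustration, not the database's own pipeline: it builds a codon table by hand, strips the display spaces, and translates the first 30 nucleotides of the gene sequence. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits, so the standard assignments are used here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-nt groups of the gene sequence row above.
prefix = "ATGGGAACCA AAAAAACACA ATTTCCTCAA"
print(translate(prefix))  # prints: MGTKKTQFPQ
```

The output matches the first ten residues of the protein sequence row. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.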