Gene Nther_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2049 
Symbol 
ID6315567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2165290 
End bp2166411 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content34% 
IMG OID642644437 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_001918204 
Protein GI188586659 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.679361 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.000000000231729 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGAACCA AAAAAACACA ATTTCCTCAA GATTCTGCCA GAAAGGTTTT AAATGAGTTT 
TCACCATATA TACCCGGGAA AAGTTTAGAG GAAATTAAAG AAAAATACGG TTTGGATAAG
GTGATTAAAT TAGCCAGCAA CGAAAACCCA CACGGACCAT CACCAAAAGC AGTAAAAAAA
CTAACCGATA ACAAAGATAT TCACTTGTAT CCCCAGAAAT CATATCAAAA TTTACAGTCC
AAGATATCAC AAAAGCTTGG TACAAATCCT GGACAAGTAA TTATTGGTAA TGGTTCGGAT
GAAATTATTA AACTACTGGC TGCAGCTTTT ATTAACCCTG GTGAAGAGGG GCTTATGGCT
GATATTACTT TCCCTATATA TAAAATGGCA GTGAAAGAAC TTGATGGTAA AGTAACTCAT
ATCCCCTTAA AAAAATATAC CCACGATATT GATCAGTTTA TTGCCCAAAT AACAGATAAC
ACAAAATTAA TATTTATATG TAACCCAAAT AACCCTACTG GTTCCATCAT AACCCATGAA
GAGGCCGAAA AATTATTAAG TAGTGTCAGT AAAGACACTA TAGTAGTCTT TGATGAAGCA
TATCGAGAAT ATGTTACAAA TCCTGAATTT CCAAAAACAG AAATGTTAGT AGATAAATAT
CCTAATTTAA TTGCTTTAAG AACTTTTTCT AAAATTTACG GTTTAGCTGC TCTAAGAGTT
GGTTACGGAA TAGGTAGTGA GAAATTAATT GAAGTCCTTC ACAAGGTTAA ATTACCCTTT
AATGTCAACG AACTAGGGTT AAGAGCTGCC CAAGAAGCAC TAGATGATAC AGAACATCTG
AATTATAGTA AAGAACAAAA TGATCAGGGT AAAAAATGGC TAGAATCCAA ATTAAAGTCC
AGTAAATTTT TCTCCCCAGT ACCAAGTCAG GCCAATTTTT TACTTGTAAA GACTGAATTC
GATGCAGAAA AGCTGGCCGG TGAATTATTG AAACAAGGTG TTATAATAAG GGAAGGAACT
TCCTTTGGAA TGCCGGACCA TTTTCGGATT ACAATAGGTT CAAAATCAGA TAATGAGTTT
TTCATAGAAA AATTAAGTAA TTGCGAGGTG AATTTGAAAT GA
 
Protein sequence
MGTKKTQFPQ DSARKVLNEF SPYIPGKSLE EIKEKYGLDK VIKLASNENP HGPSPKAVKK 
LTDNKDIHLY PQKSYQNLQS KISQKLGTNP GQVIIGNGSD EIIKLLAAAF INPGEEGLMA
DITFPIYKMA VKELDGKVTH IPLKKYTHDI DQFIAQITDN TKLIFICNPN NPTGSIITHE
EAEKLLSSVS KDTIVVFDEA YREYVTNPEF PKTEMLVDKY PNLIALRTFS KIYGLAALRV
GYGIGSEKLI EVLHKVKLPF NVNELGLRAA QEALDDTEHL NYSKEQNDQG KKWLESKLKS
SKFFSPVPSQ ANFLLVKTEF DAEKLAGELL KQGVIIREGT SFGMPDHFRI TIGSKSDNEF
FIEKLSNCEV NLK