Gene Tpet_1709 details

Gene Information

Locus tag: Tpet_1709
Symbol: hisD
ID: 5171584
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 1708185
End bp: 1709471
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 50%
IMG OID: 640564235
Product: histidinol dehydrogenase
Protein accession: YP_001245290
Protein GI: 148270830
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
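
The start and end coordinates, the gene length, and the protein length listed in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1708185
end_bp = 1709471

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not
# counted in the reported protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1287 428
```

Under these assumptions the result agrees with the listed 1287 bp and 428 aa.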


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTTGA TGAACAACCC CGGAGATAAA GAAGTGCTCA GACTGTTGAA ACAGAGGATG 
GAATCCGTAT CGCAGGTGGA AGAGGCCGTA AAAGAGATCA TCAGGAGAGT GAAAGAAGAA
GGAGACAGAG CCCTCGAAGA ATTCCTCAAA AGGTTTGAGA AGCACCCGGT GGGCATCGAG
AACCTTCGTG TCACAAAGAA GGAAATATCG GAAGCCCAAG TAGAAGAAGA ATTCGTTGAA
ACGATAAAAA TGGTGATAGA AGACCTGAAA GAATTCCATC GAAGACAGGA GGAAAGATCT
TTCTTTTTCA CGACGAAGGG TGGAAGTTTC CTGGGAGAGA TGGTTGTCCC TCTCGAGAGC
GTGGGAATTT ACGTTCCTGG AGGAAAGGTT CCGTACTTCT CTACGCTCCT CATGTGTGCG
GTTCCTGCTA TCGTCGCTGG TGTGGAAAGG ATCGTCGTCA CAACACCACC AGATGAAAAC
GGAGGCATCT CCCCATACAT CCTGAAAACC TGCGAAATCC TCGGTCTGAA AGAGATCTAT
CGGATGGGAG GAGCTCATGC GGTGGCTGCT CTCACGTACG GCACGGAAAC AGTGAAACCC
GTTGACAAGA TCGTGGGACC AGGTGGAGTC TTCGTCACGC TCGCGAAGAA ACACGTTTAC
GGAGACGTCG GAATCGACTC CATAGCAGGT CCCAGTGAGA TCACGATCGT GACGGACGGT
AGCGTTGATC TGGATCTCGT AGCCGCGGAT TTTCTCTCCC AGGCGGAGCA CGACGAGAAC
GCGATGAGCG TGGTGATAAC CACCTCGAGA GAAGTCTTCG AAAAATTGCC TCATGTCATA
GAAAGACACC TGGAAGCTCT TCCAGAAGAG AGAAGAAAAA CGGCCAGGAT CTCAACGGAA
AACTTCGGTA CCATCATCTT GACAGACAGT CTGAAAAGGG CCTTTGAGAT CTCCAACCTC
ATCGCTCCCG AACATCTGGA AGTCCTCGTG GAAAAGCCGT TTGAATCGCT GGGATATATA
AAGAACGCGG GATCTGTTTT CCTTGGAAAG TACACCTGTG AGTCTGTGGG AGACTACGGT
GCGGGACCGA ACCACGTCCT TCCCACCTTC AGATCCGCGA GATTCTCCTC CGGACTCAGA
GTTTCCGATT TCACGAAGAA GATATTCATC ACGTATCTCT CCGAAGAAGA TTTCAAAAGA
AAGGGCGAGC TTTACTCGAA GATGGCACGT TGGGAAGGTT TTGAAGCCCA CGCTCGGGCA
ATAGACGTCA GGAGGGGAAA ACTGTGA
 
Protein sequence
MILMNNPGDK EVLRLLKQRM ESVSQVEEAV KEIIRRVKEE GDRALEEFLK RFEKHPVGIE 
NLRVTKKEIS EAQVEEEFVE TIKMVIEDLK EFHRRQEERS FFFTTKGGSF LGEMVVPLES
VGIYVPGGKV PYFSTLLMCA VPAIVAGVER IVVTTPPDEN GGISPYILKT CEILGLKEIY
RMGGAHAVAA LTYGTETVKP VDKIVGPGGV FVTLAKKHVY GDVGIDSIAG PSEITIVTDG
SVDLDLVAAD FLSQAEHDEN AMSVVITTSR EVFEKLPHVI ERHLEALPEE RRKTARISTE
NFGTIILTDS LKRAFEISNL IAPEHLEVLV EKPFESLGYI KNAGSVFLGK YTCESVGDYG
AGPNHVLPTF RSARFSSGLR VSDFTKKIFI TYLSEEDFKR KGELYSKMAR WEGFEAHARA
IDVRRGKL
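
The gene and protein sequences are printed in space-separated blocks of ten, so they need cleaning before use. The sketch below is one hedged way to strip the spacing, compute a GC fraction, and translate in frame. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table assumes that translation table 11 assigns the same amino acids as the standard code and differs only in the permitted start codons. Because every full row of the gene sequence holds 60 bases, each row begins in frame and can be translated on its own.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGATTTTGA TGAACAACCC CGGAGATAAA GAAGTGCTCA GACTGTTGAA ACAGAGGATG"
last_row = "ATAGACGTCA GGAGGGGAAA ACTGTGA"

print(translate(first_row))  # MILMNNPGDKEVLRLLKQRM
print(translate(last_row))   # IDVRRGKL
```

The two translated rows match the first twenty and last eight residues of the protein sequence listed above. Passing the whole gene sequence to `gc_fraction` would be one way to check the reported GC content.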