Gene Mthe_0843 details


Gene Information

Locus tag: Mthe_0843
Symbol: hisD
ID: 4463056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 909585
End bp: 910865
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 56%
IMG OID: 639699862
Product: histidinol dehydrogenase
Protein accession: YP_843272
Protein GI: 116754154
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
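
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes the stop codon. The variable names are chosen for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 909585
end_bp = 910865
gene_length_bp = 1281
protein_length_aa = 426

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, so a CDS of N bases
# has N / 3 codons, one of which does not code for an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span == gene_length_bp)                    # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under those assumptions the three fields agree: 1281 bases, 427 codons, 426 residues.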


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.567726
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGTGA AGAAGCTGAA TGAGCTGAGT TCAGAGGACC TGAAGGTTCT TCTGTCTAGA 
GAGATCGGAA TACAGGACGT CCTGCAGAGG GTCAACGACA TTGTGATGGA TGTCGCAGAG
AACGGCGATG AAGCTCTGAG GAGATACACG GAGCAGTTCG ATGGCGTGCG CCTTGAGAGC
TTCAGGGTAT CAGATGATGA GATCGAGGAG GCGTACGATG CTGTGGATGA GAGTCTTCTC
AGATCTCTGG AGCTTGCGGC TCAGAACATC TACGCATTCC ATGATGAGGA GCGCACGAAG
GATCTCTGGT TGTACCAGGT CGCGCCTGGG GTAGTCGCAG GGCAGAAGGT CGTGCCGCTG
GAGAGCGTCG GTGCTTATGT TCCCGGCGGA AGAGCCGCGT ATCCGAGCTC TGCGCTGATG
TGTGTTATTC CCGCGAAGGT CGCGGGCGTT GAGAGAATGG TCGTGTGCAC GCCGCCGGAC
AGATCTGGCA GGATCTCGCC TCTAACGTTA GCCGCCGCGG ATATTGCCGG TGCGGATGAG
ATCTACAAGC TTGGGGGAGC TCAGGCGATA GCTGCCATGG CCCTCGGAAC CGAGAGCATT
GAGCGTGTCG AGAAGATAGT CGGACCCGGG AATATATACG TCACTGCAGC GAAAATGCTC
GTGAGAGGCA GCGTCGAGAT AGATTTTCCA GCGGGACCCT CAGAGGTTTT AATCATAGCG
GATTCATCTG CGGATCCCGA GTTCGTAGCA GCGGATATGA TCGCCCAGGC CGAGCACGAT
CCATCATCGA TAGCGGTAGT TGTCACGACC AGCGAACCGC TGGCTAAGGC CATCGAGACG
GAGATACCAT CTCAGATTGA GAAAGCGGAG CGGAGGGATA TAGTCCAGGC GAGCCTTGAG
CGATGTGCCA TCCTGTTGGC CGAGAGTCTG GATGACGCTT TGGCATTTTC GAACGCATTC
GCGCCTGAGC ACCTGGAGCT GATGGTCAGG GATCCGATGG ATGCCCTGAA CCTTGTGAGA
AGCGCTGGAT CTGTGTTTCT CGGGCACTAT ACACCCGTAG CTGCTGGCGA TTACGCAACA
GGAACGAACC ATGTGCTTCC CACAGCTGGT TACGCGAGGA TCTTCTCTGG TTTGAACATA
GATGCGTTCA CCAAGAAGAT ATCCATACAG AGCATGACCG CAGATGGGCT TGAGTCGCTC
GCGGATGCTA TAATAAAGAT GGCCGAGTCT GAGGGGCTCA GGGCACACGC TGAATCCGTG
CGGATCCGGA TGAGGAGATG A
 
Protein sequence
MIVKKLNELS SEDLKVLLSR EIGIQDVLQR VNDIVMDVAE NGDEALRRYT EQFDGVRLES 
FRVSDDEIEE AYDAVDESLL RSLELAAQNI YAFHDEERTK DLWLYQVAPG VVAGQKVVPL
ESVGAYVPGG RAAYPSSALM CVIPAKVAGV ERMVVCTPPD RSGRISPLTL AAADIAGADE
IYKLGGAQAI AAMALGTESI ERVEKIVGPG NIYVTAAKML VRGSVEIDFP AGPSEVLIIA
DSSADPEFVA ADMIAQAEHD PSSIAVVVTT SEPLAKAIET EIPSQIEKAE RRDIVQASLE
RCAILLAESL DDALAFSNAF APEHLELMVR DPMDALNLVR SAGSVFLGHY TPVAAGDYAT
GTNHVLPTAG YARIFSGLNI DAFTKKISIQ SMTADGLESL ADAIIKMAES EGLRAHAESV
RIRMRR
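
The relationship between the gene sequence and the protein sequence can be checked by translating codons. The following is a minimal sketch, not the tool that produced this record. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers for this example, and only the first and last lines of the gene sequence are used.

```python
from itertools import product

# Codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
# Each full line holds 60 bases (20 codons), so every line starts in frame.
first_line = "ATGATCGTGA AGAAGCTGAA TGAGCTGAGT TCAGAGGACC TGAAGGTTCT TCTGTCTAGA"
last_line = "CGGATCCGGA TGAGGAGATG A"

print(translate(first_line))  # MIVKKLNELSSEDLKVLLSR
print(translate(last_line))   # RIRMRR
```

Both results match the corresponding ends of the protein sequence listed above. The final codon of the gene, TGA, is a stop codon and contributes no residue.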