Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_0843 |
Symbol | hisD |
ID | 4463056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 909585 |
End bp | 910865 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639699862 |
Product | histidinol dehydrogenase |
Protein accession | YP_843272 |
Protein GI | 116754154 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.567726 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGTGA AGAAGCTGAA TGAGCTGAGT TCAGAGGACC TGAAGGTTCT TCTGTCTAGA GAGATCGGAA TACAGGACGT CCTGCAGAGG GTCAACGACA TTGTGATGGA TGTCGCAGAG AACGGCGATG AAGCTCTGAG GAGATACACG GAGCAGTTCG ATGGCGTGCG CCTTGAGAGC TTCAGGGTAT CAGATGATGA GATCGAGGAG GCGTACGATG CTGTGGATGA GAGTCTTCTC AGATCTCTGG AGCTTGCGGC TCAGAACATC TACGCATTCC ATGATGAGGA GCGCACGAAG GATCTCTGGT TGTACCAGGT CGCGCCTGGG GTAGTCGCAG GGCAGAAGGT CGTGCCGCTG GAGAGCGTCG GTGCTTATGT TCCCGGCGGA AGAGCCGCGT ATCCGAGCTC TGCGCTGATG TGTGTTATTC CCGCGAAGGT CGCGGGCGTT GAGAGAATGG TCGTGTGCAC GCCGCCGGAC AGATCTGGCA GGATCTCGCC TCTAACGTTA GCCGCCGCGG ATATTGCCGG TGCGGATGAG ATCTACAAGC TTGGGGGAGC TCAGGCGATA GCTGCCATGG CCCTCGGAAC CGAGAGCATT GAGCGTGTCG AGAAGATAGT CGGACCCGGG AATATATACG TCACTGCAGC GAAAATGCTC GTGAGAGGCA GCGTCGAGAT AGATTTTCCA GCGGGACCCT CAGAGGTTTT AATCATAGCG GATTCATCTG CGGATCCCGA GTTCGTAGCA GCGGATATGA TCGCCCAGGC CGAGCACGAT CCATCATCGA TAGCGGTAGT TGTCACGACC AGCGAACCGC TGGCTAAGGC CATCGAGACG GAGATACCAT CTCAGATTGA GAAAGCGGAG CGGAGGGATA TAGTCCAGGC GAGCCTTGAG CGATGTGCCA TCCTGTTGGC CGAGAGTCTG GATGACGCTT TGGCATTTTC GAACGCATTC GCGCCTGAGC ACCTGGAGCT GATGGTCAGG GATCCGATGG ATGCCCTGAA CCTTGTGAGA AGCGCTGGAT CTGTGTTTCT CGGGCACTAT ACACCCGTAG CTGCTGGCGA TTACGCAACA GGAACGAACC ATGTGCTTCC CACAGCTGGT TACGCGAGGA TCTTCTCTGG TTTGAACATA GATGCGTTCA CCAAGAAGAT ATCCATACAG AGCATGACCG CAGATGGGCT TGAGTCGCTC GCGGATGCTA TAATAAAGAT GGCCGAGTCT GAGGGGCTCA GGGCACACGC TGAATCCGTG CGGATCCGGA TGAGGAGATG A
|
Protein sequence | MIVKKLNELS SEDLKVLLSR EIGIQDVLQR VNDIVMDVAE NGDEALRRYT EQFDGVRLES FRVSDDEIEE AYDAVDESLL RSLELAAQNI YAFHDEERTK DLWLYQVAPG VVAGQKVVPL ESVGAYVPGG RAAYPSSALM CVIPAKVAGV ERMVVCTPPD RSGRISPLTL AAADIAGADE IYKLGGAQAI AAMALGTESI ERVEKIVGPG NIYVTAAKML VRGSVEIDFP AGPSEVLIIA DSSADPEFVA ADMIAQAEHD PSSIAVVVTT SEPLAKAIET EIPSQIEKAE RRDIVQASLE RCAILLAESL DDALAFSNAF APEHLELMVR DPMDALNLVR SAGSVFLGHY TPVAAGDYAT GTNHVLPTAG YARIFSGLNI DAFTKKISIQ SMTADGLESL ADAIIKMAES EGLRAHAESV RIRMRR
|
| |