Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0900 |
Symbol | hisD |
ID | 7084758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 991829 |
End bp | 993136 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643697923 |
Product | histidinol dehydrogenase |
Protein accession | YP_002354563 |
Protein GI | 217969329 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.69214 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGA CCCCGATCCG CCGCCTGGCC GCGCGCGAAC CCGAATTCCT GTCCACCCTC GATGCCCTGC TCGCCTTCGA GGCCGAGGCC GACGGCCGCA TCGACGCGGC GGTCACCGAG ATCCTGCAGG CGGTGCGCAC CACCGGCGAC GCCGCGGTGG TCGAATACAC CCGCCGCTTC GACGGGCTCG ACGTGCAATC CATGGTCGCG CTCGAGCTGC CCAGGAGCGA GCTGCTGCTC GCGCTCGACA GCCTGCGCCC CGAGCAGCGC GAGGCGCTCA CCATCGCCGC CGACCGCGTG CGCGTCTATC ACGAGCGCCA GAAGGGCGAG TCCTGGGAAT TCACCGAGGC CGACGGCACC CGCCTGGGCC AGAAGGTCAC CCCGCTCGAC CGCGTCGGCC TCTACGTGCC GGGCGGGCGC GCCTCCTACC CGAGCTCGGT GCTGATGAAT GCGATCCCGG CCAAGGTCGC CGGTGTCGGC GAACTGATCA TGGTCGTGCC CACCCCGCGT GGCGAGAAGA ATCCGCTGGT GCTGGCGGCG GCGGCGATCA CCGGCGTCGA CCGCGTGTTC ACCATCGGCG GCGCGCAGGC GGTGGCGGCG CTGGCCTACG GCACGCAGAC CATCCCGCAG GTGGACAAGA TCGTCGGCCC GGGCAATGCC TACGTGGCCG AGGCCAAGCG CCGCGTGTTC GGCACCGTCG GCATCGACAT GGTCGCCGGC CCGTCTGAAG TGCTGATCAT CTCCGATGGC TCCGGCCACG CCGACTGGGT GGCAATGGAC CTCTTCGCCC AGGCCGAGCA CGACGAGCTC GCGCAGTCCA TCCTGCTGTG TACCGACGCC GGCTTCATCG ACGCGGTGCA CGACGCGATC GACCGCCTGC TGCCCACCAT GCCGCGCCGC GACACGATCG CCAGGTCGCT TGCCAACCGC GGCGCGCTGA TCCACGTCGA CAGCCTGGAG CAGGCCTGCG CGCTCGCCAA CCGCATCGCG CCCGAGCACC TCGAGCTGGC GCTGGAAGAT GCCGAGGACT GGATCGCCCA CATCCGTCAC GCCGGCGCGA TCTTCGTCGG CCACTGGGCG GTGGAGGCGC TCGGCGACTA CTGCGCCGGC CCCAACCACG TGCTGCCGAC CATGCGCAGC GCGCGTTTCT CCTCGCCGCT CGGCACCTAC GACTTCCAGA AGCGCACCAG CATCGTCCAC ATCTCGCAGG CGGGCGCGCA GCACCTGGGC AAGGTGGCTT CCATCCTGGC CCATGGCGAG GGTCTGCAGG CGCACGCCCG CTCGGCGGAG ATGCGGCTGA AGGTTTGA
|
Protein sequence | MSQTPIRRLA AREPEFLSTL DALLAFEAEA DGRIDAAVTE ILQAVRTTGD AAVVEYTRRF DGLDVQSMVA LELPRSELLL ALDSLRPEQR EALTIAADRV RVYHERQKGE SWEFTEADGT RLGQKVTPLD RVGLYVPGGR ASYPSSVLMN AIPAKVAGVG ELIMVVPTPR GEKNPLVLAA AAITGVDRVF TIGGAQAVAA LAYGTQTIPQ VDKIVGPGNA YVAEAKRRVF GTVGIDMVAG PSEVLIISDG SGHADWVAMD LFAQAEHDEL AQSILLCTDA GFIDAVHDAI DRLLPTMPRR DTIARSLANR GALIHVDSLE QACALANRIA PEHLELALED AEDWIAHIRH AGAIFVGHWA VEALGDYCAG PNHVLPTMRS ARFSSPLGTY DFQKRTSIVH ISQAGAQHLG KVASILAHGE GLQAHARSAE MRLKV
|
| |