Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2035 |
Symbol | |
ID | 3831410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2124247 |
End bp | 2125521 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637829964 |
Product | histidinol dehydrogenase |
Protein accession | YP_430874 |
Protein GI | 83590865 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTACCAC TAATTGATGG TAAAGAGGTA AAGCGCCGCT GGTCCGGGCG TCTCCTGGCC CGGGAAGGGG TGGCAGCCAG GGTGCGGGAG ATTATTGCCG CCGTGAAGAG GGAAGGCCAG GCTGCGGTGG AGCGCTATAC CCTGGAACTG GACGGTGTCG ACCTTAAGGA GGCTGGCTTC CGGGTAACCA GAGAAGAGAT TGGGGCCGCT TACAGGGCCG TTAGCCCGGA CCTTCTGGAA GCCCTGAGGA TCGCCAGGGA CAATATCGCC ACTTATCACC GCCGCCAACC CCGCGGTTCC TGGATGGAGA CGGCAGCGGA CGGCACCATC CTGGGCCAGA TCTGCCGGCC CCTGGGACGG GTGGGGCTTT ATGTGCCAGG TGGCACGGCG GCTTACCCTT CGTCGGTATT GATGACCGCT GTACCGGCCC GGGTGGCCGG GGTCAGGGAG ATTGCCCTAG CGACACCGCC GCGGCGGGAC GGGACACTAC CGCCCCTGCT CTTGGTGGCG GCAGCGGAAG CCGGAGTAGA AGAGATCTAC AAAATGGGGG GCGCCCAGGC CGTGGCCGCC CTGGCCTACG GTACGGAGAA AGTGGCCCCG GTGGATAAGA TCGCCGGGCC GGGGAATATC TACGTTACCC TGGCGAAGAA GGAAGTCTAC GGCCAGGTGG ATATCGACAT GCTGGCCGGG CCCAGTGAGA TTGTCGTGAT CGCCGATGGA AAGGCCCGGC CGGACTGGGT GGCGGCCGAC CTTCTCTCCC AGGCCGAACA CGACGCCCTG GCCGGGGCAG TCCTCATCAC GCCGGATGCC GGCCTGGCCC GGGCGGTGGG GGAGGAAGTT ACCCGCCAGC TCGAAGCCTT GCCCAGGCGG GAGATTGCCA GCCGTTCCCT GGCCGATTAC GGCGCCGCCG TAGTGGTGAC GGGCCTGGAC GCTGCCATGG ACCTGGCCAA CTCCCTGGCC CCGGAGCACC TGGAGCTGTA CGTATCTGAA CCCTGGTCAT GGCTGGGCCG GGTGGAGAAT GCCGGGGCGA TTTTCCTGGG GCCTTATAGT TCCGAGCCCC TGGGCGATTA CCTGGCCGGT CCCAGCCACG TCCTACCCAC CGGCGGCACG GCCAGGTTCT ATTCACCCCT GAGCGTAGAC ACCTTTTTAA AGAAAAGTAG CTTGATTGCC TGCAACCGGG CGGGCTTCCG GGCTGCCGCG GGATATATCC AGGCTCTGGC CCGGGCCGAG GGCCTGGAGG GGCACGCCCG GGCCATCGAG CTACGGGAGG AATGA
|
Protein sequence | MLPLIDGKEV KRRWSGRLLA REGVAARVRE IIAAVKREGQ AAVERYTLEL DGVDLKEAGF RVTREEIGAA YRAVSPDLLE ALRIARDNIA TYHRRQPRGS WMETAADGTI LGQICRPLGR VGLYVPGGTA AYPSSVLMTA VPARVAGVRE IALATPPRRD GTLPPLLLVA AAEAGVEEIY KMGGAQAVAA LAYGTEKVAP VDKIAGPGNI YVTLAKKEVY GQVDIDMLAG PSEIVVIADG KARPDWVAAD LLSQAEHDAL AGAVLITPDA GLARAVGEEV TRQLEALPRR EIASRSLADY GAAVVVTGLD AAMDLANSLA PEHLELYVSE PWSWLGRVEN AGAIFLGPYS SEPLGDYLAG PSHVLPTGGT ARFYSPLSVD TFLKKSSLIA CNRAGFRAAA GYIQALARAE GLEGHARAIE LREE
|
| |