Gene Moth_2035 details


Gene Information

Locus tag: Moth_2035
Symbol:
ID: 3831410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2124247
End bp: 2125521
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 66%
IMG OID: 637829964
Product: histidinol dehydrogenase
Protein accession: YP_430874
Protein GI: 83590865
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
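
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues in an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2124247, 2125521)
print(length)                     # 1275
print(protein_length_aa(length))  # 424
```

Both results agree with the Gene Length and Protein Length fields of the record.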


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTACCAC TAATTGATGG TAAAGAGGTA AAGCGCCGCT GGTCCGGGCG TCTCCTGGCC 
CGGGAAGGGG TGGCAGCCAG GGTGCGGGAG ATTATTGCCG CCGTGAAGAG GGAAGGCCAG
GCTGCGGTGG AGCGCTATAC CCTGGAACTG GACGGTGTCG ACCTTAAGGA GGCTGGCTTC
CGGGTAACCA GAGAAGAGAT TGGGGCCGCT TACAGGGCCG TTAGCCCGGA CCTTCTGGAA
GCCCTGAGGA TCGCCAGGGA CAATATCGCC ACTTATCACC GCCGCCAACC CCGCGGTTCC
TGGATGGAGA CGGCAGCGGA CGGCACCATC CTGGGCCAGA TCTGCCGGCC CCTGGGACGG
GTGGGGCTTT ATGTGCCAGG TGGCACGGCG GCTTACCCTT CGTCGGTATT GATGACCGCT
GTACCGGCCC GGGTGGCCGG GGTCAGGGAG ATTGCCCTAG CGACACCGCC GCGGCGGGAC
GGGACACTAC CGCCCCTGCT CTTGGTGGCG GCAGCGGAAG CCGGAGTAGA AGAGATCTAC
AAAATGGGGG GCGCCCAGGC CGTGGCCGCC CTGGCCTACG GTACGGAGAA AGTGGCCCCG
GTGGATAAGA TCGCCGGGCC GGGGAATATC TACGTTACCC TGGCGAAGAA GGAAGTCTAC
GGCCAGGTGG ATATCGACAT GCTGGCCGGG CCCAGTGAGA TTGTCGTGAT CGCCGATGGA
AAGGCCCGGC CGGACTGGGT GGCGGCCGAC CTTCTCTCCC AGGCCGAACA CGACGCCCTG
GCCGGGGCAG TCCTCATCAC GCCGGATGCC GGCCTGGCCC GGGCGGTGGG GGAGGAAGTT
ACCCGCCAGC TCGAAGCCTT GCCCAGGCGG GAGATTGCCA GCCGTTCCCT GGCCGATTAC
GGCGCCGCCG TAGTGGTGAC GGGCCTGGAC GCTGCCATGG ACCTGGCCAA CTCCCTGGCC
CCGGAGCACC TGGAGCTGTA CGTATCTGAA CCCTGGTCAT GGCTGGGCCG GGTGGAGAAT
GCCGGGGCGA TTTTCCTGGG GCCTTATAGT TCCGAGCCCC TGGGCGATTA CCTGGCCGGT
CCCAGCCACG TCCTACCCAC CGGCGGCACG GCCAGGTTCT ATTCACCCCT GAGCGTAGAC
ACCTTTTTAA AGAAAAGTAG CTTGATTGCC TGCAACCGGG CGGGCTTCCG GGCTGCCGCG
GGATATATCC AGGCTCTGGC CCGGGCCGAG GGCCTGGAGG GGCACGCCCG GGCCATCGAG
CTACGGGAGG AATGA
 
Protein sequence
MLPLIDGKEV KRRWSGRLLA REGVAARVRE IIAAVKREGQ AAVERYTLEL DGVDLKEAGF 
RVTREEIGAA YRAVSPDLLE ALRIARDNIA TYHRRQPRGS WMETAADGTI LGQICRPLGR
VGLYVPGGTA AYPSSVLMTA VPARVAGVRE IALATPPRRD GTLPPLLLVA AAEAGVEEIY
KMGGAQAVAA LAYGTEKVAP VDKIAGPGNI YVTLAKKEVY GQVDIDMLAG PSEIVVIADG
KARPDWVAAD LLSQAEHDAL AGAVLITPDA GLARAVGEEV TRQLEALPRR EIASRSLADY
GAAVVVTGLD AAMDLANSLA PEHLELYVSE PWSWLGRVEN AGAIFLGPYS SEPLGDYLAG
PSHVLPTGGT ARFYSPLSVD TFLKKSSLIA CNRAGFRAAA GYIQALARAE GLEGHARAIE
LREE
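
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, assuming NCBI translation table 11 (the bacterial, archaeal and plant plastid code), in which codon-to-residue assignments match the standard code and alternative start codons such as TTG are read as methionine at the first position. The helper name `translate_cds` is hypothetical, and the routine is not the database's own method.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11; each is read as Met when initiating.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is always methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence above.
print(translate_cds("TTGCTACCAC TAATTGATGG TAAAGAGGTA AAGCGCCGCT GGTCCGGGCG TCTCCTGGCC"))
# MLPLIDGKEVKRRWSGRLLA
```

The output matches the first 20 residues of the protein sequence, including the TTG start codon read as M.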