Gene Moth_1911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1911 
Symbol 
ID3830835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1979946 
End bp1981145 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content54% 
IMG OID637829844 
ProductIron-containing alcohol dehydrogenase 
Protein accessionYP_430754 
Protein GI83590745 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000928726 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGGAAA CAAAAATAAA CATCAACGAA GTCCGGGAAA TCCGGGCTAA AACAACCGTC 
TACTTTGGAG TTGGAGCTAT TAAAAAGATT GACGACATAG CCAGGGAATT TAAGGAAAAG
GGATACGATA GGATCATCGT AATAACCGGC AAGGGGGCTT ATAAAGCCAC CGGCGCGTGG
GAATATATAG TTCCGGCCTT AAATAAAAAC CAGATAACCT ATATCCATTA CGACCAGGTG
ACGCCCAACC CGACGGTAGA CCAGGTTGAC GAGGCAACCA AACAAGCCCG GGAATTCGGT
GCCCGAGCCG TCCTGGCCAT CGGCGGGGGT AGCCCCATTG ATGCCGCTAA AAGCGTAGCC
GTCTTGCTCT CCTACCCCGA CAAAAATGCC CGACAGCTCT ACCAGTTAGA ATTTACACCT
GTTAAGGCCG CACCTATCAT CGCTATTAAT CTTACCCATG GTACGGGGAC GGAAGCCGAT
CGCTTTGCCG TTGTCAGCAT CCCTGAAAAG GCATATAAAC CCGCTATTGC CTATGATTGC
ATTTACCCCT TATATTCAAT TGACGACCCG GCCCTCATGG TAAAACTGCC GTCCGACCAG
ACAGCTTATG TCTCTGTTGA TGCCCTCAAC CATGTCGTCG AAGCAGCCAC CAGCAAAGTA
GCCAGCCCCT ATACTATTAT CCTGGCCAAG GAAACGGTAC GGCTCATCGC CCGATACCTG
CCCCAGGCCC TGTCCCATCC GGCGGATTTG ACGGCCAGGT ATTATCTCCT CTATGCTTCC
CTGATTGCCG GAATAGCCTT TGACAACGGT TTGCTCCACT TCACCCACGC CCTGGAACAC
CCCCTGAGCG CCGTCAAACC GGAGCTCGCC CACGGTCTGG GGCTGGGTAT GCTGCTGCCG
GCCGTAGTCA AGCAGATTTA CCCGGCAACC CCGGAGGTAC TGGCGGAGAT ACTGGAGCCC
ATTGTTCCCG ATCTCAAAGG CGTTCCCGGT GAAGCAGAAA AGGCTGCCAG CGGGGTGGCA
AAATGGCTTG CCGGAGCCGG TATTACCATG AAGCTAAAAG ATGCGGGCTT TCAAGCGGAA
GATATCGCCA GGTTAACTGA CCTGGCCTTT ACCACCCCGA GTCTCGAGCT TCTCCTGAGT
ATGGCCCCGG TAACGGCCGA CAGGGAAAGG GTTAAGGCAA TTTACCAGGA CGCCTTTTAA
 
Protein sequence
MWETKININE VREIRAKTTV YFGVGAIKKI DDIAREFKEK GYDRIIVITG KGAYKATGAW 
EYIVPALNKN QITYIHYDQV TPNPTVDQVD EATKQAREFG ARAVLAIGGG SPIDAAKSVA
VLLSYPDKNA RQLYQLEFTP VKAAPIIAIN LTHGTGTEAD RFAVVSIPEK AYKPAIAYDC
IYPLYSIDDP ALMVKLPSDQ TAYVSVDALN HVVEAATSKV ASPYTIILAK ETVRLIARYL
PQALSHPADL TARYYLLYAS LIAGIAFDNG LLHFTHALEH PLSAVKPELA HGLGLGMLLP
AVVKQIYPAT PEVLAEILEP IVPDLKGVPG EAEKAASGVA KWLAGAGITM KLKDAGFQAE
DIARLTDLAF TTPSLELLLS MAPVTADRER VKAIYQDAF