Gene Moth_1024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1024 
Symbol 
ID3832644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1052627 
End bp1053727 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content58% 
IMG OID637828952 
ProductIron-containing alcohol dehydrogenase 
Protein accessionYP_429881 
Protein GI83589872 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00115837 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000647947 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGTCCTTCA GTTTTTATTT ACCAACAAAG GTCTTTTTTG GCGAAGGGGC TGTTAACAAT 
CATGGCGTCT TTCTTAAAGG CAGGGGCCGC CGGGCCCTGG TAGTCACCGG GCGCCACAGC
GCTACAGCCA GCGGTGCTAT GGCCGATATT GAAGCCTTGG CCAAGAAACT AGATATAACC
CTGGCGACCT TCAATCAGGT CCCTTCTAAC CCCACCCTGG AAGTAGTGGG CCGGGGTGTT
GAAATGGCCC GCAGCGAGGG GGCGGATTTT ATCATCGGAA TTGGCGGTGG TTCTCCCCTG
GATACGGCCA AAGCCATCGC CCTCCTGGCA ACCAACAAAG TACCGGCAAC CGCCCTTTAT
GAGGCTGAGC TACCGGAGCC GCCTCTACCT GTGATTGCTA TACCGACTAC TGCCGGCACC
GGGAGCGAAG TCACCCAGCA TGCCGTTTTT ACCCTGCCGG AAAAGAAAAT CAAGAAGGGC
TTTAGTGACG ACCGCTGTTT TCCCCTGGCA GCATTGGTGG ATCCCCGTTA TACCGCCTCT
CTCCCCCTGG AGGTTACCAT TGATACCGCC CTGGACGCCC TGAGCCATGC CATCGAGGGT
TACCTGTCCA GGCGGGCGAC GCCTTTAAGC GATACTCTGG CCCTTGAGGC CATGGGCCTC
TTCGCCAGGC ATAAGGAAGC CCTGGTGAGG GGAGAGTTGA CTCCTGCTAC CAGGTACGAT
CTCATGTATG CTTCCACCCT TGGTGGCATG GTCATTGCCC AGACGCGTAC GACCATCCTG
CATACCCTGG GTTATCCTCT AACCTTCAGC CATAATATCC CCCATGGCCG GGCCAATGGC
CTCCTGCTGG CAGCCTACCT GGAGTTCGTA CAACCGGCGG AACCGGTGAA GGTCGCTCGC
ATCCTGACTG TTTTAGGGAT GACCTCCCTG GCAGAGGTGC AACAGATGAT CCGGCTACTC
CTGCCGGTGC CGGGAAAATA TCCGGAGAAG GAATTGGAAC GCATGGCGGA TCTGGTAACC
GGGGCCAGCA GTATGGCCTG GACGGCCCGC CAGGGGACCC GGGCCGATCT GGTCCGGATA
CTGCGCCAGA GTTTGGGTTA G
 
Protein sequence
MSFSFYLPTK VFFGEGAVNN HGVFLKGRGR RALVVTGRHS ATASGAMADI EALAKKLDIT 
LATFNQVPSN PTLEVVGRGV EMARSEGADF IIGIGGGSPL DTAKAIALLA TNKVPATALY
EAELPEPPLP VIAIPTTAGT GSEVTQHAVF TLPEKKIKKG FSDDRCFPLA ALVDPRYTAS
LPLEVTIDTA LDALSHAIEG YLSRRATPLS DTLALEAMGL FARHKEALVR GELTPATRYD
LMYASTLGGM VIAQTRTTIL HTLGYPLTFS HNIPHGRANG LLLAAYLEFV QPAEPVKVAR
ILTVLGMTSL AEVQQMIRLL LPVPGKYPEK ELERMADLVT GASSMAWTAR QGTRADLVRI
LRQSLG