Gene Moth_0475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0475 
Symbol 
ID3832413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp478293 
End bp479330 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content42% 
IMG OID637828409 
Productalcohol dehydrogenase 
Protein accessionYP_429348 
Protein GI83589339 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCTT TAGTGTTTGA GGGACCTAAC GAGCTTAAGC TAAGAGATGT ACCACTTCCG 
GAACCGGGAG AGGATGAGAT TTTAGTAAAA GTTGCTGCGT GTTTAATCTG TGGTACAGAT
CTTAGAATTT TTAGGGGGGC CAAAACAAGA GGGGTCCGGA TACCCTCTAT TTTGGGCCAC
GAATTTGCTG GGGTTGTTGA GGCAACTGGA GTTAATGTAA AAGAATTTCA TGTTGGTGAT
AGAGTAGGTG TGGCGCCTGT TATTCCCTGC CATACATGTT TCTATTGTAA AAATGGATTA
GAAAACGTAT GCGCTAATCG TACAGCCCTG GGTTACGAGT ATGAAGGTGC ATTTGCAGAA
TATGTATGTA TTCCAGCCCC TGCGGTTAAA GGGGGGAATG TGTATCACTT GCCTTCCAAC
ATTTCCCTTG AGGAAGCAGC TTTAGCAGAG CCTCTTGCAT GCTGTTTGAA CGGACATCAT
AATTCTAAAG TAAAATTAGG AGATGTGGTT GTAATTTTAG GCGCGGGGCC AATTGGACTA
ATGCACTTGC AATTAGCTAA AAGTTCTGGG GCCAGTTATG TAATTATCAG TGAACCCAAT
GAACATCGAC GTGCTATAGC CAAAGAATTC GGAGCAGATC GTGTAGTAGA CCCTCAGACC
GAGGATCTAA ATAGCATAGT TAAAAATGTT ACTGATGGAT TAGGAGCAGA TATAATTTTT
TTGGCCATAG GCATACCGGC CTTAGCACAG GATGCCCTTA CCCTTGTAAA AAAAGGAGGT
ACAATTAATT TCTTTGCTGG CTTTTCTGTT GGTGACAAAG CAGCACTTGA TGTAAATCTC
ATTCATTATA ACGAAATAAA AATCACTGGT ACAAGTGCTG CAAGGCGTGA TGACTATAGA
AAGGCTCTGG ATTTAATCGC AAAGGGTAAA GTAAAGGCTT CTAAAATGAT CACCCATCGT
TTCCCTTTAG ATAAGGCGGA AGAAGCCTTT AGAATTGCAG GATCAAGTCA AGGTATTAAG
GTGGCGATTA TTCCTTGA
 
Protein sequence
MKALVFEGPN ELKLRDVPLP EPGEDEILVK VAACLICGTD LRIFRGAKTR GVRIPSILGH 
EFAGVVEATG VNVKEFHVGD RVGVAPVIPC HTCFYCKNGL ENVCANRTAL GYEYEGAFAE
YVCIPAPAVK GGNVYHLPSN ISLEEAALAE PLACCLNGHH NSKVKLGDVV VILGAGPIGL
MHLQLAKSSG ASYVIISEPN EHRRAIAKEF GADRVVDPQT EDLNSIVKNV TDGLGADIIF
LAIGIPALAQ DALTLVKKGG TINFFAGFSV GDKAALDVNL IHYNEIKITG TSAARRDDYR
KALDLIAKGK VKASKMITHR FPLDKAEEAF RIAGSSQGIK VAIIP