Gene Athe_2244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2244 
Symbol 
ID7407663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2381789 
End bp2382976 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content40% 
IMG OID643716610 
Productiron-containing alcohol dehydrogenase 
Protein accessionYP_002574089 
Protein GI222530207 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAACT TGGTTTTCAA TTATTTTATA CCCACAAAGA TCTTGTTTGG ACCAAGAAGT 
TTGAACAGAC TCAAAGACGA AAATTTGCCC GGAAAGAAAG CTCTGATTGT AATATCTGCT
GGAACATCTA TGAAAAAGTA TGGTTATCTG GACAGGCTTA CAGCGATATT AAAAGAAAAG
GGAATTGAGT ATGTTGTATT TGATAAGATT TTGCCCAATC CTATCAAGAA ACATGTTATG
GAAGGGGCAA AATTAGCAAA AGAAGAAGGC TGCGACTTTG TAATTGGTCT TGGTGGAGGA
AGTAGCATTG ACTCTGCAAA AAGTATTGCT CTGATGGCAA AACACGATGG TGATTACTGG
GACTATATAG TAGGCGGGAC AGGCAAAGGC AAAGTACCTG TAAATGGTGC ACTTCCAATT
GTTGCAATAA CCACAACAGC TGGAACTGGA ACAGAGGCTG ACCCTTGGAC TGTTATAACA
AATGAAGAGA CAAACGAGAA GATAGGTTAT GGCAACCAGT ACACATTCCC AACTTTGTCG
GTTGTAGACC CAGAGCTTAT GCTAAGTGTA CCTCCGCACC TTACTGCTTA CCAAGGGTTT
GATGCTTTCT TCCATGCAGT TGAAGGGTAT ATTGCAAAGA TTGCAACTCC AGTCAGTAGC
ATGTTTGCAC TCAAAAGCGT TGAGCTTATA GCAAAGTACT TGCCGATTTG TGTAAAAGAT
GGGACAAATA TAGAGGCAAG AACATATGTT GCACTTGCGA ACACGTTGGC AGGGTTTGTT
GAGTCGACTT CCAGCTGTAC ATCAGAGCAT TCAATGGAAC ATGCCCTCTC TGCATTTTAT
CCAGATTTGC CACATGGTGC AGGACTTATA ATGCTATCTG AAGCTTATCA TACATTCTTT
GCATCAAAAG TGCCTAAAAA ATACATTCAG CTTGCAAGAG CAATGGGTGT TGATGTAGAA
CAACTTCCAG AAGATGAAAG ACCGTTTGCA TTTGTCAAGG CAATGAAAAA GCTTCAGGAA
GAGTGTGGAG TTGGGAATTT AAAGATGTCT GACTATGGAA TTAAAGAGGA TGAGATTGAA
AAGCTTGCTG ATAATGCTAT AAAAACAATG GGCGGACTTT TTGAGGTTGA TCCATATAAG
CTTTCTTTTG AAGAGACAGT AGGAATTATG AGAAAGGCTT ATAAATAA
 
Protein sequence
MENLVFNYFI PTKILFGPRS LNRLKDENLP GKKALIVISA GTSMKKYGYL DRLTAILKEK 
GIEYVVFDKI LPNPIKKHVM EGAKLAKEEG CDFVIGLGGG SSIDSAKSIA LMAKHDGDYW
DYIVGGTGKG KVPVNGALPI VAITTTAGTG TEADPWTVIT NEETNEKIGY GNQYTFPTLS
VVDPELMLSV PPHLTAYQGF DAFFHAVEGY IAKIATPVSS MFALKSVELI AKYLPICVKD
GTNIEARTYV ALANTLAGFV ESTSSCTSEH SMEHALSAFY PDLPHGAGLI MLSEAYHTFF
ASKVPKKYIQ LARAMGVDVE QLPEDERPFA FVKAMKKLQE ECGVGNLKMS DYGIKEDEIE
KLADNAIKTM GGLFEVDPYK LSFEETVGIM RKAYK