Gene Athe_2083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2083 
Symbol 
ID7408792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2205049 
End bp2206137 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content39% 
IMG OID643716450 
ProductAlcohol dehydrogenase zinc-binding domain protein 
Protein accessionYP_002573933 
Protein GI222530051 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.6772 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCTT ATGCAATGGT ATTAGAAGAA TTTAACAAGC CGTTAAAAGC AAAAGAGTTT 
GAACTAATCA AGCCTTCTGA TGGTGAACTT CTTGTTAAAA TTGAAGCGGC GGGTGTTTGT
GGATCTGATG TGCATATGTT CAGAGGTAAT GACCCGCGTA CAAAACTTCC CATGATTTTA
GGGCATGAAG GTGTTGGACG TGTGTATGCT ATTTCAGGTC AGTGGCGTGA TATAAATGGC
GAGAAAATTC AAGAGGGAGA TTTGATAATT TGGGACAGGG GTGTTGTGTG TGGTAGGTGC
TACTTTTGTG CTGTCAAAAA AGAAAGCTAT CTGTGTCCGC ACAGATGGAC ATATGGGATA
AGCGTTAGCT GCGCAGAGCC TCCGCATTTG AGAGGCTGCT ATTCGGAGTA CATTTATCTT
CACAAAGATA CGAAAGTGAT AAAAATAAAA GAGAATGTTG ATCCAGAAAT TTTAGTATCT
GCCTCATGTT CTGGTGCAAC GTGTGCTCAT GCTTTTGACA TTGTTTCACC TGATTTTGGT
GACAGTGTCC TAATTCAAGG GCCAGGTCCT ATAGGGCTTT ATGCAATCAT TTTTGCAAAA
CTTAGAGGAG CACGAAATAT AATTGTGATT GGTGGCACAA AAGAAAGACT TAAAATGTGT
GAAGAATTTG GGGCAACGCA TGTGCTTGAT AGAAATTCAA CTACAGCTTG CCAAAGACAG
GAAATAATAA TGGATATCAC AAATGGGCGT GGAGTCGATT TGGCAATTGA AGCTGTGGGA
CATCCATCAG CAGTAAGTGA GGGAATAAAA CTTGTTCGAA ATGGTGGAAG CTACTTATCA
CTTGGTTTTG GTGACCCAAA CGGCAGCGTT ACACTCGATT GTTACTATGA TATTGTGAGA
AAAAATTTAA GATATCAAGG GGTATGGGTC AGCGATACAA AACATTTATA TATGGCAGTG
AATGTTGTGC TCCAGAACAG GGAACTTTTC AAAAAGATGA TTACAAATGT TTATAAGTTG
ACTGATGCGA CAAAAGCTCT TGAGGATATG GAAAACAAAA ATACAATAAA ATCTGTTCTA
AAGCCTTGA
 
Protein sequence
MKAYAMVLEE FNKPLKAKEF ELIKPSDGEL LVKIEAAGVC GSDVHMFRGN DPRTKLPMIL 
GHEGVGRVYA ISGQWRDING EKIQEGDLII WDRGVVCGRC YFCAVKKESY LCPHRWTYGI
SVSCAEPPHL RGCYSEYIYL HKDTKVIKIK ENVDPEILVS ASCSGATCAH AFDIVSPDFG
DSVLIQGPGP IGLYAIIFAK LRGARNIIVI GGTKERLKMC EEFGATHVLD RNSTTACQRQ
EIIMDITNGR GVDLAIEAVG HPSAVSEGIK LVRNGGSYLS LGFGDPNGSV TLDCYYDIVR
KNLRYQGVWV SDTKHLYMAV NVVLQNRELF KKMITNVYKL TDATKALEDM ENKNTIKSVL
KP