Gene Athe_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1994 
Symbol 
ID7408208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2103828 
End bp2104925 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content37% 
IMG OID643716370 
ProductD-isomer specific 2-hydroxyacid dehydrogenase NAD-binding 
Protein accessionYP_002573854 
Protein GI222529972 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA AAATGAAAAT AATGGTAATT GGAGACGCAA TGATACCGGG TAAAGATTTT 
GAATCAGCAG CTAAAAAATA TTTATCTGAT TATGTGGAAG AAATAATTAC AGGAGATTGG
GAAAATAATT GGGACAATTT ACAAAGAAGA AGATTGGAAG TAGAAAAGAA AGGGCCCGAG
ATTGAGGAAG TAGTTCCTTT AATAAAGGAA AAAGGGCAAG ATGTTTCAAT GTTGTTTGGT
TTATTTGTTC CCATTTCCAA AGAAACATTC AATTACTTGC CAAAGGTAAA GATTATTGGG
GTTTCGCGAG CAGGCTTAGA AAATGTAAAC GTAAAAGAAG CAACCCAACG AGGAGTTTTA
GTGTTCAATG TCCAGGGAAG AAATGCAGAA GCTGTTTCTG ACTTTGCAAT AGGTTTGCTT
TTGGCAGAAT GTAGAAACAT TGCGAGAGCC CACTATGCAA TAAAGAATGG CCAGTGGCGG
AAAGAATTTT CTAATTCTGA TTGGATTCCG GAACTAAAAG GCAAAACAGT TGGTATTATT
GGTTTTGGAT ATATTGGTAG ACTGGTAGCA AAAAAACTCT CTGGATTTGA AGTTAGAAGA
CTTGTGTACG ATCCTTATGT AAGTGAAGAG GAAATTAGAG AATGCGGATG TATACCAGTA
GACAAAGAGA CTTTGTTCAA AGAAAGTGAT TTTATTACTC TCCATGCACG CCTCACAGAA
GAGAATAAAA ATTTGGTTGG CAAATATGAG ATTTCATTGA TGAAACCAAC AGCATACATT
ATTAACACTG CACGGGCAGG TCTAATTGAT AAAGAAGCAT TAATAGAGGC TCTAAAGACA
AAGAGAATAG CAGGAGCAGC ACTGGATGTG TTCTGGGAAG AACCTATTCC TTCGGACAGT
GAGTTGTTAG AATTGGACAA TGTTACTCTT ACAAGTCATT TAGCAGGAAC AACCAAAGAA
GCACTTACAA GATCACCTGA GCTTTTAATG GAGGATGTCA AGAAGTTTAT TGAAGGGCAG
AAAGCAAGAT TTATTGTGAA TCCAGAGGTT TTGGAAAACC AAGAGTTCAA GAAATGGCTG
GAGGGTGTGA AGAAATGA
 
Protein sequence
MSKKMKIMVI GDAMIPGKDF ESAAKKYLSD YVEEIITGDW ENNWDNLQRR RLEVEKKGPE 
IEEVVPLIKE KGQDVSMLFG LFVPISKETF NYLPKVKIIG VSRAGLENVN VKEATQRGVL
VFNVQGRNAE AVSDFAIGLL LAECRNIARA HYAIKNGQWR KEFSNSDWIP ELKGKTVGII
GFGYIGRLVA KKLSGFEVRR LVYDPYVSEE EIRECGCIPV DKETLFKESD FITLHARLTE
ENKNLVGKYE ISLMKPTAYI INTARAGLID KEALIEALKT KRIAGAALDV FWEEPIPSDS
ELLELDNVTL TSHLAGTTKE ALTRSPELLM EDVKKFIEGQ KARFIVNPEV LENQEFKKWL
EGVKK