Gene Cthe_2238 details

Gene Information

Locus tag: Cthe_2238
Symbol:
ID: 4809976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2666100
End bp: 2667518
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 40%
IMG OID: 640107644
Product: aldehyde dehydrogenase
Protein accession: YP_001038633
Protein GI: 125974723
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
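
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes 1-based inclusive coordinates and a gene length that includes the terminal stop codon; both assumptions match the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(2666100, 2667518)
print(length)                  # 1419
print(protein_length(length))  # 472
```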


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGAAA ATCATGACTA TGAGAGAGAA ATAAACAGAT TGTTTGAATT GCAGAAGAAA 
AACGTTGTAC GGCTTCGCAC ATCGAGTATA GATGAGAGAA TTGCGAAACT GAAAAAATTG
AAAGAATATA TTTGGGAAAA TAAAGAAAAG ATTCAGGAGG CGGTTTATAA CGATTTGAGA
AAGCCTCCGG AGGAGGTTTT ATTAACCGAA ATATATCCTG TTGTCTCGGA AATCAGGCAT
GTAATAAAGA ATTTGAAAAA ATGGACAAAG CCTAAGAAAG TCAGAACACC CATATCTCTT
TTTGGGGCAA AAAGCTATTA CAGATTTGAG GCAAAAGGGG TGGTGCTGAT TATTTCACCG
TGGAACTATC CCTTTGAACT CTCAATAGGC CCGTTAATCA CTGCCATTGC TGCCGGGAAT
GCGGTTGTAT TGAAGCCTTC GGAATTGAGT CCCCATACAT CCGGTTATAT AAAGAAACTT
GTGGCAGACA TTTTTGATGA AAGTGAGGTT GCTGTTGTTG AAGGGGATGC GGTGGTGGCC
CAAAAACTGC TGGAGATGGG TTTTAATCAT ATATTTTTTA CCGGAAGTAC AAAGGTTGCG
AAAGCTGTGC TAAAGAAGGC CTCTGAGACA TTGTCTTCGG TAACCCTTGA ACTAGGAGGA
AAAAGTCCGG TAATTATTGA CGGCAAATTT GATATTGAAG AGGCTGCTAA AAAAATAACA
TGGGGTAAAT ATTTAAATGC AGGGCAGACA TGCATAGCTC CGGATTACGT TTTTGTAAAA
AAAGAGCTTT TAGGGGATTT TGTAAGCCAC TTAAAACATT ACATAAAAAA ATATTATTAT
TCTGACGGCA GCGGAAGATG CAGCAACTAC TGCGGTATTA TCAACGAACG TCACTTTAAC
AGGCTGAAAA ATGTGTTTGA GGTGACGGTA AAAGAGGGGG CAAAAGTTTG TGAGGGCGGT
CTGTTTGTTG AGAATGAATG CTATATATCA CCTACTGTTT TGACGGATGT GGGCAGAGAC
TCATATATAA TGGAGGAGGA AATTTTCGGG CCGATTTTGC CGGTGCTGAC TTATGAAAAA
ATCGATGATG TCATTGAGTA TATAAACTCA AAGCCTGCTC CTTTGGTGTT GTATGTTTTC
AGCAGGGACA GGAAATTTTA CAGACATGTG ATTAATAACG TAATTTCCGG GGATTGTCTG
ATAAATGATG TGATAGCGCA CTTTGCCAAT CCCAGGCTGC CTTTTGGAGG GCACAACGCC
AGTGGAATCG GAAAGTCCCA TGGTTATTAC GGATTTAGAG AATTTTCCCA CCTGCGTTCA
ATCATGATTC AACCAAAACG CACAATGTTG CAGTTGCTCT ACCCTCCGTA CGGCGAGTTT
GTAAAAAAGT TGATTGAGTG GAGTACGAAA TATTTTTAG
 
Protein sequence
MAENHDYERE INRLFELQKK NVVRLRTSSI DERIAKLKKL KEYIWENKEK IQEAVYNDLR 
KPPEEVLLTE IYPVVSEIRH VIKNLKKWTK PKKVRTPISL FGAKSYYRFE AKGVVLIISP
WNYPFELSIG PLITAIAAGN AVVLKPSELS PHTSGYIKKL VADIFDESEV AVVEGDAVVA
QKLLEMGFNH IFFTGSTKVA KAVLKKASET LSSVTLELGG KSPVIIDGKF DIEEAAKKIT
WGKYLNAGQT CIAPDYVFVK KELLGDFVSH LKHYIKKYYY SDGSGRCSNY CGIINERHFN
RLKNVFEVTV KEGAKVCEGG LFVENECYIS PTVLTDVGRD SYIMEEEIFG PILPVLTYEK
IDDVIEYINS KPAPLVLYVF SRDRKFYRHV INNVISGDCL INDVIAHFAN PRLPFGGHNA
SGIGKSHGYY GFREFSHLRS IMIQPKRTML QLLYPPYGEF VKKLIEWSTK YF
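
The relationship between the gene sequence and the protein sequence can be reproduced with a minimal sketch. It builds the codon table in the NCBI base order (T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, which this sketch does not model. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing and line breaks, then uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_line = "ATGGCGGAAA ATCATGACTA TGAGAGAGAA ATAAACAGAT TGTTTGAATT GCAGAAGAAA"
last_line = "GTAAAAAAGT TGATTGAGTG GAGTACGAAA TATTTTTAG"
print(translate(first_line))  # MAENHDYEREINRLFELQKK
print(translate(last_line))   # VKKLIEWSTKYF
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow the listed protein sequence and GC content to be compared against the record.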