Gene Information |
Locus tag | Cthe_2238 |
Symbol | |
ID | 4809976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2666100 |
End bp | 2667518 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640107644 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001038633 |
Protein GI | 125974723 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
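The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the stated gene length) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2666100, 2667518)
print(length, protein_length_aa(length))  # prints: 1419 472
```

Both results agree with the Gene Length (1419 bp) and Protein Length (472 aa) fields.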
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGAAA ATCATGACTA TGAGAGAGAA ATAAACAGAT TGTTTGAATT GCAGAAGAAA AACGTTGTAC GGCTTCGCAC ATCGAGTATA GATGAGAGAA TTGCGAAACT GAAAAAATTG AAAGAATATA TTTGGGAAAA TAAAGAAAAG ATTCAGGAGG CGGTTTATAA CGATTTGAGA AAGCCTCCGG AGGAGGTTTT ATTAACCGAA ATATATCCTG TTGTCTCGGA AATCAGGCAT GTAATAAAGA ATTTGAAAAA ATGGACAAAG CCTAAGAAAG TCAGAACACC CATATCTCTT TTTGGGGCAA AAAGCTATTA CAGATTTGAG GCAAAAGGGG TGGTGCTGAT TATTTCACCG TGGAACTATC CCTTTGAACT CTCAATAGGC CCGTTAATCA CTGCCATTGC TGCCGGGAAT GCGGTTGTAT TGAAGCCTTC GGAATTGAGT CCCCATACAT CCGGTTATAT AAAGAAACTT GTGGCAGACA TTTTTGATGA AAGTGAGGTT GCTGTTGTTG AAGGGGATGC GGTGGTGGCC CAAAAACTGC TGGAGATGGG TTTTAATCAT ATATTTTTTA CCGGAAGTAC AAAGGTTGCG AAAGCTGTGC TAAAGAAGGC CTCTGAGACA TTGTCTTCGG TAACCCTTGA ACTAGGAGGA AAAAGTCCGG TAATTATTGA CGGCAAATTT GATATTGAAG AGGCTGCTAA AAAAATAACA TGGGGTAAAT ATTTAAATGC AGGGCAGACA TGCATAGCTC CGGATTACGT TTTTGTAAAA AAAGAGCTTT TAGGGGATTT TGTAAGCCAC TTAAAACATT ACATAAAAAA ATATTATTAT TCTGACGGCA GCGGAAGATG CAGCAACTAC TGCGGTATTA TCAACGAACG TCACTTTAAC AGGCTGAAAA ATGTGTTTGA GGTGACGGTA AAAGAGGGGG CAAAAGTTTG TGAGGGCGGT CTGTTTGTTG AGAATGAATG CTATATATCA CCTACTGTTT TGACGGATGT GGGCAGAGAC TCATATATAA TGGAGGAGGA AATTTTCGGG CCGATTTTGC CGGTGCTGAC TTATGAAAAA ATCGATGATG TCATTGAGTA TATAAACTCA AAGCCTGCTC CTTTGGTGTT GTATGTTTTC AGCAGGGACA GGAAATTTTA CAGACATGTG ATTAATAACG TAATTTCCGG GGATTGTCTG ATAAATGATG TGATAGCGCA CTTTGCCAAT CCCAGGCTGC CTTTTGGAGG GCACAACGCC AGTGGAATCG GAAAGTCCCA TGGTTATTAC GGATTTAGAG AATTTTCCCA CCTGCGTTCA ATCATGATTC AACCAAAACG CACAATGTTG CAGTTGCTCT ACCCTCCGTA CGGCGAGTTT GTAAAAAAGT TGATTGAGTG GAGTACGAAA TATTTTTAG
|
Protein sequence | MAENHDYERE INRLFELQKK NVVRLRTSSI DERIAKLKKL KEYIWENKEK IQEAVYNDLR KPPEEVLLTE IYPVVSEIRH VIKNLKKWTK PKKVRTPISL FGAKSYYRFE AKGVVLIISP WNYPFELSIG PLITAIAAGN AVVLKPSELS PHTSGYIKKL VADIFDESEV AVVEGDAVVA QKLLEMGFNH IFFTGSTKVA KAVLKKASET LSSVTLELGG KSPVIIDGKF DIEEAAKKIT WGKYLNAGQT CIAPDYVFVK KELLGDFVSH LKHYIKKYYY SDGSGRCSNY CGIINERHFN RLKNVFEVTV KEGAKVCEGG LFVENECYIS PTVLTDVGRD SYIMEEEIFG PILPVLTYEK IDDVIEYINS KPAPLVLYVF SRDRKFYRHV INNVISGDCL INDVIAHFAN PRLPFGGHNA SGIGKSHGYY GFREFSHLRS IMIQPKRTML QLLYPPYGEF VKKLIEWSTK YF
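The relationship between the two sequence fields can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAG, even though the gene lies on the minus strand) and uses the codon assignments of translation table 11, which match the standard genetic code; table 11 differs only in its permitted start codons, which this sketch does not handle. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; identical for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(grouped_dna: str) -> str:
    """Translate an in-frame DNA string, ignoring the spaces between
    10-base groups, and stop at the first stop codon."""
    dna = "".join(grouped_dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGGCGGAAA ATCATGACTA TGAGAGAGAA"))  # prints: MAENHDYERE
```

The output matches the first ten residues of the protein sequence. Passing the complete gene sequence to the same function would be expected to reproduce the full 472-residue protein.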