Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2209 |
Symbol | |
ID | 4811074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2638003 |
End bp | 2639091 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640107615 |
Product | 3-isopropylmalate dehydrogenase |
Protein accession | YP_001038604 |
Protein GI | 125974694 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0473] Isocitrate/isopropylmalate dehydrogenase |
TIGRFAM ID | [TIGR00169] 3-isopropylmalate dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000258319 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAAGT ATAAAATAGC GGTACTTCCG GGTGACGGTA TAGGGCCGGA AGTTATTGAA CAGGCGGTTA AGGTAATTGA AGCTGTGGGT GAGATATACA ATCACAGTTT TGAATTAAAA GAAGGCCTGC TCGGTGGATG TGCCATTGAT GCCACCGGCG AGCCCTTCCC AAAAGAGACA CTGGAACTCT GCAAGTCCTC TGATGCAGTT TTGCTGGGGG CCGTAGGCGG ACCGAAATGG GATAACCTTC CGGGGGACAA AAGACCTGAA GCGGGTCTTC TTGGAATACG CGGAGCATTG GGCTTGTATG CAAATTTAAG GCCTGCGGTT ATCTATCAGG CACTCAAAGG TGCGTCGCCA CTGCGCTCCG ATATTGTAAA AGACGGCATT GACATAATGG TGGTAAGAGA GCTTACCGGC GGTATGTATT TTGGCGAAAG AGGAAGAGTG CAGACGGAGA ATATGGGTCA GGCCGCTTTT GACACAGAAA AGTACAGCGA GTTTGAAGTT GAAAGAATTG CACGGCTGGC GTTTGAGACG GCCATGAAGA GAAATAAGAA ACTCACATCT GTGGACAAAG CCAATGTACT GGAAAGCTCA AGACTTTGGA GGGAAGTTGT TAACAGGGTT GCTTCAGACT ATCCGGAAGT TGAGCTTAAT CATATGTATG TGGACAATGC CGCCATGCAG CTGGTAAGGA ATCCTGCGCA GTTTGACGTT ATAGTTACAT CAAATATGTT CGGTGATATT CTCTCCGACG AGGCGTCTAT GATTACCGGC TCAATAGGCA TGCTTCCTTC GGCAAGTCTT GGAGAAGGCT CATTGGGGCT TTATGAACCA ATACATGGTT CCGCGCCGGA CATAGCGGGG CAGGACAAAG CAAATCCCAT TGCCACAATA CTTTCAGTAG CTATGATGAT GAAATACTCC TTCGGTCTTG AAGATGCTTT CAGGGCTATT GAAAATGCCG TCGTAAATGT GCTTGGCATG GGATACAGAA CGGCGGACAT TGCTTCCCCG GATACTCCTC GTGAGAATAT AGTCGGAACA AAGGAAATGG GAAGGTTAAT AATTTCAAAA TTAAAATAA
|
Protein sequence | MGKYKIAVLP GDGIGPEVIE QAVKVIEAVG EIYNHSFELK EGLLGGCAID ATGEPFPKET LELCKSSDAV LLGAVGGPKW DNLPGDKRPE AGLLGIRGAL GLYANLRPAV IYQALKGASP LRSDIVKDGI DIMVVRELTG GMYFGERGRV QTENMGQAAF DTEKYSEFEV ERIARLAFET AMKRNKKLTS VDKANVLESS RLWREVVNRV ASDYPEVELN HMYVDNAAMQ LVRNPAQFDV IVTSNMFGDI LSDEASMITG SIGMLPSASL GEGSLGLYEP IHGSAPDIAG QDKANPIATI LSVAMMMKYS FGLEDAFRAI ENAVVNVLGM GYRTADIASP DTPRENIVGT KEMGRLIISK LK
|
| |