Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2211 |
Symbol | |
ID | 4811076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2639652 |
End bp | 2640911 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640107617 |
Product | 3-isopropylmalate dehydratase large subunit |
Protein accession | YP_001038606 |
Protein GI | 125974696 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR01343] homoaconitate hydratase family protein [TIGR02083] 3-isopropylmalate dehydratase, large subunit [TIGR02086] 3-isopropylmalate dehydratase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00871543 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAATGA CTATGACACA GAAAATACTT GCAGATCACG CAGGTCTTGA CAAGGTTTCA CCGGGTCAGC TCATAAAAGC AAAACTTGAC ATGGTTTTGG GAAATGATAT AACAACACCT GTGGCTGTGA AGGAATTTAG AAAAATTGGC GTGAACAAGG TGTTCGATGT AAATAAAATA GCAATTGTTC CTGACCATTT TACACCCAAC AAAGACATCA AGTCCGCGGA GCAGGTCAAG TTTATCAGAG AATTTGCAAG GGAAATGGGA ATAGTAAACT TCTTTGAAGT CGGACAAATG GGTGTTGAGC ATGCCCTGCT TCCGGAAAAG GGCCTTGTAG TTCCGGGAGA CGTGGTAATA GGTGCCGACT CGCATACATG TACTTATGGA GCTTTGGGAG CTTTCTCAAC GGGAATAGGA AGTACCGACA TGGCTGCCGG AATGGCAACC GGAGAAGCAT GGTTTAAAGT GCCCGAGGCC ATGAAATTCG TATTGAAGGG AAAACCCGGA AAATGGGTGA GCGGCAAGGA CATAATCCTT CATATAATTG GAATGATAGG GGTGGACGGA GCTTTGTACC GCTCCATGGA ATTCACGGGA GACGGTGTGG CCCACCTTTC AATGGATGAC AGGTTTGCAA TGGCGAACAT GGCCATTGAG GCAGGAGCAA AGAACGGAAT CTTTGAAGTT GACGAAAAGA CAATTGAGTA TGTAAAAGAA CATTCCACAA GGCAGTACAA GGTATACAAG GCGGATGAAG ACGCAGAATA TGTGGCCACT TACGAAATTG ACCTTTCACA GGTAAAACCC ACGGTTGCGT TCCCGCATCT TCCGTCCAAT ACAAGAACCA TTGACAATGT GGGCAATATC AAAATCGACC AGGTTGTAAT AGGATCATGT ACAAACGGAA GAATTGAGGA TTTGAGGGTG GCCGCGGAAG TCCTCAAGGG AAGAAAAGTG CACAAGGACG TAAGATGTAT AATCATCCCT GCAACTCAGA AGATATGGAA ACAGGCAATG AATGAAGGTC TGTTTGACAT ATTTATTGAT GCGGGAGCTG CGGTAAGTAC TCCCACCTGC GGACCGTGTC TTGGAGGTCA TATGGGTATT CTGGCAAAAG GAGAAAGAGC TGTGGCAACC ACCAACAGAA ACTTTGTGGG AAGAATGGGA CATCCCGAAA GCGAGATTTA CCTCGCAAGT CCGGCTGTAG CTGCGGCATC GGCTGTTTTG GGAAGAATAG GTTCACCGGA TGAACTTTAA
|
Protein sequence | MGMTMTQKIL ADHAGLDKVS PGQLIKAKLD MVLGNDITTP VAVKEFRKIG VNKVFDVNKI AIVPDHFTPN KDIKSAEQVK FIREFAREMG IVNFFEVGQM GVEHALLPEK GLVVPGDVVI GADSHTCTYG ALGAFSTGIG STDMAAGMAT GEAWFKVPEA MKFVLKGKPG KWVSGKDIIL HIIGMIGVDG ALYRSMEFTG DGVAHLSMDD RFAMANMAIE AGAKNGIFEV DEKTIEYVKE HSTRQYKVYK ADEDAEYVAT YEIDLSQVKP TVAFPHLPSN TRTIDNVGNI KIDQVVIGSC TNGRIEDLRV AAEVLKGRKV HKDVRCIIIP ATQKIWKQAM NEGLFDIFID AGAAVSTPTC GPCLGGHMGI LAKGERAVAT TNRNFVGRMG HPESEIYLAS PAVAAASAVL GRIGSPDEL
|
| |