Gene Cthe_2211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2211 
Symbol 
ID4811076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2639652 
End bp2640911 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content46% 
IMG OID640107617 
Product3-isopropylmalate dehydratase large subunit 
Protein accessionYP_001038606 
Protein GI125974696 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR01343] homoaconitate hydratase family protein
[TIGR02083] 3-isopropylmalate dehydratase, large subunit
[TIGR02086] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00871543 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAATGA CTATGACACA GAAAATACTT GCAGATCACG CAGGTCTTGA CAAGGTTTCA 
CCGGGTCAGC TCATAAAAGC AAAACTTGAC ATGGTTTTGG GAAATGATAT AACAACACCT
GTGGCTGTGA AGGAATTTAG AAAAATTGGC GTGAACAAGG TGTTCGATGT AAATAAAATA
GCAATTGTTC CTGACCATTT TACACCCAAC AAAGACATCA AGTCCGCGGA GCAGGTCAAG
TTTATCAGAG AATTTGCAAG GGAAATGGGA ATAGTAAACT TCTTTGAAGT CGGACAAATG
GGTGTTGAGC ATGCCCTGCT TCCGGAAAAG GGCCTTGTAG TTCCGGGAGA CGTGGTAATA
GGTGCCGACT CGCATACATG TACTTATGGA GCTTTGGGAG CTTTCTCAAC GGGAATAGGA
AGTACCGACA TGGCTGCCGG AATGGCAACC GGAGAAGCAT GGTTTAAAGT GCCCGAGGCC
ATGAAATTCG TATTGAAGGG AAAACCCGGA AAATGGGTGA GCGGCAAGGA CATAATCCTT
CATATAATTG GAATGATAGG GGTGGACGGA GCTTTGTACC GCTCCATGGA ATTCACGGGA
GACGGTGTGG CCCACCTTTC AATGGATGAC AGGTTTGCAA TGGCGAACAT GGCCATTGAG
GCAGGAGCAA AGAACGGAAT CTTTGAAGTT GACGAAAAGA CAATTGAGTA TGTAAAAGAA
CATTCCACAA GGCAGTACAA GGTATACAAG GCGGATGAAG ACGCAGAATA TGTGGCCACT
TACGAAATTG ACCTTTCACA GGTAAAACCC ACGGTTGCGT TCCCGCATCT TCCGTCCAAT
ACAAGAACCA TTGACAATGT GGGCAATATC AAAATCGACC AGGTTGTAAT AGGATCATGT
ACAAACGGAA GAATTGAGGA TTTGAGGGTG GCCGCGGAAG TCCTCAAGGG AAGAAAAGTG
CACAAGGACG TAAGATGTAT AATCATCCCT GCAACTCAGA AGATATGGAA ACAGGCAATG
AATGAAGGTC TGTTTGACAT ATTTATTGAT GCGGGAGCTG CGGTAAGTAC TCCCACCTGC
GGACCGTGTC TTGGAGGTCA TATGGGTATT CTGGCAAAAG GAGAAAGAGC TGTGGCAACC
ACCAACAGAA ACTTTGTGGG AAGAATGGGA CATCCCGAAA GCGAGATTTA CCTCGCAAGT
CCGGCTGTAG CTGCGGCATC GGCTGTTTTG GGAAGAATAG GTTCACCGGA TGAACTTTAA
 
Protein sequence
MGMTMTQKIL ADHAGLDKVS PGQLIKAKLD MVLGNDITTP VAVKEFRKIG VNKVFDVNKI 
AIVPDHFTPN KDIKSAEQVK FIREFAREMG IVNFFEVGQM GVEHALLPEK GLVVPGDVVI
GADSHTCTYG ALGAFSTGIG STDMAAGMAT GEAWFKVPEA MKFVLKGKPG KWVSGKDIIL
HIIGMIGVDG ALYRSMEFTG DGVAHLSMDD RFAMANMAIE AGAKNGIFEV DEKTIEYVKE
HSTRQYKVYK ADEDAEYVAT YEIDLSQVKP TVAFPHLPSN TRTIDNVGNI KIDQVVIGSC
TNGRIEDLRV AAEVLKGRKV HKDVRCIIIP ATQKIWKQAM NEGLFDIFID AGAAVSTPTC
GPCLGGHMGI LAKGERAVAT TNRNFVGRMG HPESEIYLAS PAVAAASAVL GRIGSPDEL