Gene Ccel_0126 details

Gene Information

Locus tag: Ccel_0126
Symbol:
ID: 7309038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 140802
End bp: 142061
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 44%
IMG OID: 643607055
Product: 3-isopropylmalate dehydratase large subunit
Protein accession: YP_002504494
Protein GI: 220927585
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR01343] homoaconitate hydratase family protein
            [TIGR02083] 3-isopropylmalate dehydratase, large subunit
            [TIGR02086] 3-isopropylmalate dehydratase, large subunit
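
The gene and protein lengths follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive (the record does not say so, but this is the reading consistent with the listed 1260 bp) and that the CDS ends in a single stop codon. The function names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
START_BP = 140802
END_BP = 142061


def gene_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count for a complete CDS: total codons minus one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1260 419
```

Both values agree with the Gene Length and Protein Length fields.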


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAATGA CAATGACACA AAAGATTCTT GCAAGCCATG CGGGAGTAGA TAAATTAGTA 
CCCGGACAAT TGATAATGGC TAAACTTGAT ATGGTTCTGG GGAATGATAT AACATCTCCC
GTAGCTATCA AGGAGTTTGA TAAAATCGGT CTGGACAAAG TATTTGATAA GAACAAAATT
GCCATAGTAC CGGACCATTT TACTCCCAAC AAGGATATTA AGTCTGCAGA ACAGGTAAAG
GTTTGCAGGG AGTTTTCAAA GAGAATGGGA ATAGTAAATT TCTTTGAGGT AGGGCAAATG
GGGGTTGAAC ACGCACTGCT GCCCGAAAAA GGACTTGTAG TACCGGGTGA TGTTGTTATA
GGAGCGGATT CACACACCTG TACCTATGGA GCACTTGGTG CATTTTCAAC AGGAATTGGC
AGTACTGACA TGGCAGCGGG GATGGCAACA GGAAAAGCTT GGTTCAAGGT ACCGGAAGCG
ATTAAATTCG TGCTTAGAGG TAAGCCGCAA AAATGGGTGG GCGGAAAAGA TATAATTCTT
CACATTATCG GGATGATAGG CGTTGACGGT GCACTCTACA AATCAATGGA GTTTACAGGT
GATGGTGTAA ATGCTCTTTC AATGGACGAC AGGTTCTCAA TGGCAAATAT GGCTATAGAG
GCTGGTGCAA AAAACGGTAT TTTTGAAGTG GACGAAAAAA CCGTTGAATA CGTAAAGGAA
CATTCCATCA GGCATTATAC CGTATACAAG GCAGATGAAG ATGCTGAATA TTCGGAAGTT
TATGAAATAG ACCTTGCGGA AGTAAAACCA ACAGTTGCAT TCCCTCACCT TCCTGAAAAT
GCACGTACTA TTGACAATGT TGGTGATATT AAAATTGACC AGGTTGTAAT CGGTTCGTGT
ACAAACGGAC GTATAGAAGA CATGAGAATA GCAGCAGGAG TATTGAAGGG AAGAAAGGTT
AGCGACAACG TAAGATGCAT TATCATTCCT GCAACTCAAA AAATATGGAA ACAGGCTATG
AATGAAGGCT TGTTTGACAT ATTCATCGAT GCCGGGGCTG CTGTAAGCAC ACCTACCTGC
GGCCCATGCT TGGGCGGTCA TATGGGTATT CTGGCAAAGG GGGAAAGAGC AGTTGCAACT
ACTAACAGAA ATTTTGTAGG ACGTATGGGA CATCCCGAAA GCGAGGTTTA TCTTGCGGGT
CCTGCGGTTG CAGCGGCTTC TGCGGTAGCA GGCAGGATTG CCGGGCCGGA TGAGATATAA
 
Protein sequence
MGMTMTQKIL ASHAGVDKLV PGQLIMAKLD MVLGNDITSP VAIKEFDKIG LDKVFDKNKI 
AIVPDHFTPN KDIKSAEQVK VCREFSKRMG IVNFFEVGQM GVEHALLPEK GLVVPGDVVI
GADSHTCTYG ALGAFSTGIG STDMAAGMAT GKAWFKVPEA IKFVLRGKPQ KWVGGKDIIL
HIIGMIGVDG ALYKSMEFTG DGVNALSMDD RFSMANMAIE AGAKNGIFEV DEKTVEYVKE
HSIRHYTVYK ADEDAEYSEV YEIDLAEVKP TVAFPHLPEN ARTIDNVGDI KIDQVVIGSC
TNGRIEDMRI AAGVLKGRKV SDNVRCIIIP ATQKIWKQAM NEGLFDIFID AGAAVSTPTC
GPCLGGHMGI LAKGERAVAT TNRNFVGRMG HPESEVYLAG PAVAAASAVA GRIAGPDEI