Gene Cthe_1391 details


Gene Information

Locus tag: Cthe_1391
Symbol:
ID: 4809052
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1698328
End bp: 1699863
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 41%
IMG OID: 640106815
Product: 2-isopropylmalate synthase
Protein accession: YP_001037816
Protein GI: 125973906
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR00973] 2-isopropylmalate synthase, bacterial type
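
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes one terminal stop codon; neither convention is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1698328
END_BP = 1699863
GENE_LENGTH_BP = 1536
PROTEIN_LENGTH_AA = 511


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1536
print(expected_protein_length(GENE_LENGTH_BP))  # 511
```

Both results agree with the Gene Length and Protein Length fields.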


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGAGAA TAAGAATTTT TGATACAACT TTAAGAGACG GAGAACAAAC ACCGGGGGTA 
AATCTTAATA TACAGGAAAA AGTGGATATT GCAAAACAAT TGGCACGTCT TGGAGTGGAT
GTTATCGAAC CGGGATTCCC GCTGACATCA CCGGGAGATT TTGAAGCTGT GCAGAGAATT
GCCAGAGAGG TTGAGGGGCC TTATATATGC GGCTTTTCAA GAGCAATTAT AAGAGATATT
GATGAGACCT GGAAAGCTAT AAAAGATGCT CAGAAGAAAT GTTTCCATAT TTTTATATCC
AGCTCCGACA TACAAATAAA GCACCAGTTG GGGAAAACGG AAAAAGACGT TCTTGAAATT
GTAAAAAGCA CTGTATATCA TGCAAAGCAG TATACCGACG AGGTAGAATA CTCGCCGATG
GATGCATCGA GAACAAGGCT CGAGTTCCTT TATGAAGTTA TTGAAGCGGC AATAGACAAC
GGTGCCACAG TCATAAACAT TCCGGACACC GTTGGATATG CAACTCCCAT TGAGTTCGGA
GAACTCATAC AAAAAATCAG AAAGAATGTA AGAAATATTG ACAAGGCTAT AATAAGTGTC
CATTGTCACA ACGACCTGGG CATGGCTGTG GCCAACTCCA TTGTTGCTGC TATGAATGGC
GCCCAGCAGA TAGAGTGTAC TATCAACGGC GTGGGTGAAA GGGCAGGAAA TGCAGCTTTG
GAGGAAGTTG TTACCCACAT TGCGGCAAGG AAAGATTATC TGGGATTTGA AACGGGAATA
GATCTTTCAC AGTTGTATAA AACCAGTAAA ATTGTCAGCA GATACATGGG AATACCCATA
CCGGTAAACA AACCTATAGT TGGTAAAAAT GTGTTTACTC ATGAGTCGGG AATACATCAG
GACGGTGTCC TGAAGGAAAG ATCAACTTAT GAAGTAATAG ATCCAAGGCT TGTCGGCAGG
GATGACAGTG TTATTCTCCT TGGAAAGCAC TCGGGAAGGC ATGCTCTTAA AGTGGAAGCT
GAAAAACTCG GATATGACTT GGATGAGGAA CGTCTCAACA AGCTGTTTAA TGATTTTAAA
AAGCTTACCG ACGTTAAAAA GAATGTGACT ACGGCGGACT TGGAATCTCT TATAATTGAA
TCCGCCGCAA AAGCCGTGGA AGAGGCATAT GTGCTTGAAA AGATAAGAGT TGTAAGCGGC
AATATTGAGA CGCCTTCCGC AAAAGTTGTG ATTAAAGATT CAAAGGGCAA TTTGCTCGAA
GCCGAGCAGA CAGGAAACGG ACCGGTGGAT GCTGTTTTTA AAGCTATAAA TTCTGTTATT
AAAGAGACGG AAAATCTTAC GTTATACAAA TACAGTGTGT CCGCCGTAAC GGAAGAAATG
GAGTCATTGG GTGAAGTTTC TGTGACTCTC AGGGAAAAGG AAAAATTATA TACGGGCATA
GGTACACATA CCGACATAAT TACTTCAAGT GCCATAGCCT ATATTGATGC AATTAATAAA
GCTATTGCAG CAAATGCGAG AGCACAAAAA AATTAA
 
Protein sequence
MRRIRIFDTT LRDGEQTPGV NLNIQEKVDI AKQLARLGVD VIEPGFPLTS PGDFEAVQRI 
AREVEGPYIC GFSRAIIRDI DETWKAIKDA QKKCFHIFIS SSDIQIKHQL GKTEKDVLEI
VKSTVYHAKQ YTDEVEYSPM DASRTRLEFL YEVIEAAIDN GATVINIPDT VGYATPIEFG
ELIQKIRKNV RNIDKAIISV HCHNDLGMAV ANSIVAAMNG AQQIECTING VGERAGNAAL
EEVVTHIAAR KDYLGFETGI DLSQLYKTSK IVSRYMGIPI PVNKPIVGKN VFTHESGIHQ
DGVLKERSTY EVIDPRLVGR DDSVILLGKH SGRHALKVEA EKLGYDLDEE RLNKLFNDFK
KLTDVKKNVT TADLESLIIE SAAKAVEEAY VLEKIRVVSG NIETPSAKVV IKDSKGNLLE
AEQTGNGPVD AVFKAINSVI KETENLTLYK YSVSAVTEEM ESLGEVSVTL REKEKLYTGI
GTHTDIITSS AIAYIDAINK AIAANARAQK N
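
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It is not the database's own tooling. It assumes the sequence is read in frame from its first base on the strand shown, and it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard codon assignments, indexed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last display lines of the gene sequence in this record.
first_line = "ATGAGGAGAA TAAGAATTTT TGATACAACT TTAAGAGACG GAGAACAAAC ACCGGGGGTA"
last_line = "GCTATTGCAG CAAATGCGAG AGCACAAAAA AATTAA"

print(translate(first_line))  # MRRIRIFDTTLRDGEQTPGV
print(translate(last_line))   # AIAANARAQKN
```

The two outputs match the first 20 and last 11 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the complete protein sequence and with the GC content field.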