Gene Cthe_3158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3158 
Symbol 
ID4809608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3730815 
End bp3732743 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content48% 
IMG OID640108591 
Productaconitate hydratase 
Protein accessionYP_001039546 
Protein GI125975636 
COG category[C] Energy production and conversion 
COG ID[COG1048] Aconitase A 
TIGRFAM ID[TIGR01342] aconitate hydratase, putative, Aquifex type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000107146 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGTTTGA ATTTGGCACA AAAAATAATT AAGGAGCATT TGGTAAGCGG AGAAATGAAG 
CCCGGCACTG AAATAGCAAT AAGGATAGAT CAGACTTTGA CCCAGGACTC TACCGGAACA
ATGGCATATC TGCAGTTTGA GGCCATGGGA ATTCCAAGGG TAAAGACTAA AAAGTCCGTT
GCCTATATTG ACCACAACAC GCTCCAGACA GGTTTTGAGA ACGCAGATGA CCATAAATAT
ATTCAGACGG TTGCTGCAAA GCACGGAATA TATTTTTCAA AACCCGGTAA CGGAATATGC
CATCAGGTTC ACCTGGAGAG ATTTGGCGTA CCGGGAATGA CTCTTCTGGG ATCCGACAGC
CATACTCCCA CCGGTGGCGG AATCGGAATG CTTGCCATAG GAGCAGGCGG TCTTGACGTG
GCGGTGGCAA TGGGCGGAGG CCCATACTAT ATGATGATGC CCAAAGTATG CAGGGTGGTT
TTAAAGGGAG CTTTAAAGCC ATGGGTTACC GCCAAGGACA TAATTCTCGA AGTGCTGAGA
AGACTTTCGG TAAAAGGCGG AGTTGGCAAG ATTATCGAGT ATGCCGGAGA CGGCATAAAA
ACTCTTACCG TTCCTGAAAG GGCAACCATT ACCAACATGG GAGCGGAGCT TGGCGCCACC
ACTTCAATTT TCCCGAGCGA TGAGGTTACA AGGGAGTTTT TGAGGGCCCA GGGAAGAGAG
AATGACTGGG TGGAACTTAA GCCCGACGAG GATGCCGAGT ATGACGAAGA GATTGTTATT
AATCTTGACG AGCTTGAGCC TCTTGCAGCA CAACCGCACA GCCCGGACAA TGTTGCAAAG
GTTAAGGATA TAGGTAAGAT AAAGGTTGAC CAGGTGGCAA TCGGAAGCTG CACCAACTCT
TCATACATGG ATATGATGAA GGTGGCTGCA ATACTTAAAG GAAAGAAAGT ACATCCCGAT
GTCAGCCTTG TTATTGCACC GGGTTCAAAA CAGGTGCTGA CAATGCTTGC CCAAAACGGT
GCGCTGGCTG ACATGGTTGC GGCAGGAGCA AGAATACTCG AAAGCGCCTG CGGACCGTGT
ATAGGAATGG GACAGGCTCC GGCAACCGAT GCCGTTTCCT TGAGAACCTT CAACAGAAAC
TTTGAGGGAA GAAGCGGTAC AAAGTCTGCC AAAGTTTATT TGGTAAGTCC TGAGACAGCT
GCGGCAAGCG CAATAACCGG AGTGCTGATA GACCCGAGGG AATTGGGTGA GGCGCCGAAG
GTAAGCATGC CTGAAAAGTT TGTTATTGAT GACAGCATGG TACTGCCGCC CGCACCGGAG
GGAGCAGAAG TTGAGGTGGT AAGAGGACCC AACATTAAGC CTTTCCCGAT AAACCAGGCA
TTGGCTGACA AAGTTTCCGG CAAAGCTTTG ATAAAAGTTG GGGACAATAT AACCACTGAC
CATATTATGC CTTCAAATGC AAAGCTTCTG CCTTTCAGGT CAAATGTGCC GTACCTTGCG
GAATTCTGCC TTACACCTTG CGACCCTGAT TTTCCGAAGA GGGCAAAGGA AAACGGCGGC
GGATTTATCA TCGGCGGTTC AAACTACGGA CAGGGTTCAA GCCGTGAACA TGCTGCATTG
GCTCCACTTC AGCTCGGAGT AAAGGGAGTT ATAGCAAAAT CTTTTGCAAG AATTCATATG
GCAAACCTCA TTAACTCGGG TATTATCCCC ATGACCTTTG AAAATGAGGC TGATTACGAT
GAAATAGACA TGGACGACGA ACTTGTGATT GAAAACGCAA GGGAGCAGAT TAAAAACGGC
AGCAGCATTG TAGTGAAAAA TGTAACTAAA GGGAAAGATA TTAAAGTAAA TGTTGCTTTG
TCGCAAAGAC AAGTGGAAAT AATTCTTGCT GGCGGGCTTT TAAACTATAC GAGGCAGCAG
AATCAGTGA
 
Protein sequence
MGLNLAQKII KEHLVSGEMK PGTEIAIRID QTLTQDSTGT MAYLQFEAMG IPRVKTKKSV 
AYIDHNTLQT GFENADDHKY IQTVAAKHGI YFSKPGNGIC HQVHLERFGV PGMTLLGSDS
HTPTGGGIGM LAIGAGGLDV AVAMGGGPYY MMMPKVCRVV LKGALKPWVT AKDIILEVLR
RLSVKGGVGK IIEYAGDGIK TLTVPERATI TNMGAELGAT TSIFPSDEVT REFLRAQGRE
NDWVELKPDE DAEYDEEIVI NLDELEPLAA QPHSPDNVAK VKDIGKIKVD QVAIGSCTNS
SYMDMMKVAA ILKGKKVHPD VSLVIAPGSK QVLTMLAQNG ALADMVAAGA RILESACGPC
IGMGQAPATD AVSLRTFNRN FEGRSGTKSA KVYLVSPETA AASAITGVLI DPRELGEAPK
VSMPEKFVID DSMVLPPAPE GAEVEVVRGP NIKPFPINQA LADKVSGKAL IKVGDNITTD
HIMPSNAKLL PFRSNVPYLA EFCLTPCDPD FPKRAKENGG GFIIGGSNYG QGSSREHAAL
APLQLGVKGV IAKSFARIHM ANLINSGIIP MTFENEADYD EIDMDDELVI ENAREQIKNG
SSIVVKNVTK GKDIKVNVAL SQRQVEIILA GGLLNYTRQQ NQ