Gene Cthe_1217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1217 
Symbol 
ID4809909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1450289 
End bp1452619 
Gene Length2331 bp 
Protein Length776 aa 
Translation table11 
GC content41% 
IMG OID640106640 
ProductATP-dependent Clp protease ATP-binding subunit ClpA 
Protein accessionYP_001037642 
Protein GI125973732 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAGAT TGGATGACGT AGCTAACAAA ATCTTAATTG CAGCATATAA CGAAGCAAAA 
CATCAAAAAC ATGAATTTTT CACACCGGAG CATATTCTTT ATGCTTCCCT GTTTTTTGAT
GAAGGCAGGG ACATAATAGA AAACTGCGGC GGCAAAGTTG AAGATATTAA AAAGGATTTG
CTGGAATTTT TCCGCAACAA TATGCCCATT GTTGAAAACC ACGAGCCCAT AGAATCCCTG
GGGATCAACA GTGTCATGCA GGCCACTGCG TATCAGTGCA TTGCCGCAGG CAGAGAATAT
ATACGCATAG GCGATATTAT AGTGGCCCTC TACGGTGAAA AAGAATCTTT TGCCAGTTAT
ATACTTCAAA AAAACGGAAT AAAAAAACTT GACGTTTTAA AATATATTTC CCACGGAGTA
TCCCTGGTCC CAAAAAATAT GGAAACATCT TTAAAATCTT TGGAGACCGA CACTTATCTT
GAAGCTTACC AGTATGCCGA TGATTGGGAA TACGATTATG AATATGAAGA TATTGATGAA
GATGAAGACA TTGATGAAGA AAGCTCCTCA AAAAGCAACT TTTTGGAGCA CTTTACAATT
GACCTTACCG AAAAAGCAAG AAAGGGTAAA ATAGACCCTC TTATCGGCAG AGAGGATATT
TTGGAACGAA CAATACAGGT TTTGTCCAGA AGGCTTAAGA ACAACCCCAT TCACGTTGGG
GATCCCGGAG TGGGAAAAAC TGCAATTACC GAAGGTCTGG CAAGGCTGAT TGTGGAGGAC
AAGGTTCCAA AAAGTTTAAA AGGCAGTAAA ATATACTATC TTGACATGGG AAGCATGCTG
GCAGGCACCA AATACCGCGG AGACTTTGAA GAACGTATTA AAAAGGTTCT CAATGAAATC
CAAAACCAGC CGAAAGCAAT TGTTTATATA GATGAAATTC ATACAATAGT GGGTGCCGGG
GCCGTATCCG ACGGTGCGAT GGATGCGTCA AACATCATAA AGCCTTTTCT TACACAGGGC
ACATTGAGGT TTATAGGCTC GACAACTTAT GAAGAGTATA AAAAGTACTT TGAAAAGGAC
AGGGCTCTGT CGAGGAGATT TCAAAAAATT GATGTTCCGG AACCGTCAAT TGATGACACG
TTCAAGATAC TCAAAGGTCT TAAGGACAGA TATGAAGAAT ATCACAAGGT AAAATACACA
GACAGTGCCT TAAGACTTGC CGCGGAGCTT TCTGCAAAAT ACATCCAGGA TCGTCATCTT
CCTGACAAAG CAATAGACGT AATTGACGAA ACCGGAGCTT ATGTGCGTCT TCATGCAAAA
GATGAAGACA AGGTAATTAC CATAAAAAAC AAGGACATAG AGCGCACGGT GTCTGCAATT
GCCAGAATAC CGATACAGAG TGTATCCAGG GATGAAATTT CAAAACTTAA AAACCTTGAT
GTAAAATTAA AATCCACAAT ATTCGGCCAG GACAAGGCTA TTGACACTGT GGTGCAAGCT
ATAAAAAGGT CCAGGGCGGG ATTCAATGAA AATGAAAAAC CCGTTGCCTC CCTTCTTTTT
GTCGGTCCGA CAGGTGTCGG TAAAACTGAG CTTGCAAAAC AGTTGTCCCT TCACCTCGGT
ATTCCTTTTA TAAGGTTTGA TATGAGTGAG TATCAGGAAA AGCACACTGT TTCAAGGTTG
ATAGGTGCTC CACCCGGATA CGTTGGATAT GAAGAAGGCG GACTTTTGAC GGATGCAATA
AGAAAAACTC CGCATTGTGT GCTGCTTCTC GACGAAATCG AGAAAGCGCA CCCCGATATT
TACAATGTGC TGCTTCAGGT AATGGATTAT GCGGTACTTA CGGACAACAA CGGAAAAAAA
GCGGACTTTA GAAATGTAAT ACTGATAATG ACCTCCAACG CCGGTGCCCG GGAAGTCGGA
AGAACGCTTA TAGGATTTGA CAGCAGAAAC GTTGACAGAA GCGCCATGAC AAAAGAAGTT
GAAAGAATAT TCTCTCCGGA GTTTCGAAAC AGACTTGATG ATATTGTGGT ATTCAACCAT
ATCAATGAAG AGATGGCGCT GCTTATAACC AAAAAAGCCA TAAATCAATT CAAGGAAAAA
CTAAAAACGA AAAATATCAA GCTTAAAGTG ACGGAAAGAT GCTGCAAATG GATTGCCCAA
AAAGGTCTTT CGTCAATTTA CGGTGCCCGT GAAATATTGA GGTATGTTCA GGACAAAATA
AAAACGTATT TTGTTGACGA AGTTCTCTTT GGAGAGCTTT CCAAAGGCGG CACTGCAATA
ATAGACGTTG TGGACGGAGA AATTAAAATA AGCAAAAAAA CTCAAAGGTG A
 
Protein sequence
MMRLDDVANK ILIAAYNEAK HQKHEFFTPE HILYASLFFD EGRDIIENCG GKVEDIKKDL 
LEFFRNNMPI VENHEPIESL GINSVMQATA YQCIAAGREY IRIGDIIVAL YGEKESFASY
ILQKNGIKKL DVLKYISHGV SLVPKNMETS LKSLETDTYL EAYQYADDWE YDYEYEDIDE
DEDIDEESSS KSNFLEHFTI DLTEKARKGK IDPLIGREDI LERTIQVLSR RLKNNPIHVG
DPGVGKTAIT EGLARLIVED KVPKSLKGSK IYYLDMGSML AGTKYRGDFE ERIKKVLNEI
QNQPKAIVYI DEIHTIVGAG AVSDGAMDAS NIIKPFLTQG TLRFIGSTTY EEYKKYFEKD
RALSRRFQKI DVPEPSIDDT FKILKGLKDR YEEYHKVKYT DSALRLAAEL SAKYIQDRHL
PDKAIDVIDE TGAYVRLHAK DEDKVITIKN KDIERTVSAI ARIPIQSVSR DEISKLKNLD
VKLKSTIFGQ DKAIDTVVQA IKRSRAGFNE NEKPVASLLF VGPTGVGKTE LAKQLSLHLG
IPFIRFDMSE YQEKHTVSRL IGAPPGYVGY EEGGLLTDAI RKTPHCVLLL DEIEKAHPDI
YNVLLQVMDY AVLTDNNGKK ADFRNVILIM TSNAGAREVG RTLIGFDSRN VDRSAMTKEV
ERIFSPEFRN RLDDIVVFNH INEEMALLIT KKAINQFKEK LKTKNIKLKV TERCCKWIAQ
KGLSSIYGAR EILRYVQDKI KTYFVDEVLF GELSKGGTAI IDVVDGEIKI SKKTQR