Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1217 |
Symbol | |
ID | 4809909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1450289 |
End bp | 1452619 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106640 |
Product | ATP-dependent Clp protease ATP-binding subunit ClpA |
Protein accession | YP_001037642 |
Protein GI | 125973732 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0542] ATPases with chaperone activity, ATP-binding subunit |
TIGRFAM ID | [TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAGAT TGGATGACGT AGCTAACAAA ATCTTAATTG CAGCATATAA CGAAGCAAAA CATCAAAAAC ATGAATTTTT CACACCGGAG CATATTCTTT ATGCTTCCCT GTTTTTTGAT GAAGGCAGGG ACATAATAGA AAACTGCGGC GGCAAAGTTG AAGATATTAA AAAGGATTTG CTGGAATTTT TCCGCAACAA TATGCCCATT GTTGAAAACC ACGAGCCCAT AGAATCCCTG GGGATCAACA GTGTCATGCA GGCCACTGCG TATCAGTGCA TTGCCGCAGG CAGAGAATAT ATACGCATAG GCGATATTAT AGTGGCCCTC TACGGTGAAA AAGAATCTTT TGCCAGTTAT ATACTTCAAA AAAACGGAAT AAAAAAACTT GACGTTTTAA AATATATTTC CCACGGAGTA TCCCTGGTCC CAAAAAATAT GGAAACATCT TTAAAATCTT TGGAGACCGA CACTTATCTT GAAGCTTACC AGTATGCCGA TGATTGGGAA TACGATTATG AATATGAAGA TATTGATGAA GATGAAGACA TTGATGAAGA AAGCTCCTCA AAAAGCAACT TTTTGGAGCA CTTTACAATT GACCTTACCG AAAAAGCAAG AAAGGGTAAA ATAGACCCTC TTATCGGCAG AGAGGATATT TTGGAACGAA CAATACAGGT TTTGTCCAGA AGGCTTAAGA ACAACCCCAT TCACGTTGGG GATCCCGGAG TGGGAAAAAC TGCAATTACC GAAGGTCTGG CAAGGCTGAT TGTGGAGGAC AAGGTTCCAA AAAGTTTAAA AGGCAGTAAA ATATACTATC TTGACATGGG AAGCATGCTG GCAGGCACCA AATACCGCGG AGACTTTGAA GAACGTATTA AAAAGGTTCT CAATGAAATC CAAAACCAGC CGAAAGCAAT TGTTTATATA GATGAAATTC ATACAATAGT GGGTGCCGGG GCCGTATCCG ACGGTGCGAT GGATGCGTCA AACATCATAA AGCCTTTTCT TACACAGGGC ACATTGAGGT TTATAGGCTC GACAACTTAT GAAGAGTATA AAAAGTACTT TGAAAAGGAC AGGGCTCTGT CGAGGAGATT TCAAAAAATT GATGTTCCGG AACCGTCAAT TGATGACACG TTCAAGATAC TCAAAGGTCT TAAGGACAGA TATGAAGAAT ATCACAAGGT AAAATACACA GACAGTGCCT TAAGACTTGC CGCGGAGCTT TCTGCAAAAT ACATCCAGGA TCGTCATCTT CCTGACAAAG CAATAGACGT AATTGACGAA ACCGGAGCTT ATGTGCGTCT TCATGCAAAA GATGAAGACA AGGTAATTAC CATAAAAAAC AAGGACATAG AGCGCACGGT GTCTGCAATT GCCAGAATAC CGATACAGAG TGTATCCAGG GATGAAATTT CAAAACTTAA AAACCTTGAT GTAAAATTAA AATCCACAAT ATTCGGCCAG GACAAGGCTA TTGACACTGT GGTGCAAGCT ATAAAAAGGT CCAGGGCGGG ATTCAATGAA AATGAAAAAC CCGTTGCCTC CCTTCTTTTT GTCGGTCCGA CAGGTGTCGG TAAAACTGAG CTTGCAAAAC AGTTGTCCCT TCACCTCGGT ATTCCTTTTA TAAGGTTTGA TATGAGTGAG TATCAGGAAA AGCACACTGT TTCAAGGTTG ATAGGTGCTC CACCCGGATA CGTTGGATAT GAAGAAGGCG GACTTTTGAC GGATGCAATA AGAAAAACTC CGCATTGTGT GCTGCTTCTC GACGAAATCG AGAAAGCGCA CCCCGATATT TACAATGTGC TGCTTCAGGT AATGGATTAT GCGGTACTTA CGGACAACAA CGGAAAAAAA GCGGACTTTA GAAATGTAAT ACTGATAATG ACCTCCAACG CCGGTGCCCG GGAAGTCGGA AGAACGCTTA TAGGATTTGA CAGCAGAAAC GTTGACAGAA GCGCCATGAC AAAAGAAGTT GAAAGAATAT TCTCTCCGGA GTTTCGAAAC AGACTTGATG ATATTGTGGT ATTCAACCAT ATCAATGAAG AGATGGCGCT GCTTATAACC AAAAAAGCCA TAAATCAATT CAAGGAAAAA CTAAAAACGA AAAATATCAA GCTTAAAGTG ACGGAAAGAT GCTGCAAATG GATTGCCCAA AAAGGTCTTT CGTCAATTTA CGGTGCCCGT GAAATATTGA GGTATGTTCA GGACAAAATA AAAACGTATT TTGTTGACGA AGTTCTCTTT GGAGAGCTTT CCAAAGGCGG CACTGCAATA ATAGACGTTG TGGACGGAGA AATTAAAATA AGCAAAAAAA CTCAAAGGTG A
|
Protein sequence | MMRLDDVANK ILIAAYNEAK HQKHEFFTPE HILYASLFFD EGRDIIENCG GKVEDIKKDL LEFFRNNMPI VENHEPIESL GINSVMQATA YQCIAAGREY IRIGDIIVAL YGEKESFASY ILQKNGIKKL DVLKYISHGV SLVPKNMETS LKSLETDTYL EAYQYADDWE YDYEYEDIDE DEDIDEESSS KSNFLEHFTI DLTEKARKGK IDPLIGREDI LERTIQVLSR RLKNNPIHVG DPGVGKTAIT EGLARLIVED KVPKSLKGSK IYYLDMGSML AGTKYRGDFE ERIKKVLNEI QNQPKAIVYI DEIHTIVGAG AVSDGAMDAS NIIKPFLTQG TLRFIGSTTY EEYKKYFEKD RALSRRFQKI DVPEPSIDDT FKILKGLKDR YEEYHKVKYT DSALRLAAEL SAKYIQDRHL PDKAIDVIDE TGAYVRLHAK DEDKVITIKN KDIERTVSAI ARIPIQSVSR DEISKLKNLD VKLKSTIFGQ DKAIDTVVQA IKRSRAGFNE NEKPVASLLF VGPTGVGKTE LAKQLSLHLG IPFIRFDMSE YQEKHTVSRL IGAPPGYVGY EEGGLLTDAI RKTPHCVLLL DEIEKAHPDI YNVLLQVMDY AVLTDNNGKK ADFRNVILIM TSNAGAREVG RTLIGFDSRN VDRSAMTKEV ERIFSPEFRN RLDDIVVFNH INEEMALLIT KKAINQFKEK LKTKNIKLKV TERCCKWIAQ KGLSSIYGAR EILRYVQDKI KTYFVDEVLF GELSKGGTAI IDVVDGEIKI SKKTQR
|
| |