Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1720 |
Symbol | |
ID | 4808895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2041016 |
End bp | 2041759 |
Gene Length | 744 bp |
Protein Length | 247 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640107133 |
Product | peptidase S14, ClpP |
Protein accession | YP_001038134 |
Protein GI | 125974224 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0740] Protease subunit of ATP-dependent Clp proteases |
TIGRFAM ID | [TIGR00493] ATP-dependent Clp protease, proteolytic subunit ClpP |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 55 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTTTT GGAATTTTAA AAACAGCGAA GAAAATGAGG AGGAGATAGA GCTTAGAATT GATGGTGACA TTGTCATGGA CGATGATTTT TGGTCGATGC TATTTGGAAA TGACAATGTA ACTCCAAAGG GTTTTATGTC AGAACTAAAT AAATATAAAG GCAAAGACAT AAATGTTTGG ATCAATTCCT ATGGAGGGGA TGTCTATGCA GCTTCAAGAA TTTACACAGG TTTAAAGGAA CACAAGGGAA AGGTTAAAGT AAAAATTGAT GGTGTGGCAA TTTCAGCAGC TTCAGTTATA GCTATGGCAG GAGATGAAAT ACTTATGTCT CCAACATCAA TAATAATGCT ACACAACCCC TGGGGAACTT TTCAAGGTGA AGCTAAGGAT TTAAGGCATG GAGCTGATGT GCTAGATGAA GTAAAGGAAA CTATAATTAA TGCCTATCAG CTTAAAACAG GCAAATCAAG AGCAAAAATA TCTCAGATGA TGGATGAGGA AACTTGGATG AGTGCAAAAA AAGCTGTAGC TGAAGGATTT GCAGATGGAA TGCTTTATGA AAAGAATAAA GAAGAGCCAT TGGAAAATTC TTTTATGTTC AGCAGGTTTG CTATTCAAAA TAGTGTTAAT AACAGGACTA GAGATTTTAT AAAACAGTAT AATCAAAGGT TTAAAGAAAC CTATAAAGAT GAAGAAAAAA TAAAATTATT AAAAGCAAAA CTTGCCTTAG AGTGTGAACT TTAG
|
Protein sequence | MPFWNFKNSE ENEEEIELRI DGDIVMDDDF WSMLFGNDNV TPKGFMSELN KYKGKDINVW INSYGGDVYA ASRIYTGLKE HKGKVKVKID GVAISAASVI AMAGDEILMS PTSIIMLHNP WGTFQGEAKD LRHGADVLDE VKETIINAYQ LKTGKSRAKI SQMMDEETWM SAKKAVAEGF ADGMLYEKNK EEPLENSFMF SRFAIQNSVN NRTRDFIKQY NQRFKETYKD EEKIKLLKAK LALECEL
|
| |