Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1773 |
Symbol | |
ID | 4810018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2094165 |
End bp | 2096609 |
Gene Length | 2445 bp |
Protein Length | 814 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107187 |
Product | peptidase S16, lon-like protein |
Protein accession | YP_001038187 |
Protein GI | 125974277 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | [TIGR00764] lon-related putative ATP-dependent protease |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCAAA TTAGTGGGTT GCCTGCCGGT ATGTTAAGGA AGGAATGTGA TCCTAATTCT TTCAAGTTTA ATGACACTTC AGAGTTGGAA CCCCTTGAAG GAATTATAGG TCAGGAACGT GCTGTGCGTG CCATGACATT CGGACTTAAA ATCAATACCC GCGGTTACAA TATTTTTATG AGTGGTATGA CCGGAACCGG CAAAACTAGT TACGCTGTAA ATTATATTAA GAAAATAGCT AAAAATTGCA AGACTCCGGA TGACTGGTGC TATGTATATA ATTTTGAGAA TCCGAATCAG CCTAAAGCGA TAAATCTGCC TGCAGGACTT GGCAAAGTGT TTAAAAAGGA CATGGAGGAA TTTATAAAAG TACTTCAGCA GGAAATCAGC AGGGCTTTTG AAAGTGAGGA CTATGAAAGA GAAAGGGCGG CCATTGCAAA TGAATATCAG GGAAAAAAGG CCGAACTTAT GGAAATATTA AACAGGGATG CTGAAAAACA AGGCTTCAAA GTCAGGACAA CAAACGCAGG AATATACTTT CTTCCGGTAA TTGAAGGCAA GACAATAACG GAGGAGGAAT ACGGGCAACT TGATGAAAAG ATTAAGCAGG AAATAACGGA AAGATCAAAT ATAGTTCAGC TTGAGACTTT GGAAATAATC AGAAAGATAA AAAATATTGA AAGGGAAGCG GAAGAAAGGG TTGCTGAATG GGAGAATAAA ATTGCCTTGT TTGCCGTAGG CATGCAGATA AATGACCTCA AAGAAAAGTA CAAGGATTAC AAAGAAGTGG TTAAATATTT GGAACAGGTT CAGGAAGATA TTCTTCAAAA TCTTGATGAT TTCAGGGAGG AAGAGTATTC TGAAGAACAA CAGCTCATTA TGCCCTGGCT TAAAGGTAAT GAAGGCTCGC CTGTAGACAA ATATAAAGTA AATCTTTTGG TGGACAATTC CGGTCTTGAA GGAGCTCCTG TCATAGTCGA TTTCAATCCT ACATATTACA ATCTTATTGG AAGAGTGGAA TATGAAAACG AATTTGGAAC AATGATAACT GATTTTACAA TGATAAAACC GGGATTGTTC CATCAGGCAA ACGGAGGTTA TCTGATACTC CAGGCAAAGG ATGTACTTAG CAATGTCCAA TCCTGGGAAG CTCTAAAAAG GGCACTGAAA ACCCGCCAGA TAACCATTGA GAATATGAAG GAGCAAATGG GACTTGTGGC AGTGTCGACA TTAAAGCCCG AGCCCATACC TTTGCAGGTC AAAGTGATTT TGGTGGGAAA CGAGTTTTTG CACCAGCTGC TTTATGAATA TGATGAGGAT TTCAAAAAGC TCTTTAAAAT AAAAGTGGAT TTTGACGAAG AGATGGACAG AAACGAAGAC AATACCTTGA AACTGGCGCA GTTTATAAGC TCATTCTGCA GAAGGGAGAA CGCCCCGCAT TTTGACAGGA CCGGGGTGGC AAAGGTGGTT GAGTACAGTT CGCGCCTGGT CGGCGATCAG AACAAGCTTT CCACCAGGTT TAATGATATT GTTGAGATAC TTTGTGAATC TGCGGCATGG GCTCAAATCG ACGGAAGCAG TCTGGTCAAA GCGGAGCATG TAAATAAAGC GATTCAGGAG AAGATATACA GGTCAAACAA GTATGATAAA AAGCTTTTGG AGCTTTTGAA GGACGGTATT ATAATTTTGG ATACCGAAGG CGAGGCAGTG GGACAGATAA ACGGCCTTAC CGTACTTGAT ATTGGAGACT ATTGCTTCGG AAAGCCCACG AGGATAACCG CAAACACCTT TATGGGTGAA AAAGGAATAG TAAATATTGA AAGAGAAGTT GAAATGAGCG GGACATCCCA TACAAAAGGG GTTCTGATAT TGAGCGGGTA CATTGGTCAA AAATATGCCC AGGATATACC GCTGTCTCTG ACTGCAAGCC TGTGCTTCGA ACAGCTGTAC AGCGGAGTTG ACGGCGACAG TGCATCAAGC GCGGAGCTCT ATGCGATTCT GTCAAGCCTG GCGGAGGTTC CCATAAAACA GAGCATTGCG GTAACGGGTT CGGTTAACCA GAAAGGAGAA ATTCAACCTA TTGGCGGGGT TAATGAGAAA ATAGAAGGAT TCTTCGAGCT TTGCAAAGCC CGTGGACTTA ACGGCAAGCA TGGAGTAATT ATTCCTTACC AGAATGTAAG AAATCTTGCT TTGAACGATG AGGTTATTGA AGCGGTGAAA GAAGGCAAGT TCCATATATA TGCCGTAAAA ACCATAGATG AGGGAATTGA AATACTTACA GGAATGAAAG CAGGGGAAAA GAGAGAAGAC GGAACTTATC CTGAGGGAAC AATAAACTAT CTTGTATATG AGAAACTTAA AAAATATGCA AGAACGGTTG CCGGATTTGG CAAGGATGAA AAGGAAGCAA AGGATGCAAA GGATGCAAAG AAGAATTCTG ATTAA
|
Protein sequence | MPQISGLPAG MLRKECDPNS FKFNDTSELE PLEGIIGQER AVRAMTFGLK INTRGYNIFM SGMTGTGKTS YAVNYIKKIA KNCKTPDDWC YVYNFENPNQ PKAINLPAGL GKVFKKDMEE FIKVLQQEIS RAFESEDYER ERAAIANEYQ GKKAELMEIL NRDAEKQGFK VRTTNAGIYF LPVIEGKTIT EEEYGQLDEK IKQEITERSN IVQLETLEII RKIKNIEREA EERVAEWENK IALFAVGMQI NDLKEKYKDY KEVVKYLEQV QEDILQNLDD FREEEYSEEQ QLIMPWLKGN EGSPVDKYKV NLLVDNSGLE GAPVIVDFNP TYYNLIGRVE YENEFGTMIT DFTMIKPGLF HQANGGYLIL QAKDVLSNVQ SWEALKRALK TRQITIENMK EQMGLVAVST LKPEPIPLQV KVILVGNEFL HQLLYEYDED FKKLFKIKVD FDEEMDRNED NTLKLAQFIS SFCRRENAPH FDRTGVAKVV EYSSRLVGDQ NKLSTRFNDI VEILCESAAW AQIDGSSLVK AEHVNKAIQE KIYRSNKYDK KLLELLKDGI IILDTEGEAV GQINGLTVLD IGDYCFGKPT RITANTFMGE KGIVNIEREV EMSGTSHTKG VLILSGYIGQ KYAQDIPLSL TASLCFEQLY SGVDGDSASS AELYAILSSL AEVPIKQSIA VTGSVNQKGE IQPIGGVNEK IEGFFELCKA RGLNGKHGVI IPYQNVRNLA LNDEVIEAVK EGKFHIYAVK TIDEGIEILT GMKAGEKRED GTYPEGTINY LVYEKLKKYA RTVAGFGKDE KEAKDAKDAK KNSD
|
| |