Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0260 |
Symbol | |
ID | 4808543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 319559 |
End bp | 321682 |
Gene Length | 2124 bp |
Protein Length | 707 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105672 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_001036692 |
Protein GI | 125972782 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.500481 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAA CCGGAGTCAT AGTAAGAATA CATAAAGACA GAGCTATTAT TCGGACAGAC GATAACAGAT TGCTTGCCGT AAAAAGACAC AATGACATGA TGGTAGGCCA AATAGTAAGC TTTGACGCAA ACGAAGTGCA TAAAGTTGAG AGCAAAAAAT ACAAATATGC AGCTTCGGGC AAACGTATCG AAAAAGTTCA AAAAACCCCC AAGATAAAGA ATTTTTCAAG AATAAACAAT ATCAAGGAAT TTTCCCGCGT TGACGACATA AAAAATTTCT CACGCGTAGC GGCAACCAAA GAGACTTCAC AGGACTCACC GCAGGAGTCA AAAGTGGAGA ACTTTTCCCG TGTAGTGGAC TTTTCACGGG TGATGAACTT CTCACGGGTG TCAAACAGTA AGAAAAACGA AATAAAGAAT TTCTCACGTA TAAGCAACAT CAAAAACTTC TCCAGAATAG CATCGATTGC GGCCGCTTTT GTGCTGATAT TCCTGTTTGG TCGGAATGTG ATGCTGAACA ATAGTTCAGA CAGTGAATAT GCTTATGTCA GCGTTGATGT TAATCCAAGC GTTGAGTTTA CGATAAACAG TAAACATAAA GTTATCGTCA CATCCGCAAT AAATCAAGAT GCGTCAGAAG TATTGGATGG CCTTGAACTG AAAGAGAAAG ACCTGAAGTC TGCTCTTGTG ATGGTTCTTG AAAAGGCAGA ATCGCTGGGC TATATTTCGG ATGATAAGAA CTATGTACTT GTTTCCATGG CTCTGAATGA CAAGAACAAA AAAACCAGGG ATAAAAGGGA AGAAAAGATT GATGAGCTGA AAGAGACTAT AGAACAGGGA ATAGAAGCGC TGGACAATGA TACTATTGTC CACAGGACAG TGACTGTGGA CCTTGAGGAA AGAAATAAGG CTTTGGAAAA TGAACTGTCA ATGGGAAGAT ATTATCTGTA TCTCGAAGCA AAAGAAAAAG GTATGGACAT TACTATTGAT GAAGTGAAAT CTTCAAAGAT TTCTGATTTG ATAGAAAAAA TAGAGGATAA TACCGAGCTG GCTCCTACGC CAACACCGGT ACCACCAGAA ACACCGGAGC CGACTCCGAC ACCTACGGCA TCCGAGGCAA CACCGTCAAA TTCACCGGTT GAGAGCAAGT CACCGGAAGC TGTGCCCGAA CTTGGCTCAA GGGAAATAGA AATCCTGGGT GAAAGCGTGG TTTTGGTAAC GGCCTATGAC GAGAACAGGA AGGTTGTTTC CCAGGGCAGC GGTTTTGCTG TCGGAACAGG GTTGTTTGCC ACAAACTATC ATCTGGTTAA AGACGGTGTG GTTGTTAAAA TAACGGCGGG TGACGGAAAA GTATATGATG TGGACGGAAT TGTAAAATAC GACAAAGCGA AGGATTTGGC TTTGCTGAAA ACCAGAGTTG AAACTGGTGT GAACCCACTT AAGCTTGGTA CAAAGAAATC TTTGACCAAA GGCAGCAGGA TTGTGGCAAT AGGCAAGGCA AATGGAGCTA AAAACACTGT GACGAAAGGA AGTATAAAGA GCCTTAAGGT TGACGGCCTG ACCGACGCAA TTGAACTTTC GGCTTCAATT TCAAAGGAAA GTACCGGCGG TCCTGTGTTT GACATGAAAG GAAATGTTGT AGGAATAACT GCTTATGGAA TTTCAAAACA AAATGTCAAT GCTGTGATTC CGGCAGACTA TGTAGCTGAC TGGGTAAAAG AGCTTTCGAA ACATTCCTTT GGCAACATCA GAATTGTAAG GAAAACTCTT GTATTTGACA GTGACTTTGA GTTCAATTTT GTGGTTTACA AAATAATAAG GGCGCTGGAA AATGAAGATG CTGCCACATA TTTTGGCTGC ATGACCGATG AATTGTACAA GGATGAAACA AGGAAAAATC TGGAAGTACT GTTTACAACT TACGACCTTG CTTATAACAT AGAAAGTATC AATGTTGTTT CAAAGAGTGA AGAACAGGCA AAAGTAAGCT ATGTTTATAC AATAAACAAA GAAGCCGGTC CGAACTTTAA AAATTACAGA ATAATCGGAG AATGCAGCCT CATAAAGGTT GACGGCACAT GGAAAATCAA TGATTCGGAG GAAAAGAAAG AGTATATACA ATAG
|
Protein sequence | MKITGVIVRI HKDRAIIRTD DNRLLAVKRH NDMMVGQIVS FDANEVHKVE SKKYKYAASG KRIEKVQKTP KIKNFSRINN IKEFSRVDDI KNFSRVAATK ETSQDSPQES KVENFSRVVD FSRVMNFSRV SNSKKNEIKN FSRISNIKNF SRIASIAAAF VLIFLFGRNV MLNNSSDSEY AYVSVDVNPS VEFTINSKHK VIVTSAINQD ASEVLDGLEL KEKDLKSALV MVLEKAESLG YISDDKNYVL VSMALNDKNK KTRDKREEKI DELKETIEQG IEALDNDTIV HRTVTVDLEE RNKALENELS MGRYYLYLEA KEKGMDITID EVKSSKISDL IEKIEDNTEL APTPTPVPPE TPEPTPTPTA SEATPSNSPV ESKSPEAVPE LGSREIEILG ESVVLVTAYD ENRKVVSQGS GFAVGTGLFA TNYHLVKDGV VVKITAGDGK VYDVDGIVKY DKAKDLALLK TRVETGVNPL KLGTKKSLTK GSRIVAIGKA NGAKNTVTKG SIKSLKVDGL TDAIELSASI SKESTGGPVF DMKGNVVGIT AYGISKQNVN AVIPADYVAD WVKELSKHSF GNIRIVRKTL VFDSDFEFNF VVYKIIRALE NEDAATYFGC MTDELYKDET RKNLEVLFTT YDLAYNIESI NVVSKSEEQA KVSYVYTINK EAGPNFKNYR IIGECSLIKV DGTWKINDSE EKKEYIQ
|
| |