Gene Cthe_0260 details

Gene Information

Locus tag: Cthe_0260
Symbol:
ID: 4808543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 319559
End bp: 321682
Gene Length: 2124 bp
Protein Length: 707 aa
Translation table: 11
GC content: 41%
IMG OID: 640105672
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_001036692
Protein GI: 125972782
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
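
The coordinate and length fields in this record are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (a common convention for annotation records, though the record does not state it) and that the reported protein length excludes the terminal stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(319559, 321682)
print(length)                  # 2124
print(protein_length(length))  # 707
```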


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.500481
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAAGATAA CCGGAGTCAT AGTAAGAATA CATAAAGACA GAGCTATTAT TCGGACAGAC 
GATAACAGAT TGCTTGCCGT AAAAAGACAC AATGACATGA TGGTAGGCCA AATAGTAAGC
TTTGACGCAA ACGAAGTGCA TAAAGTTGAG AGCAAAAAAT ACAAATATGC AGCTTCGGGC
AAACGTATCG AAAAAGTTCA AAAAACCCCC AAGATAAAGA ATTTTTCAAG AATAAACAAT
ATCAAGGAAT TTTCCCGCGT TGACGACATA AAAAATTTCT CACGCGTAGC GGCAACCAAA
GAGACTTCAC AGGACTCACC GCAGGAGTCA AAAGTGGAGA ACTTTTCCCG TGTAGTGGAC
TTTTCACGGG TGATGAACTT CTCACGGGTG TCAAACAGTA AGAAAAACGA AATAAAGAAT
TTCTCACGTA TAAGCAACAT CAAAAACTTC TCCAGAATAG CATCGATTGC GGCCGCTTTT
GTGCTGATAT TCCTGTTTGG TCGGAATGTG ATGCTGAACA ATAGTTCAGA CAGTGAATAT
GCTTATGTCA GCGTTGATGT TAATCCAAGC GTTGAGTTTA CGATAAACAG TAAACATAAA
GTTATCGTCA CATCCGCAAT AAATCAAGAT GCGTCAGAAG TATTGGATGG CCTTGAACTG
AAAGAGAAAG ACCTGAAGTC TGCTCTTGTG ATGGTTCTTG AAAAGGCAGA ATCGCTGGGC
TATATTTCGG ATGATAAGAA CTATGTACTT GTTTCCATGG CTCTGAATGA CAAGAACAAA
AAAACCAGGG ATAAAAGGGA AGAAAAGATT GATGAGCTGA AAGAGACTAT AGAACAGGGA
ATAGAAGCGC TGGACAATGA TACTATTGTC CACAGGACAG TGACTGTGGA CCTTGAGGAA
AGAAATAAGG CTTTGGAAAA TGAACTGTCA ATGGGAAGAT ATTATCTGTA TCTCGAAGCA
AAAGAAAAAG GTATGGACAT TACTATTGAT GAAGTGAAAT CTTCAAAGAT TTCTGATTTG
ATAGAAAAAA TAGAGGATAA TACCGAGCTG GCTCCTACGC CAACACCGGT ACCACCAGAA
ACACCGGAGC CGACTCCGAC ACCTACGGCA TCCGAGGCAA CACCGTCAAA TTCACCGGTT
GAGAGCAAGT CACCGGAAGC TGTGCCCGAA CTTGGCTCAA GGGAAATAGA AATCCTGGGT
GAAAGCGTGG TTTTGGTAAC GGCCTATGAC GAGAACAGGA AGGTTGTTTC CCAGGGCAGC
GGTTTTGCTG TCGGAACAGG GTTGTTTGCC ACAAACTATC ATCTGGTTAA AGACGGTGTG
GTTGTTAAAA TAACGGCGGG TGACGGAAAA GTATATGATG TGGACGGAAT TGTAAAATAC
GACAAAGCGA AGGATTTGGC TTTGCTGAAA ACCAGAGTTG AAACTGGTGT GAACCCACTT
AAGCTTGGTA CAAAGAAATC TTTGACCAAA GGCAGCAGGA TTGTGGCAAT AGGCAAGGCA
AATGGAGCTA AAAACACTGT GACGAAAGGA AGTATAAAGA GCCTTAAGGT TGACGGCCTG
ACCGACGCAA TTGAACTTTC GGCTTCAATT TCAAAGGAAA GTACCGGCGG TCCTGTGTTT
GACATGAAAG GAAATGTTGT AGGAATAACT GCTTATGGAA TTTCAAAACA AAATGTCAAT
GCTGTGATTC CGGCAGACTA TGTAGCTGAC TGGGTAAAAG AGCTTTCGAA ACATTCCTTT
GGCAACATCA GAATTGTAAG GAAAACTCTT GTATTTGACA GTGACTTTGA GTTCAATTTT
GTGGTTTACA AAATAATAAG GGCGCTGGAA AATGAAGATG CTGCCACATA TTTTGGCTGC
ATGACCGATG AATTGTACAA GGATGAAACA AGGAAAAATC TGGAAGTACT GTTTACAACT
TACGACCTTG CTTATAACAT AGAAAGTATC AATGTTGTTT CAAAGAGTGA AGAACAGGCA
AAAGTAAGCT ATGTTTATAC AATAAACAAA GAAGCCGGTC CGAACTTTAA AAATTACAGA
ATAATCGGAG AATGCAGCCT CATAAAGGTT GACGGCACAT GGAAAATCAA TGATTCGGAG
GAAAAGAAAG AGTATATACA ATAG
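
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be joined before it can be analysed. The sketch below is one way to do that, then compute a GC fraction and translate the coding sequence. It rests on two assumptions: the codon table is built from the standard NCBI amino-acid ordering, which translation table 11 shares with table 1 for internal codons, and the alternative start codons that table 11 permits are not handled, since this gene begins with ATG. All helper names are hypothetical, and only the first 30 bases of the record are used as input.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Join a block-formatted sequence by dropping spaces and newlines."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


fragment = clean("ATGAAGATAA CCGGAGTCAT AGTAAGAATA")
print(translate(fragment))  # MKITGVIVRI
print(f"{gc_fraction(fragment):.1%}")
```

The translated fragment matches the first ten residues of the protein sequence in this record. Running the same functions on the full gene sequence would be the way to compare against the reported 41% GC content and 707 aa protein length.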
 
Protein sequence
MKITGVIVRI HKDRAIIRTD DNRLLAVKRH NDMMVGQIVS FDANEVHKVE SKKYKYAASG 
KRIEKVQKTP KIKNFSRINN IKEFSRVDDI KNFSRVAATK ETSQDSPQES KVENFSRVVD
FSRVMNFSRV SNSKKNEIKN FSRISNIKNF SRIASIAAAF VLIFLFGRNV MLNNSSDSEY
AYVSVDVNPS VEFTINSKHK VIVTSAINQD ASEVLDGLEL KEKDLKSALV MVLEKAESLG
YISDDKNYVL VSMALNDKNK KTRDKREEKI DELKETIEQG IEALDNDTIV HRTVTVDLEE
RNKALENELS MGRYYLYLEA KEKGMDITID EVKSSKISDL IEKIEDNTEL APTPTPVPPE
TPEPTPTPTA SEATPSNSPV ESKSPEAVPE LGSREIEILG ESVVLVTAYD ENRKVVSQGS
GFAVGTGLFA TNYHLVKDGV VVKITAGDGK VYDVDGIVKY DKAKDLALLK TRVETGVNPL
KLGTKKSLTK GSRIVAIGKA NGAKNTVTKG SIKSLKVDGL TDAIELSASI SKESTGGPVF
DMKGNVVGIT AYGISKQNVN AVIPADYVAD WVKELSKHSF GNIRIVRKTL VFDSDFEFNF
VVYKIIRALE NEDAATYFGC MTDELYKDET RKNLEVLFTT YDLAYNIESI NVVSKSEEQA
KVSYVYTINK EAGPNFKNYR IIGECSLIKV DGTWKINDSE EKKEYIQ