Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2513 |
Symbol | |
ID | 4809269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2982057 |
End bp | 2983235 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640107929 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_001038908 |
Protein GI | 125974998 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGACA ATGGCAGAAG GAAAAACGGT TTTTTCACCA TGTTTTTAAC TTCACTTCTT ACATCTCTGG GTGTTGGAAT ATTGTTGATT TTTGCTGTTA CAGGTTTTGT AACGGGAAAC CGCAGTACAG CTCCCCAAAC GCCTCAGGGG GGTTCGGGCA TTGAAATGCC CAGTCCGCAG CAGATTTCGG CGGAACAATC CAATAATACG GGCGGACAAA ATACCATACA GAGTGTTGCG GAAAATGCGT CAAAAGCTGT GGTAGGCATT TCCGTACTCA AAGTTGACAG CAGCTCAATA TTCAATCCCA ATGCTGTCGA ACGGTGGGGT GTGGGTTCGG GAGTAATAGT AACCCCAAAT GGTTATATTC TTACCAATCA CCATGTGGCT GGAGGAAAAA GCAAAAGGAT TGTTGTTTCG CTGGTCGACG GGAAAAACCT GGACGGAGTG ACTGTCTGGT CAGATTCGGT ATTGGATTTG GCGGTGGTAA AGATAGAAGC GGAAGGACTT CCTACAATAC CGCTGGGTGA TGCCACAAAA CTTAAAGTAG GAGAGCCTGC CATTGCCATC GGCAATCCTC TTGGACTTCA ATTCCAGAGA ACGGTCACAT CGGGTATTAT CAGTGCGCTT AACAGAACTA TAGAGGTTGA CACCGAACAG GGCACAAATT ACATGGAAGG CCTTATTCAA ACCGATGCCA GCATAAATCC CGGAAACAGC GGCGGACCGC TTTTGAACCT GAAAGGTGAA GTGGTGGGAA TTAATACGGT AAAAGTGGCC AGTGCGGAGG GAATAGGCTT TGCCGTACCC ATTAATGTGG CAATTCCCAT AATCAACAAG TTTGCAACTA CAGGAGAGTT TATTGAGCCA TACCTGGGGG TTTTTGCATA TGACAAAGAC ATTATTCCTT ACCTTGACGG AAATGTCAAA GTGCAAAACG GGGTTTATGT GGCAAATGTG GACGAAAACG GACCGGCTTA CAAGAGTGGA ATACGGGTGG GCTGCATCAT GACTCAGATT GACGGCGAGG AGATAAGCAC GATGATGCAG CTAAGATGCG TCATTTATTC AAAAAAACCG GGGGATGTGG TGACCATCAG ACATATCAGC AACGGCAAAC CCCAGACCGT CCAAGTCAGG CTTGCCGCAA AAGAGAAAGA CGGACTTGTG ACAAGGTAG
|
Protein sequence | MFDNGRRKNG FFTMFLTSLL TSLGVGILLI FAVTGFVTGN RSTAPQTPQG GSGIEMPSPQ QISAEQSNNT GGQNTIQSVA ENASKAVVGI SVLKVDSSSI FNPNAVERWG VGSGVIVTPN GYILTNHHVA GGKSKRIVVS LVDGKNLDGV TVWSDSVLDL AVVKIEAEGL PTIPLGDATK LKVGEPAIAI GNPLGLQFQR TVTSGIISAL NRTIEVDTEQ GTNYMEGLIQ TDASINPGNS GGPLLNLKGE VVGINTVKVA SAEGIGFAVP INVAIPIINK FATTGEFIEP YLGVFAYDKD IIPYLDGNVK VQNGVYVANV DENGPAYKSG IRVGCIMTQI DGEEISTMMQ LRCVIYSKKP GDVVTIRHIS NGKPQTVQVR LAAKEKDGLV TR
|
| |