Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3136 |
Symbol | |
ID | 4809699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3705344 |
End bp | 3706471 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640108569 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001039524 |
Protein GI | 125975614 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACATAG GAGACCATGG CACACATTGT GCGGGAATAA TTGCAGGTTT TGGACCTAAT GTTAAGATTG CCTCACTTAA GCATCTTAGC GGAAATAAAT TTAAAGATTT GGACAATTGG GTTTCTACTA TGATTAAGGC AATTAATTAT GCCGATGCAA TGGATATTAA AATTGTAAAC GTTAGTTTGG GATTACATAA ATCTCAAATT GGAGATAGGC CATTTGATTC TCAGGCTCTA AATGATGCAA TAAGTAATGC AGACTTGTTA TTCGTAACAG CTGCCGGAAA CTTCAATAAA AATATTGACC TACCGGACGA TGTTATTTAT CCCGCAAGCT GTACCGCTGA AAATATTATT ACTGTTGCAA ATACTGATAA AGATGATAAA CTTTATGAAT CTTCAAATTA TGGTGTTATT TCGGTTGACC TTGCTGCCCC CGGGACAGAT ATTCGTAGTA CTATTCCAAC ACATCTTGCA GGAGAAGGCG GACCTTACGA TATAAAAACA GGTACATCCA TGTCTGCGCC ACATGTAGCA GGAGCAGCTG CTTTGTTGTT ATCTTCAAAT CCGTCTTTAA CTACACAACA ACTAAAAGAT TTGATTTTAT CCAGTGTGGA TTTTCTACCG GACTTGCAGG GTAAAGTTGC CACAAGCGGT AGGTTAAACG TTGCAAAAGC CTTGAGGAAG ATTAGAACTT CTGTCAAAAT TGGAGATATA GACGGAAATG GAGAAATATC CTCCATTGAT TACGCCATAC TTAAATCACA TTTAATAAAT TCAAACCTGA CATTTAAACA GTTAGCTGCC GCTGATGTAG ATGGGAATGG ATATGTAAAT TCCATCGATC TTGCCATACT TCAAATGTAT TTATTAGGCA AAGGTGGCAC GTCAGATATC GGGAAAAACC GCATATATAC GTATGGCGAC ATTGACAATA ACGGAATAGT AGACGAGAAT GATTATATAC TGATATGCAA CCATATTAAC GGTACAGGAC AATTATCGGA TGCTAGTCTG TTTGCTGCAG ATGCTGACGG AAATAATGTT ATAGACCAAA CCGATAGAAT TCTTATAGAA AAATATATCA CAGGAAGAAT TACTCATCTA CCTGTCGGAA ATCAATAA
|
Protein sequence | MDIGDHGTHC AGIIAGFGPN VKIASLKHLS GNKFKDLDNW VSTMIKAINY ADAMDIKIVN VSLGLHKSQI GDRPFDSQAL NDAISNADLL FVTAAGNFNK NIDLPDDVIY PASCTAENII TVANTDKDDK LYESSNYGVI SVDLAAPGTD IRSTIPTHLA GEGGPYDIKT GTSMSAPHVA GAAALLLSSN PSLTTQQLKD LILSSVDFLP DLQGKVATSG RLNVAKALRK IRTSVKIGDI DGNGEISSID YAILKSHLIN SNLTFKQLAA ADVDGNGYVN SIDLAILQMY LLGKGGTSDI GKNRIYTYGD IDNNGIVDEN DYILICNHIN GTGQLSDASL FAADADGNNV IDQTDRILIE KYITGRITHL PVGNQ
|
| |