Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1930 |
Symbol | |
ID | 4810788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2302267 |
End bp | 2303784 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107346 |
Product | carboxyl-terminal protease |
Protein accession | YP_001038341 |
Protein GI | 125974431 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0674113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGTGA CGGGTAAATC ATTAAAACGG ATACTGTTAT CTTTGGCTGT TTTTTGTATT TTAATAACGG GCCCGGGAAT TGCCTGTGCC GAGGAAGCCA CGACGCAAGA AAAGGAAATT TTGTTTTCCG ACTATTTCAA AAGCATGATG GACATGGCGC AAGACAAATA CAAAGGTGAA ATAACCGAAA AGCAAATGCT GGAAGGCGCG CTGAAAGGCA TATTCAGCAC AATGGATTCT TACACCGTCT ATTACACTGT GGAAGAGTCG CAAGACTTTT TTACCGATAT AAACGGCTCT TACACGGGAA TAGGTGTGGT AATGTCGGAA GTTGACGGTA AAATAGTGAT AGACAAGGTG TATCCGTCCT CACCGGCGGA GGAAGCAGGG ATAAAAAAAG GCGATGTTAT AGCCCAGGTT GACGGTAAAA GCGTTGAGAA CCTTTCTTTG GAAGAAGTGG CCGGGCTCAT AAAGGGACCG TCGGGTACGA AAGTTGTAAT CGGTGTGTTA AGAAACGGAA CAGACGGGGT AATAGAGCTG GAAGTTACAA GAAGGCAGAT AATTATAAAT CCTGTCACCC ATAAGATAGA AGGCGATATA GGTTATATAA AGCTGGAATC GTTCAATTCC AATGCAAGCA AGGCTATGGA AGAAGCCTTG AAACAAATGG ATAAAAACAA TATAAAAAAG ATAATTCTTG ATTTACGGGA CAATCCGGGC GGAGATGTGG GCCAGGCGGT TTCAATTGCC AGGAAGTTTG TAAAAAAAGG CCTTATTACA AAGCTGGATT TTAAATCGGA ATCCCAAAAG GATGAAGAGT ATTATTCCTA TTTGGAGGAA TTAAAATATA AACTTGTGGT ATTGGTGAAC GAAAACAGCG CAAGCGCTTC GGAAATATTG GCGGGAGCGA TACAGGATAC AGGTTCGGGT ATTTTGGTTG GTACAACAAC CTTTGGAAAG GGAAAAGTCC AGAATCTTTA TCCCATACTG ACTCCGGAGG CGGTGGAAAA ATATCGGAAA GAAACCGGAG AAACATTTGT GAACGGATAC GATTTATTGG AAAAACACGG CATCTATCCT TCCGATGAAG AAATAATCGG ATGGGTGAAA ATAACAACCG GGGAATATTA TACCCCCAAC GGAAGGATGA TAGACGGAGT AGGTCTTGAG CCCGACGTTT ATGTTGAAAA TGAACCGGAG GGAAAATATA AAATCCTTGA AGGTGTGGAA AAACTTCGCA AGGTGACAAA ACCTTCTTTA AATGCTCAAA GTGAGGATGT CCTGAATGCG GAAAAAATTT TATCGGCACT GGGGTATGAT GTTGACACTC CGGACAATTT AATGGATGAA AAGACCGTCA AGGCGGTGGC AGAGTTTCAG AGAGACTGCG GATTGTATTC TTACGGAGTT TTGGACTTTG CCACACAGCA GGCGCTGAAT GACAAATTGG ATGAATTGCT TCTTGTAAAG AACAGAGACA AACAGTATGA AAAGGCGGTG GAACTGCTTG AAAATTAG
|
Protein sequence | MDVTGKSLKR ILLSLAVFCI LITGPGIACA EEATTQEKEI LFSDYFKSMM DMAQDKYKGE ITEKQMLEGA LKGIFSTMDS YTVYYTVEES QDFFTDINGS YTGIGVVMSE VDGKIVIDKV YPSSPAEEAG IKKGDVIAQV DGKSVENLSL EEVAGLIKGP SGTKVVIGVL RNGTDGVIEL EVTRRQIIIN PVTHKIEGDI GYIKLESFNS NASKAMEEAL KQMDKNNIKK IILDLRDNPG GDVGQAVSIA RKFVKKGLIT KLDFKSESQK DEEYYSYLEE LKYKLVVLVN ENSASASEIL AGAIQDTGSG ILVGTTTFGK GKVQNLYPIL TPEAVEKYRK ETGETFVNGY DLLEKHGIYP SDEEIIGWVK ITTGEYYTPN GRMIDGVGLE PDVYVENEPE GKYKILEGVE KLRKVTKPSL NAQSEDVLNA EKILSALGYD VDTPDNLMDE KTVKAVAEFQ RDCGLYSYGV LDFATQQALN DKLDELLLVK NRDKQYEKAV ELLEN
|
| |