Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1857 |
Symbol | |
ID | 4809408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2201543 |
End bp | 2202832 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107276 |
Product | carboxyl-terminal protease |
Protein accession | YP_001038271 |
Protein GI | 125974361 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000520735 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAATAAAA ATATTTTTTC ATCGAAAATT CTGCCTTTAA TTTTGGTTGC TTTGCTTTCC TCGACTGTAA CTGCATACGG GTTTCTTCAG TGGTATGAAA GAAATCCGAA ATATGTTGTC CTGTCCGAAG AGGAGGCAAA GGTTTTTGAA AAGAAATCAA ATAATAAATT TGATGCCGAT TATGCAATAA CCTTTGACAA AAACAGCGTT GATATTGAGA ATATCAAGAA ATACAACAAA GTAAAAAAGC TCCTCACTTC GCATTATTAT CAAGAAGTTG ACCAGAATCA AATGCTCGAA GGCGCTATTG CAGGGATGGT AAGTGCTTTG AAAGATCCTT ATACTGTATA TTTTACAAAG GATCAAATGC AAGTTTTCAC TGAAAGCACA TCCGGCAGTT ATGTGGGTAT AGGTGTGTCG CTAAATATGG ACTCTGACGG CTTGATGACT GTGGTAGAGG CATTCAACGG ATCCCCGGCA AAAGAAGCGG GAATAATGCC GGGAGACAAG ATAGTCAAAG TTGACGACCA GGATGTTACA ACTATAAGTG ACCAGGACTA TATTGTAAGC ATAATTAAAG GTGAGGAAAA TACCAAGGTG AAGATTACGG TATTTAGGCC TTCAGAAGGC ACATACGTGG ATTTTGACAT AATAAGAAAG AAGATAAAGA TTGAGAACAT AACCAGTGAA TTAATAGACA AAGATATTGG ATACATAAAA ATTAATATGT TTGACAGTGA AATTGCAAAA TATTTTGGAG ACCACCTAAA CGGGCTTCTT GCCAAGAATA TCAAAGGATT GATAATAGAT TTGAGGGATA ATCCCGGCGG AGATTATAAG CAGGTATGTG CGATAGCGGA CCGGCTGCTT CCGGAAGGAT TGATTGTTTA TACTGAAGAC CGATTGGGCA ACAGAATTGA AGAAAAATCG GATTCAACGG AGCTTGGCAT GCCTCTGGCA ATACTGGTGA ACGGCAATAG CGCCAGCGCT TCGGAAATTT TGGCCGGTGC CGTAAAGGAT CACGATAAAG GAACCCTGAT AGGAACCAGA ACCTTTGGAA AAGGGCTTGT TCAGGCGGTG GAGCCGCTTG AGGACGGGTC CGGCCTCAAG TTTACCATTG CAAGATACTT TACCCCATCC GGCGTATGCA TACACCAGGA TGGGATAGAA CCGGACATAG AGGTTAAGCT GGATGAAAAG TATTCAAACT TGCCTGTTTC ACAAGTGCCA AGAGAAGATG ACACCCAGCT TCAAAAAGCT GTTGAGGTAA TACACGGACA GATCGACTGA
|
Protein sequence | MNKNIFSSKI LPLILVALLS STVTAYGFLQ WYERNPKYVV LSEEEAKVFE KKSNNKFDAD YAITFDKNSV DIENIKKYNK VKKLLTSHYY QEVDQNQMLE GAIAGMVSAL KDPYTVYFTK DQMQVFTEST SGSYVGIGVS LNMDSDGLMT VVEAFNGSPA KEAGIMPGDK IVKVDDQDVT TISDQDYIVS IIKGEENTKV KITVFRPSEG TYVDFDIIRK KIKIENITSE LIDKDIGYIK INMFDSEIAK YFGDHLNGLL AKNIKGLIID LRDNPGGDYK QVCAIADRLL PEGLIVYTED RLGNRIEEKS DSTELGMPLA ILVNGNSASA SEILAGAVKD HDKGTLIGTR TFGKGLVQAV EPLEDGSGLK FTIARYFTPS GVCIHQDGIE PDIEVKLDEK YSNLPVSQVP REDDTQLQKA VEVIHGQID
|
| |