Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2904 |
Symbol | |
ID | 7311520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3458900 |
End bp | 3460147 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643609804 |
Product | carboxyl-terminal protease |
Protein accession | YP_002507178 |
Protein GI | 220930269 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0010841 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAATA TGAAGGACCA AGAAACGAAA CGTCTGTTTT CAGTAATAAC CATGACGGCA GTGGTGTGTT TTACAATATC AATCCTAGTG TATGGCGGCT TAATGTATTT TAACGGAAGT TATTCACTGA TATTCAATAA AAATAGTGTA GACAGGGAAA CGATACAAAA ATTTAATGAA GCAAGAAGTA TACTTCAGAA AGCCTATTAT GAAAATGTAG ACACTAACAA ACTGGTGGAG GGTGCAATTA GCGGTATGAC AGAGTCACTG AATGATCCCT ATACAGTATA TTATAATAAG CAGCAAATGA AGTGGTTTAC AGGTCTTCAG AACAATACAG AGAATGAGTA TGTGGGGGTT GGACTGCCGA TAATGCTAGA TAAAAACGGA ATAGTAACCG TTTTAGAGCC TTACGATAAT TCCCCTGCAA AAATTGCAGG AATCAAGCAA GGAGATAAAA TACTTAAAAT AGATGGTAAA GACATAACAG GAATCAAGGA TGAAACACTG GTTGCCAGCA TGATTAAAGG ACCTGAGAAC ACCGAGACGG TTCTGACTAT TCTTCGAGAA TCAGACAACA GTACCATTGA TATCCCAGTA ATGAGAAAAA AGATTAAAGC CCTGGTGAAT ATAAGAAGTG AAATGTTGGA TGGAAATATT GCATATATTA AGCTTAAAAT GTTTGATAAA AATATTAGCA AGAACTTTAT CAGTCAGTTA AACAAATTGG TTAAGCAAGG TGCTAAAGGC TTAATAATAG ATGTGAGGGA CAATCCGGGG GGATTATATG ATGAAGTAGT GACATTGGCA GACCGACTTC TTCCAAAGGG AACAATAGTA TTTACAAAGG ATAAAAACGG TAAAAAAAGT GTGCAGTCGT CTGATGAAAA TGAACTTAAT ATGCCCATAG CTGTAATTAC AAATGGTAAC AGTGCAAGTG CTTCGGAAAT TCTGGCAGGT GCTGTTAAGG ACTTTAAAAA GGGAACACTA ATAGGAACTA AAACCTTTGG AAAAGGACTG GTGCAGACAA CCTATTCTTT TAAGGACGGA ACAGGACTTA AGGTAACAAT AGCAAGGTAT TATACACCTT CCGGTGTTTG TATACAGGGA CAGGGTATAA AACCTGAAAT CGAAGTAAAG CTTCCCGAAA AGTACAAAGA CATCGATGTT GCAGCAATTC CCAAGGAAGA TGACTTACAA CTTCAAAAGG GTATTGAAGT TATTAGCAAA AAAATAATAT CAGATTAG
|
Protein sequence | MLNMKDQETK RLFSVITMTA VVCFTISILV YGGLMYFNGS YSLIFNKNSV DRETIQKFNE ARSILQKAYY ENVDTNKLVE GAISGMTESL NDPYTVYYNK QQMKWFTGLQ NNTENEYVGV GLPIMLDKNG IVTVLEPYDN SPAKIAGIKQ GDKILKIDGK DITGIKDETL VASMIKGPEN TETVLTILRE SDNSTIDIPV MRKKIKALVN IRSEMLDGNI AYIKLKMFDK NISKNFISQL NKLVKQGAKG LIIDVRDNPG GLYDEVVTLA DRLLPKGTIV FTKDKNGKKS VQSSDENELN MPIAVITNGN SASASEILAG AVKDFKKGTL IGTKTFGKGL VQTTYSFKDG TGLKVTIARY YTPSGVCIQG QGIKPEIEVK LPEKYKDIDV AAIPKEDDLQ LQKGIEVISK KIISD
|
| |