Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2340 |
Symbol | |
ID | 7407759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2481236 |
End bp | 2482654 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716704 |
Product | carboxyl-terminal protease |
Protein accession | YP_002574183 |
Protein GI | 222530301 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000278022 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGTA ATAAGAAAAT CTTGATAAAG ATTTTAGCTG TAATGGTTGC GCTTTCAATA TTTGTTGCAG TTCCTGTGTA TTCCCAGTTT TTTATAATCT CAAATGATTT TCCTACAGAC AAGCAGATGG ATTATATCAA AAAGGTGCTG CAGGTTGCAA AGGTGTATCA TATAGGCAAG TACAGTTATG ATGAGCTTAT TGATATGATG TTTACAGGGC TTTTCAAGAG CCTTGACAAA TATTCAGAGT ACATGAAACC ACAGCAGGCT CAGGACTTTA CCCAGAGCGT AAATGGCGAG TTTTCGGGAA TAGGTATCCA GATAGAAAAA CAGGAGGACT ACATAGTTAT TGTAGGAGTT TTTGATGGAA CACCTGCAAA AGAAGCGGGT CTGAAAGTTG GTGATAAAAT TATAGCAGCA GATGGAAAGT CTCTGGTTGG GAAAACAACA GATGATGCTG TTAAGCTCAT TCGCGGGCAG GAAGGCACAA CTGTTGTGAT TGACATCTTA AGAGATGGTA AGACTTACAG ATTTTCTATC GTAAGAAAAA AAATAAAGAT ACCTGTTGTT GAGTATAAGG TACTTGATAA TAATATAGGA TATATAAAAC TTACACAGTT TACACAGGGC TGTTCCAATG ATATCAAAAA AGCTCTTGAT GAGTTTGATA AAAAAGGTAT CAAAAATATT ATTTTTGATA TTCGAAACAA CCCCGGCGGA CTTTTGGATG AGGTTGTAAA GATATGTGAA TATTTTGTGC CAGAAGGACC AATTGTAACA ATTGAATATA ATACTTATAA AGATGAGTAT AAATCAAAAA ACAAAGAAAC AAAGTATAGG CTTGCAGTTT TGACTAACGA GTCGAGTGCT TCTGCTTCGG AGATTTTTGC CCAAGCTATA AAAGATAGAA AAGTTGGGGT TGTTATTGGT ACAAAGACAT ATGGCAAAGG AACTGTTCAG ACTCTAATTG GCCTTCCTGA GACAGGTACC AAGAAAGGAT ATGTTGCCAA AGTTACAGTT GCAAAGTACA AGTCACCGTC TGGCTATTAT GTTGAAGGAA AAGGTGTTGT GCCAGACATA GAGGTTCAGG ACGACTCACT CTCCCAGTTT GGACCTGATA AGATTTTGAG CCTGAGCGCA ACCAAGAAGT TCAAAAAAGG TGATATGGAC TTGGAGGTTT TGGCAGCTCA GCAAAGGCTT TTCTACCTTG GATATTTAAG CAACTGGACA GCCAAGATGG ATGATAGCAC AGTGGCTGCG GTTAAAAAGT TCCAGAAAGA CAATAAGCTT TATCCTTCTG GAGTGCTTGA TATAACAACG CAGAAAAAGC TAAATGAGAA GTTTTTAGAG TTTGTAAAAT CCAAATATGT AGACAAACAG CTACAGCGAG CAATCCAGTA TTTCAAAACT GGGAAGTAA
|
Protein sequence | MRSNKKILIK ILAVMVALSI FVAVPVYSQF FIISNDFPTD KQMDYIKKVL QVAKVYHIGK YSYDELIDMM FTGLFKSLDK YSEYMKPQQA QDFTQSVNGE FSGIGIQIEK QEDYIVIVGV FDGTPAKEAG LKVGDKIIAA DGKSLVGKTT DDAVKLIRGQ EGTTVVIDIL RDGKTYRFSI VRKKIKIPVV EYKVLDNNIG YIKLTQFTQG CSNDIKKALD EFDKKGIKNI IFDIRNNPGG LLDEVVKICE YFVPEGPIVT IEYNTYKDEY KSKNKETKYR LAVLTNESSA SASEIFAQAI KDRKVGVVIG TKTYGKGTVQ TLIGLPETGT KKGYVAKVTV AKYKSPSGYY VEGKGVVPDI EVQDDSLSQF GPDKILSLSA TKKFKKGDMD LEVLAAQQRL FYLGYLSNWT AKMDDSTVAA VKKFQKDNKL YPSGVLDITT QKKLNEKFLE FVKSKYVDKQ LQRAIQYFKT GK
|
| |