Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4712 |
Symbol | |
ID | 8336066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5372860 |
End bp | 5374149 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 644957812 |
Product | serine protease |
Protein accession | YP_003115414 |
Protein GI | 256393850 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGATC GTTTCACCCC GCCCGCCGCG AGATTCCTGC GGCGCTCCGG CGCCCGCGTC GCAGCGCTCG GCGCGGCGTT CGCCCTCGTC CTGGCGGTCA CGGCGCCGGC GGGCGCCGAG GCCGCCGGCG CGAGCGCGGC TGCCGGCGCC CGGCCGGCCG CGTCCCCGAT CGTCCCGTTC GGGTGCGCGA AGGTCGTCAC GGGCAAGGCA CACTGCCTGG GGCAGGGGCG GCTGATACCG AACGCCTCCG GGAGCCCGAC CAGCTCCGGG CTCAAGCCCG CGGACCTGAT AGCGGCGTAC AAGCTCGCCG GGACCAACGG CGCCGGGCGG ACCGTGGCGA TCGTCGACGC CTTCGACGAC CCGAACGCGG CCGCGGACCT GGCGGCGTAC CGCAGCGCCT ACAACCTGCC GGCGTGCACC GCGGCGAGCG GCTGCTTCCA GAAGGTGAAC CAGAGCGGGC AGGCCTCGCC GCTGCCCGCC GCGGACTACG GCTGGGCGGA GGAGGAGAGT CTGGACCTTG ACATGGTCTC GGCGATCTGC CCCGGCTGCC ACATCCTGTT GGTGGAGGCC AGTGGCGCGG ACACCGCCTC GCTGACCACC GCCGAGGACA CCGCGGCGGC GGCGCCCGGC GTGGTGTCGA TCTCCAACAG CTGGGGCGCG GCGGAGGACA GCTCGACGCT TGCCGCCGAC GCGCACTTCA ACCACCCCGG CAAGGCGATC ACCGCCAGCT CCGGTGACAG CGGCTACGGC GTCAGTTGGC CGGCGGCCTC GCAGTACGTC ACCGCGGTCG GCGGCACCAG CTTGAGCGCC GCGTCCAACG CCCGCGGCTG GACCGAGACC GCGTGGTCCG GCGCCGGCAG CGGCTGTTCG GTGCAGGAGC CCAAGCCCTC GTGGCAGACC GATTCGGGCT GCGCGCACCG CACGGTCGCC GACGTGTCCG CGGTCGCCGA CCCGAACACC GGCGTGGCCG TGTACGACAC CGCGAACAGC TGCGGCGGCG GGGCGTTCTG CGACCTGCTG CTGGCCCTGG GCCTGGCCAC CGGCGCCGAC GGCTGGGTGC AGGTCGGCGG GACCAGCGCG TCCTCGCCGA TCATCGCCTC GGTCTACGCG CTGGCCGGGA ACACCGGATC GCTCGTGTAC GGCTCCCAGC CGTACAAGAA CGCCGGATCG CTGTTCGACG TCACCAGCGG GAACAACGGC ACGTGCACGC CGGCGTACCT GTGCACCGCC GGGACCGGCT ACGACGGCCC GACCGGGCTG GGCTCGCCGA ACGGGACCGG GGCGTTCTGA
|
Protein sequence | MIDRFTPPAA RFLRRSGARV AALGAAFALV LAVTAPAGAE AAGASAAAGA RPAASPIVPF GCAKVVTGKA HCLGQGRLIP NASGSPTSSG LKPADLIAAY KLAGTNGAGR TVAIVDAFDD PNAAADLAAY RSAYNLPACT AASGCFQKVN QSGQASPLPA ADYGWAEEES LDLDMVSAIC PGCHILLVEA SGADTASLTT AEDTAAAAPG VVSISNSWGA AEDSSTLAAD AHFNHPGKAI TASSGDSGYG VSWPAASQYV TAVGGTSLSA ASNARGWTET AWSGAGSGCS VQEPKPSWQT DSGCAHRTVA DVSAVADPNT GVAVYDTANS CGGGAFCDLL LALGLATGAD GWVQVGGTSA SSPIIASVYA LAGNTGSLVY GSQPYKNAGS LFDVTSGNNG TCTPAYLCTA GTGYDGPTGL GSPNGTGAF
|
| |