Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5281 |
Symbol | |
ID | 8336635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 6084197 |
End bp | 6087127 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644958379 |
Product | Peptidase S53 propeptide |
Protein accession | YP_003115981 |
Protein GI | 256394417 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACCA CCGCAAAGAA GCGTTCGAAG ATACTGGCCG CCGGGTCGAT ACCGATGGCC ACGGCCGTCA TCGCCGCCGT CGGGATGTCC TCGGCCCAGG CCCAGAGCAA CGACACGAAC AACAACGGCC ACACCGACTC CCTGGGGCGG GTCGCCGTCG CCGGGACGCA GCCGGCCTGG GCCACCCCGG CCACCGAGAA GGCCGCCGTT CCGGCCACCC AGACCATCAG CACCCGGGTC TACCTGGCCG GGCAGAACCA GGCGCAGATG TCGGCGCTGG CCCAGTCGGT CACCGACCCG GCCAGCCCGA ACTACCACAA GTACCTGACC CCGGCGCAGG TCCAGTCCGA GTTCGGCTCG ACCGCCTCGC AGGTCAAGAC GGTCACCGAC TGGGCGAAGG GCGCCGGGCT GCAGGTCAAG TCGGTCGGCG AGAGCTGGGT GGACCTGGAG GGTCCGGCCT CGGCGGTCCA GACGGCCTTC GGCACGACCC TGGCCAACTT CACCGCGCCG GACGGCCAGT CGCACTACGC GCCGGCCACC GCGGCCATGG TCCCGGGCAA CGTCGCCAGC GCGATCGTCG GCGTCTCCGG GCTCTACGAC GCGCCGAAGA TCGAGACGCC CGGCGCCGTG ACCCAGACCC CGTCGACGAC CCCGGCCGCC GGACACCCGA CCTCCACGAC GTATGACACC TCCTGGTGCT CCACCTCCTG GGGCAGCAAG AGCTACTACA ACATCCCGGC GCCGATCTGC GGCTACGACT CCGCGCAGCT GCGCGGCGCC TACGGGGTGG GCAGCTCCGG CCTGACCGGC AAGGGCGCGA CCGTGGCGAT CATCGACGCC TACGACAGCC CGACGATGGC CGCCGACGCC GCGAAGTACA ACAGCCAGTT CGGCCAGCAG CAGTTCCGCG CCGGGCAGTA CCGCGAGGTC ACGAACACCG CCGCGTACAA CAACCAGGCC GCCTGCGACC CCGACAAGTG GGGGCTGGAG CAGAGCCTGG ACGTCGAGGC CGTGCACAAC ATGGCGCCGG ACGCCAACGT GGTCTACGTC GGCGCCGCCT CGTGCGAGGA CAACGACCTG ATGGGGGCCT ACGAGTACGT GGTGAACAAC CACGCCGCCG ACATCGTGTC GGTCTCGCTC GGCGGTCTGA TGCACACCAC CGGCTGGAAC CAGGACGCGG CGACCACCGC CGCCTTCGAC CGCATCTTCA TGAAGGGCGC CCTGGAGGGC ATCGGCTTCA ACTTCTCCAC CGGCGACTGC GGCGACGACA ACCCGGCGAA CGCCAACACC GGCTTCAACT GCTCCCCGGA CTCCGCCGGC AAGCAGACCG AGTACCCGGC CAGCTCGGCG TGGGTCACCG CGGTCGGCGG CACGGCGCTG AAGGTCGGCA CGGGCAACAG CTACCAGGGC GAGGAGCCCT GGGGCGACTA CCAGACCCCG AAGGCCAACG TGCCCGGGAA CTTCGCCAAC GGCAGCTTCA CCGGCGGTGC CGGCGGCGGG ACGTCCACGG ACATCACGCA GCCGTTCTAC CAGCGCAGCG CGGTCCCGGC CGCGCTGTCG CAGACCGCCC CGAACGGCGC GCACCTGAAG ACCCCGATGC GCACCGTCCC GGACGTGGCG ATGGACGCCG ACCCGTACAC CGGCATGGCG ACCTTCCGCA CCGCCGGCGG AAAGCCGGAC TGGAACCCGA TGGGCGGCAC CTCGCTGGCC GCCCCGCTGT TCGCCGGCGA GGCCGCGCTG CAGATGCAGG CGCACGGCGG CGTGGCTCCG GGCTTCGAGA ACCCGACCAT CTACGCCAAC GCGAACAAGT TCCGCGACGT GACCGCCAAC GGCGGGATGT ACACCCTGAT CCCCGAGGGC TGGAGCGGCA GCACGCTGAC CCAGGCCCAG GAAGAGATCA TGGGCGACGA CTCCTCGCTC AAGGCCGGTC CCGGCTATGA CGAGGCCACG GGCCTGGGCT CGCCGACCCT GGGCTACCTG CAGGCGCCGT ACGACGCCAA CCGCGTGGGC CGCCTCGCCG GCGGCGACCG CTACCAGACC GGCATCCAGA TCTCCCAGCA GCAGTTCCCG GCGGCCGGCA CGGCCAACAA CGTGGTGCTG GCGATCGGCA CCAACTTCCC GGACGCGCTG GCCGGCGCCC CGCTGGCCAA GAAGGTCGGC GGCCCGCTGC TGCTGACCCC GGGCAACACG GTCGACGCCC AGGTGGTCAA CGAGATCCAC CGGGTCCTCA AGCCCGGCGG CAAGGTCTAC GTCCTGGGCG GCACCGCGGC GATCACCCCG GCGGTGGTCA ACGGCCTGAA GCTGCCGGCG GCGCAGATCA CCCGCATCGG CGGCGTGGAC CGCTACGACA CCGCGATGCA GATCGCCAAG GCCATGGGCA ACCCGAGCCA CGTGGTCCTG GCCACCGGCG CGGGCTTCGC CGACGCCCTG GCGGCCGGCC CGTACGCCTC CACGGTGTTC GCCGACAACG GCAACCCGGC GGCGATCCTG CTCACCAACG ACAAGGTCAT GAACCCGGCC GTGGCGGCGT ACGCGCACGG CGCCAAGGCC GTCTCGGCCG TCGGCGCCCA GGCCGTCACC GCGGCGAAGA ACGCGCACAT CGCCAACCTG ACCAGCTTCG CCGGCTTCGA CCGCTACGAC ACCGCCGCGC AGGTCGCCAA CACCTTCCAC GGCGAGCACA TCGCCGGCGT GGCGACCGGC CTGAACTTCG CCGACGCCCT CACCGGCGCC GCGCAGCTCG GCGAGGCCGG CGGTCCCCTG GTCCTGACCA ACGTCACCAA CCTCCCCGCC TTCTCCGCCA ACGCCCTCCA CGGCATCGGC GCCTCCCTCG GCGGCGCGGG CCTGGTCGAG ATCTTCGGCG GCCCGGTGGC CATCAACCAG GCCACCGAGT ACGCCATCGC GGCCGCGGCC AACGCGATCG CGGAGGGGTG A
|
Protein sequence | MSTTAKKRSK ILAAGSIPMA TAVIAAVGMS SAQAQSNDTN NNGHTDSLGR VAVAGTQPAW ATPATEKAAV PATQTISTRV YLAGQNQAQM SALAQSVTDP ASPNYHKYLT PAQVQSEFGS TASQVKTVTD WAKGAGLQVK SVGESWVDLE GPASAVQTAF GTTLANFTAP DGQSHYAPAT AAMVPGNVAS AIVGVSGLYD APKIETPGAV TQTPSTTPAA GHPTSTTYDT SWCSTSWGSK SYYNIPAPIC GYDSAQLRGA YGVGSSGLTG KGATVAIIDA YDSPTMAADA AKYNSQFGQQ QFRAGQYREV TNTAAYNNQA ACDPDKWGLE QSLDVEAVHN MAPDANVVYV GAASCEDNDL MGAYEYVVNN HAADIVSVSL GGLMHTTGWN QDAATTAAFD RIFMKGALEG IGFNFSTGDC GDDNPANANT GFNCSPDSAG KQTEYPASSA WVTAVGGTAL KVGTGNSYQG EEPWGDYQTP KANVPGNFAN GSFTGGAGGG TSTDITQPFY QRSAVPAALS QTAPNGAHLK TPMRTVPDVA MDADPYTGMA TFRTAGGKPD WNPMGGTSLA APLFAGEAAL QMQAHGGVAP GFENPTIYAN ANKFRDVTAN GGMYTLIPEG WSGSTLTQAQ EEIMGDDSSL KAGPGYDEAT GLGSPTLGYL QAPYDANRVG RLAGGDRYQT GIQISQQQFP AAGTANNVVL AIGTNFPDAL AGAPLAKKVG GPLLLTPGNT VDAQVVNEIH RVLKPGGKVY VLGGTAAITP AVVNGLKLPA AQITRIGGVD RYDTAMQIAK AMGNPSHVVL ATGAGFADAL AAGPYASTVF ADNGNPAAIL LTNDKVMNPA VAAYAHGAKA VSAVGAQAVT AAKNAHIANL TSFAGFDRYD TAAQVANTFH GEHIAGVATG LNFADALTGA AQLGEAGGPL VLTNVTNLPA FSANALHGIG ASLGGAGLVE IFGGPVAINQ ATEYAIAAAA NAIAEG
|
| |