Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5805 |
Symbol | |
ID | 8337166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 6707573 |
End bp | 6710569 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 644958909 |
Product | Peptidase S53 propeptide |
Protein accession | YP_003116504 |
Protein GI | 256394940 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAACG CCGCTCAGCG GCGGCGTGCG CAGGCCGTGA TGGCGGGGGT CTTCCCCGTC GCCATCGCGC TTGTCGCCGC GTCCGCGTCC GCCTCGCCCG CGCAGGCCGA CACGCAGCAC CACAGCACCA CGAACAAGAT CCTGACCGAC GCGCACCCGG CATGGGCGAC GGCCGACAAG GACCGGGGGG TCCTGCCGGC CGCGCAGCAG ATCAGCACCC GGGTGTACCT GACCGGTCAG GACCAGGCGG GGCTGGCCGC GCTGGCGCGC GCCGCGTCCG ACCCGAGCAG CCCGGACTAC CAGCACTACC TGACCCCGGC CCAGGTCCAG GCGCGCTTCG GTGCGACGCC GGCCCAGCTC GCCGCGGTGC AGAAGTGGCT GACCGGCGCC GGGCTGAAGG TCTCCGCGGT GCAGAGCGAC TGGATCGACG CGGTCGGTGA CTCCGCCGCC GTGCAGCGCG CCTTCGGCAC GCAGATCAAG GACTACCAGG GCACCGACGG CTCGGTGAAG TACGCCGCGT CCTCCTCCGC GGTCATCCCG GCCGCCGTCG CCGACTACGT CGCCGGCGTC TCGGGCCTGT CGCAGGCCGC CGTGCGCGTG CACGCCGATT CGGCGAAGGT CAACGCGGCC AACGCCGCGA ACCAGAGCTG CTCGCCGTCC TGGGGCGCGA ACACCTCCAC CGCGTGGCCG GCCGGCGTCA ACCCGGGCCC GACGCCCCTG CTGCCGTGCT CCTACACCCC GAAGCAGCTG CGCGACGCCT ACGGCGTGAC CAAGTCCGGC ATGACCGGCA AGGGCGCGAC CATCGCCGTC GTGGACTGGT TCGCCTCGGC CACCATGGAG GGCGACGCCA ACCAGTTCGC GGTCGCCCAC GGGGACAAGC CGTTCGCGCC AGGTCAGTAC TCTGAGATCA AGGACGCGTC TCAGTGGACC AACATCGACG CCTGCGGCGG CCAGGACAAC GTGGCCGGCG AGGAGTCGCT GGACGTGGAG ATGGCGCACG GCCTCGCGCC GGACGCCAAC GTCCTGTATG TCGGCGCGAA CTCCTGCACT GACGCGGACC TGATGTCCGC CGAGGAGAAC ATCGTCGATC ATCACCTCGC CGACGTGGTC TCCAACTCCT GGGGCGAGAT CATGCACACC ACCGACGGCG AGGACCTGGA CCCGTCGGAG ATAGCCGCCT ACGACCGCAT CTTCCAGAAG GGCGCGGCTG AGGGCATCGG CTTCGACTTC TCCTCCGGCG ACTGCGGCGA CGACGACCCG GCGAACTACG CCGGCGGCGG CGCCAACTGC GCCGGCGACT CGGCGCGCAA GCAGACCGAG TGGCCGACCA GCGACGCCTG GGTCACCTCG GTCGGCGGCA CCACGATGGC CACCAACGCG CAGGGCGGCT ACGCCTGGGA AGCCGCGATG GGCGACCATG TCGGCGTCGC CGCGCAGGGC GCCCCCAACT GGCAGCCGCC GGCCGGACGG GCGACCGTCC CGTTCTCGTT CTACTTCGGC GGCGGCGGCG GTACCTCCGA GGACATCGCG CAGCCCTTCT ACCAGGCCGG TATCGTGCCG AGCGCGCTGG CCAACGGCGG GCACGACAGC ACCCGCGCGA TGCGCACCGT CCCGGACGTG GCGATGAACG GCGCGCTGGC GACCTCGGTG CTGGTCGGCA TGACCAGCGG CGCCACCTAC AGTGAGGGCG GCTACGGCGG CACCTCGGTG GCCGCGCCGG AGTTCTCCGC GCTGCAGGCC GACGCCAAGC AGGCCGCGGG CCACGCGCTG GGCTTCGCGA ACCCGTCGCT GTACGCGCTG AACGGCGGCT CGGCGTTCCA CGACGTCACC GCGCACCCGG CGGGCCAGCC GCAGGTGATC GAGGGTATCC ACGTCTCGAC CGCCGACCCG ACCCGCGGCA CGATGTACCA CGCCGGCCAG GACACCTCGC TGGTCGCCGC CGCGGGTTAC GACGACGCCA CCGGCCTGGG CTCCCCGGCG GACGACTACC TGGCGAAGGT CGCCACGGTC ACGCCGCTGC AGCCGCCGGC GCCGCCGACC AACCCCGGCT CGCCGAACGC GCCGGTCGTC AAGCGGATCG CCGGCGGCGA CCGCTACGGC ACCGCGATCA GCGTCTCGCA GTCCTCCTTC CCGAAGGCGG GCTCGGCCTC GGCCGTGGTC CTGGCCACCG GCGAGACGTT CCCCGACGCG CTGTCCGGCG CGCCGCTGGC CACCAAGCTC GGCGGCCCGC TGCTGCTGAC CCCGTCGAAG ACCGTCGACC CGGCCGTGGT CGCCGAGATC CACCGGGTCC TGGCGCCCGG CGGCAAGGTC TACGTCCTGG GCGGCGTGAA CGCGGTCTCC GACAAGGTGG TCGCCGGTCT CGGCCTGCCC GGCGCGCAGG TCTCCCGGGT CTCCGGCTCG GACCGCTTCG CCACCTCGCT GGCGATCGCC GAGCAGCTGG GCAACCCGAC CGGCAACGTG ATCCTGGCGA CCGGCGACGA CTTCGCCGAC GCCCTGACCG CCGCGCCGTT CTCGGCGGTC TACGGCGGTC CCACCGGCGG CCCGGCGGCG ATCCTGCTGA CCGACAACCG CAAGCTGCCG CCGGCGGTGG CGAGCTACGT GGCCGGCGCG CACGCGGTGG CGGCGGTCGG CGTCCAGGCC ACCGTGGCCG ACGCCGGTCT GAAGAACCGC GACGCCAGCG CCCAGTTCGG CGGCACGGAC CGCTTCGCGA CCGGTGCGAT GGTGGCCGGG CGGTTCAGCG CGCCGAAGAC CGTCGGCGTG GCCACCGGCA CCCAGTTCGC CGACGCCCTG ACCGGCGCGG CGATGCTGGC CGCGGCGCAC AGCCCGCTGC TGCTCACCCA GCCGACGGCG CTGCCGGCCA GCACCGCGGC GGTGCTGCAC GGCTTCAGCC AGGCGCTGGC GGGCGGCTCG ATCGAGCTGT TCGGCGGCCG GGTGGCGGTC TCCGACGGCG TCGAGCAGCA GGTCGCCAAG GCGGTCGGCG GTCGCGTCGA GTCGTAA
|
Protein sequence | MPNAAQRRRA QAVMAGVFPV AIALVAASAS ASPAQADTQH HSTTNKILTD AHPAWATADK DRGVLPAAQQ ISTRVYLTGQ DQAGLAALAR AASDPSSPDY QHYLTPAQVQ ARFGATPAQL AAVQKWLTGA GLKVSAVQSD WIDAVGDSAA VQRAFGTQIK DYQGTDGSVK YAASSSAVIP AAVADYVAGV SGLSQAAVRV HADSAKVNAA NAANQSCSPS WGANTSTAWP AGVNPGPTPL LPCSYTPKQL RDAYGVTKSG MTGKGATIAV VDWFASATME GDANQFAVAH GDKPFAPGQY SEIKDASQWT NIDACGGQDN VAGEESLDVE MAHGLAPDAN VLYVGANSCT DADLMSAEEN IVDHHLADVV SNSWGEIMHT TDGEDLDPSE IAAYDRIFQK GAAEGIGFDF SSGDCGDDDP ANYAGGGANC AGDSARKQTE WPTSDAWVTS VGGTTMATNA QGGYAWEAAM GDHVGVAAQG APNWQPPAGR ATVPFSFYFG GGGGTSEDIA QPFYQAGIVP SALANGGHDS TRAMRTVPDV AMNGALATSV LVGMTSGATY SEGGYGGTSV AAPEFSALQA DAKQAAGHAL GFANPSLYAL NGGSAFHDVT AHPAGQPQVI EGIHVSTADP TRGTMYHAGQ DTSLVAAAGY DDATGLGSPA DDYLAKVATV TPLQPPAPPT NPGSPNAPVV KRIAGGDRYG TAISVSQSSF PKAGSASAVV LATGETFPDA LSGAPLATKL GGPLLLTPSK TVDPAVVAEI HRVLAPGGKV YVLGGVNAVS DKVVAGLGLP GAQVSRVSGS DRFATSLAIA EQLGNPTGNV ILATGDDFAD ALTAAPFSAV YGGPTGGPAA ILLTDNRKLP PAVASYVAGA HAVAAVGVQA TVADAGLKNR DASAQFGGTD RFATGAMVAG RFSAPKTVGV ATGTQFADAL TGAAMLAAAH SPLLLTQPTA LPASTAAVLH GFSQALAGGS IELFGGRVAV SDGVEQQVAK AVGGRVES
|
| |