Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_6409 |
Symbol | |
ID | 8337772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 7390062 |
End bp | 7393145 |
Gene Length | 3084 bp |
Protein Length | 1027 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644959510 |
Product | Peptidase S53 propeptide |
Protein accession | YP_003117104 |
Protein GI | 256395540 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.276013 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGTTC GCCGACCCCG GTGGCGTCGC GCGGCCGGGA TCGCCGCCGC GATCGCCGTG TCCGCGACGG CGGTCCCGGC CGCCCAGGCA GCAGCCGGAT CGGCGCCGCC GACGGCACCG GTCAGGATCG GCGCCGCGCC CCGTCTGCCG GCCCGCACCG TCCCCGCTGC GACGCCCGCC GACTCCCAGC CGCTGCACCT GCGGATCGGG CTGGCGCCGC GCGATCCGGC CGGACTGGCC GCTTTCGTGG CCGCGGTGAG CGACCCGAAG TCGCGCCAGT ATAGGCACTA CCTCGCGCCC GGCGAGTTCG GCAGCCGCTT CGGCGCGACA CCGGCCGCCC TCGCCGCGGT CGGCGCCGAG CTGCGCTCGA TCGGGCTCTC CCCCGGCGCG ACCGACGGCA GCGGCGAATC CATCGCCGTG GACACCACCG TCGGCGCCGC GAAATCCATC CTGCACACCG GATTCTCCGG CTACACCACC GCGGACGGCC GCCACGCCTT CGCGAACACC GCCGCCCCGG CGCTGCCGCC GGCCCTCGCT TCGGCGATCA CCGGCATCAC CGGCCTGGAC GATCTGACGG CGCCGCGTCC GCAGTTCGTC AGCTCGAACC TCGGGAGCCA AGCGGCGCTC ACCCCGCAGG CGCCCTCACC GGATTTCGCG GCACCCGGGC TGTGTCCTCA GGAAACCAGC ATCCTGGCAG GCCAGGGCAA GACCGACGGC CAGCAGTACT ACGACCCCGC AGACCTCAAC GCCATCTACG GCAACCGCGA CCAGTACATC GTCGGCGACT ACGGCGAGGG CGTCACCGTC GCGCTGCCGG AGTTCGAGGG CTTCACCCCG GCGGCGATCG CGCAGTACCA GAACTGCGTC ACCCCGCAGA ACACCAACGC CGGCACGCCC GGCGGGCTGA CGACGATCGC CGTGGACGGC GGGCCCACCA CGGCGGCGCC GACCGCGGCG AACCAGGTCG GCATGGAGAC CGCGCTGGAC GTCGAGACGA TCAACGGCAC CGCGCCCGGC GCGGCCGTCC GGGATTATGA GGGCCCTGAT ACGGCCGTCG GCGCCCTGGA CACCTACCAG CGGATCGTCA CCGACGACCA GGCCCAGGTC ATCGCGATCA GCTGGAGCGT CTGCGAGCTC GACGCCGCCC CGGCGACCCT CGCCGCCGAG AACACGATCT TCCAGCAGGC CGCCGCCCAG GGACAGTCGG TGCTGGCCGC GACCGGGGAC AACGGCTCGA CGGCGTGCCC GGCGAGCAGT CCGCACGCCG CCACCCCGGC GGTGTCCGAC CCCGCCTCGC AGCCCTCGGT CACCGCGGTC GGCGGCACCA CGATGAGCGG TGGCGGCGGC GCCGGGCTGA CCACCTGGAA CACTTCCGGC GGCGCGACGG GCGGCGGCGT GTCCACCGTG TGGAAGCAGG ATGCGATCGG CGCCGCGTAT CAGGCCGGCC ACACCGGGCC CGGGTACGCC GACGCCTGCG GCGCGCCCGC CGGGACGACC TGCCGCCAGG TCCCCGACGT CGCCGCGCAC GCCGGCGCCG ACGGGCTGAT CGTCGAGTAC TACGCGACCG GCACGGCCGG GAGCTGGGCC ATCGTCGGCG GCACGAGCCT GGCGACCCAG CTGTGGGCCG GTATCACAGC GCTCGCCGAC AGCAGCGACG ACTGCGCGGC CACCGGCCCG ATCGGCCCGC TGAACCCCGC GCTCTACCAC GCCGCGGCCG ACGGCTCCGG ATCGTTCACC GACGTCACCA CCGGCAGCAA CGCCAGACCC GCATCCGGCT ACACCGGCGC GCTTTACACC GCCGGTCCCG GCTATGACCT CACCACCGGA CTCGGCGCCC CGCTGACCGA CGGCCTCGTG CCGGACCTGT GCGGCTACCC CGCCCCGGCT GCCGGCAGCA CGTATCACCC GGTGAGCCCG GCACGGATCC TGGACACCCG CAACGGCACC GGAGCCGGCG GCAGGATCGC GCAGGTCCCG GCCGGCGGAA CCGTCACGCT GGCCGTCACC GGCGCGCACG GCGTGCCGGC GAGCGGGGTC ACCGGCACGG CGCTGAACCT CACCGCGGTC AGCAGCTTGG GCGGGTACCT TTCGGTCTCC GCGCACGGCA CCGCGCGTCC GCTGTCCTCC ACCGTCAACT ACGGGGCGAA CCGGCCGGCC TCCAACTTCG TCACCGTCGC GGTCCCCGCC GGCGGCCAGA TCGACATCAC CAACCACGGG ACCGCCGGCG CCGACGTCAT CGCCGACCTG TCGGGGTACT ACTCCGCGAC CCTGGCCGGC GGTTCCACCT ACACCGCGGT CAACCCGTTC CGGATCCTCG ACACGCGCGT GCCGAACGGT GTCCCGGCCA AGGCTCCGGT GGCGGCACAC GGGACGCTGG CGTTGCAGAT CAGCGGCACG CACGGCATAC CGGCGACCGG CGTCACCGCG GTGGCGCTGA ACATCCAGGT CGCCGACGAC GCCTCGGGCG GCGACCTGAT CGCCTACGCC GACGGGACGA CGCAGCCGTT GGCCTCGAAC GACAACTGGG CCGCGGGTCA GACCGTGTCG GACTTCGCCC TTGTACCGGT CGGCGCCGAC GGCAAGATCG ACCTCTACAA CAGCAGCCCC GGCTCGGCCA ACGTCATCGC CGACTTCGCG GGCTACTCCA CGACCGCGGC CACCGGACTG AAGTTCCATC CCATCCGGGC GGCCCGGCTG CTCGACACCC GTGACGGCAC CGGCGTCAAC AGCACCGCCG GGCAGCCCTA CCAGATCCCG GCGGGCGGCA GCTTCACCGT GAACCTGGAC CCGGTCGCGC TTCTGCTGGG CGGCAATCCG CACAACGCCG TCGCGACCGC TCCGGCCGTC GCCCTGAACC TCACCGTCGC CTCACCGAGC AGCGGCGGCT ACATCACGGT CTATCCCGAC GGCCAGAGCC TGCCCGCCGT GGTCCCGGCC GCCGTCGACT TCACCGCCGG CCAGACCACC GCCGACGCCG CGATACTCCC GGTCGGCGCC GACGGCGGAC TGACTGTCAC CAACCACAGC GGCGGAACGA TCCAATTGGT CATCGATCTC ATCGGCTACT ACGGCACGAC CTGA
|
Protein sequence | MHVRRPRWRR AAGIAAAIAV SATAVPAAQA AAGSAPPTAP VRIGAAPRLP ARTVPAATPA DSQPLHLRIG LAPRDPAGLA AFVAAVSDPK SRQYRHYLAP GEFGSRFGAT PAALAAVGAE LRSIGLSPGA TDGSGESIAV DTTVGAAKSI LHTGFSGYTT ADGRHAFANT AAPALPPALA SAITGITGLD DLTAPRPQFV SSNLGSQAAL TPQAPSPDFA APGLCPQETS ILAGQGKTDG QQYYDPADLN AIYGNRDQYI VGDYGEGVTV ALPEFEGFTP AAIAQYQNCV TPQNTNAGTP GGLTTIAVDG GPTTAAPTAA NQVGMETALD VETINGTAPG AAVRDYEGPD TAVGALDTYQ RIVTDDQAQV IAISWSVCEL DAAPATLAAE NTIFQQAAAQ GQSVLAATGD NGSTACPASS PHAATPAVSD PASQPSVTAV GGTTMSGGGG AGLTTWNTSG GATGGGVSTV WKQDAIGAAY QAGHTGPGYA DACGAPAGTT CRQVPDVAAH AGADGLIVEY YATGTAGSWA IVGGTSLATQ LWAGITALAD SSDDCAATGP IGPLNPALYH AAADGSGSFT DVTTGSNARP ASGYTGALYT AGPGYDLTTG LGAPLTDGLV PDLCGYPAPA AGSTYHPVSP ARILDTRNGT GAGGRIAQVP AGGTVTLAVT GAHGVPASGV TGTALNLTAV SSLGGYLSVS AHGTARPLSS TVNYGANRPA SNFVTVAVPA GGQIDITNHG TAGADVIADL SGYYSATLAG GSTYTAVNPF RILDTRVPNG VPAKAPVAAH GTLALQISGT HGIPATGVTA VALNIQVADD ASGGDLIAYA DGTTQPLASN DNWAAGQTVS DFALVPVGAD GKIDLYNSSP GSANVIADFA GYSTTAATGL KFHPIRAARL LDTRDGTGVN STAGQPYQIP AGGSFTVNLD PVALLLGGNP HNAVATAPAV ALNLTVASPS SGGYITVYPD GQSLPAVVPA AVDFTAGQTT ADAAILPVGA DGGLTVTNHS GGTIQLVIDL IGYYGTT
|
| |