Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_7403 |
Symbol | |
ID | 8338773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 8587360 |
End bp | 8589216 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644960483 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_003118070 |
Protein GI | 256396506 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAAC CCTCGAGCAA CGCAGGCGGC GTCGGCGGCC TGGGCGACAA CGGCGCCAAC GAAGCGGCGC ACCAGGACGT GAGCGACAGC TTCGGCGCGA ACCCCGCCGC GTCCGGCGGC TCGGGGGCTC AGGACCCCCA GGCGGTGCCG CCGGTCCAGC AGGCCCAGCA CGACGCAGTC CCCCCGACCG CCGCGCCTCA GCCCGCGCCC CAGCCGGTGT CGCAGCCCGC CCCGCAGGCG CCCGCCGACC AGAACCCGAC CCTGATCCAG CCGACTGCCA GCGCCCCGGA CCAGACGCTG ATCCAGCCCG CGGTCCCGGC CACGCCGGAC CCGGCCGCCG GGGCATACCA GGCGCAGTCC GGTCCGGCCG AACCCGGCGC GGGCCAGCAG CTGCCCCCGA CGCAGCCCGG CGCCGCCTTC GGCCCGATGG GCCAGGCGCA CGAGGCGCAG CAGGCCCAGC AGGCCGGCGC GGGCAACCAC CTGGGCCCGG GCCCCACTCC CCCGCCGAAC TGGGCCGCCC CGTCCGACGG CACGGCCTTC CCGCCCTACA CGCCCCCCGG CGCGGTCGCC ACCACCTCGC AGCGCTCCTC GCTCGGCGGC GGCAAGATGC TCCTCGCCGT GGCCCTGATC GCCGGTCTGA TCGGCGGCGG CATCGGCACC GCGGTCACCT ACGCGGCCAA GGACAACAGC AGCTCCTCCT CGACGGCCGC GTCCAGCACG CGCTCGCCGC TGAACACCAA CAACACCGCG CTGAGCACGC CCGGCAGCGT CACCCAGGTC TCCTCCTCCG TCATGCCGAG CGTGGTGGAC ATCCAGGTCA CCACCGCCAA CGGCAGCGGG GACGAGGGCA CCGGCATCAT CTACAGCTCC GACGGCCTGA TCGTCACCAA CAACCACGTG GTCGCCGCGG CCAATGCTTC CAGCCAGAGC AACTCCAACG GCAACTCCAA CGGGAACGGC AACACCAACC CGTTCGGTGG GAGCAGTGGC GGGAGCAACG GCGGAAGCAA TGGCGGGAGC AACGGCGGGA GCAACGGAAA CAGCAACTCC ACCAGCGGCC CGGCGACGAT CACCGTCACC TTCAACGACG GCCGCACCGC CAGCGCGCAC ATCGTCGGCA CCGAGCCGCT GGCCGACCTC GCGGTGATCA AGGTCGACGG CGTGACCGGG CTGACGAAGG CCTCCTTCGC CGACTCCAAG AACCTGGCGG TCGGCCAGCA GGTGGTGGCC ATCGGCTCGC CGCTCGGCCT GACCAGCACC GTCACCTCCG GCATCGTCAG CGCCCTGAAC CGGCCGGTGG AGACCCAGGC CGAGGACGGC AGCACCGTGG TCCTGGACGC CGTCCAGACC GACGCCGCGA TCAACCCCGG CAACTCCGGC GGCCCGCTGG TGGACATGCA GGGCAACGTG ATCGGCATCA ACTCCGCGAT CGCCTCCAAC AGCCAGAACT CCGGCGGTCT GGGCGGCAGC AGCGGACAGG CCGGCTCCAT CGGCCTGGGC TTCGCGATCC CGATCTCCGA GGCGCTGCCC ATCGTGGACG CCCTGGCCCA GGGCAAGCCC GCGCAGATCG CCTCGCTGGG CGTCGCGCAG TCCGGCCAGA GCGACACCAC CACCCGCACC GCGAGCGGCT ACAAGGTGGA CCAGGTGTCC GCCGGCGGCC CGGCCGACAA GGCGGGCCTG AAGTCCGGCG ACGTGATAAC CAAGATCGGC GACCGGCTCG TGTACTCCTA CCAGGACGTG GCAGCCGCGG TGCGCTCGCA CCGGCCCGGC GACGTCATCC CGATCACCTA CACCCGCGGC GGCTCCTCCG CGAAGGCCAA CGTCACCCTC GGAGTGCTGC CCGCGCAGAC CCCGTAA
|
Protein sequence | MNEPSSNAGG VGGLGDNGAN EAAHQDVSDS FGANPAASGG SGAQDPQAVP PVQQAQHDAV PPTAAPQPAP QPVSQPAPQA PADQNPTLIQ PTASAPDQTL IQPAVPATPD PAAGAYQAQS GPAEPGAGQQ LPPTQPGAAF GPMGQAHEAQ QAQQAGAGNH LGPGPTPPPN WAAPSDGTAF PPYTPPGAVA TTSQRSSLGG GKMLLAVALI AGLIGGGIGT AVTYAAKDNS SSSSTAASST RSPLNTNNTA LSTPGSVTQV SSSVMPSVVD IQVTTANGSG DEGTGIIYSS DGLIVTNNHV VAAANASSQS NSNGNSNGNG NTNPFGGSSG GSNGGSNGGS NGGSNGNSNS TSGPATITVT FNDGRTASAH IVGTEPLADL AVIKVDGVTG LTKASFADSK NLAVGQQVVA IGSPLGLTST VTSGIVSALN RPVETQAEDG STVVLDAVQT DAAINPGNSG GPLVDMQGNV IGINSAIASN SQNSGGLGGS SGQAGSIGLG FAIPISEALP IVDALAQGKP AQIASLGVAQ SGQSDTTTRT ASGYKVDQVS AGGPADKAGL KSGDVITKIG DRLVYSYQDV AAAVRSHRPG DVIPITYTRG GSSAKANVTL GVLPAQTP
|
| |