Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bpro_3676 |
Symbol | |
ID | 4013665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas sp. JS666 |
Kingdom | Bacteria |
Replicon accession | NC_007948 |
Strand | + |
Start bp | 3887277 |
End bp | 3888788 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637943331 |
Product | peptidase S1C, Do |
Protein accession | YP_550475 |
Protein GI | 91789523 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.769411 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATCTG CAGCATTGAA AAATACCCGA CTGGTCGTTG CCCTGCTGGC CGCTGGCGCC ATGGGTGGCG CCAGCGTCAG CGCACTGAAC GCCCTGCATG GCAGCGCCGT TGCCGCCCCT ACATCCGCAG CCGTTGCCAC TACCGGCAGC GTGGCCAGCA CGCCCGTGGC GCTACCCGAC TTTTCCCGCA TCACCGAGCG CCACGGCCCG GCAGTCGTCA ACATCAGCGT CACCGGCACC ACCAAGGTGT CCAGCGGATC ACCGCTGGCT CAGGGCGATG GCGATGATGA CGACGATGCT CTGGCCGGTG ATCCATTTTT CGAGTTCTTC CGCCGCTTCC AGCAAGGGCA GGGCCGACAG GGCCGCGGCG GCCAGCAGGA GGTCCCCACC CGGGGCCAGG GCTCCGGCTT CATCGTGAGC AGCGACGGCA TCATCCTGAC CAATGCACAC GTGGTGCGCG ATGCCCGCGA AGTCACGGTC AAGCTGACCG ACCGGCGCGA ATTCCGCGCC AAGGTGCTGG GCGCTGATCC GAGGACCGAC GTCGCCGTGC TGCGGATTGC GGCCAGCAAC CTGCCGGTCG TGACCCTGGG CAAAACCAGC GAACTGAAGG TCGGCGAGTG GGTGCTGGCG ATTGGCTCGC CCTTTGGTTT CGAAAACACC GTGACGGCCG GCGTCGTCAG CGCCAAGGGC CGGTCCCTGC CCGACGACAG CACCGTGCCT TTCATCCAGA CCGACGTCGC CATCAACCCC GGCAACTCGG GCGGCCCGCT GTTTAATGCC CGCGGCGAGG TGGTCGGCAT CAATTCGCAG ATCTACTCCC GCAGCGGCGG CTATCAAGGT GTGTCGTTTG CGATTCCGAT TGATATAGCG GCCAGAATCC AGAAGCAGAT CGTGGCAAAC GGCAAGGTGG AGCATGCGCG TCTAGGCGTC GCGGTGCAGG AAGTGAATCA GACCTTTGCC GACTCGTTCA AACTCGACAA ACCGGAAGGC GCCCTGGTGT CCACGGTTGA AAAAGGCAGC CCGGCCGAGA AGGCGGGCCT GCAGTCGGGC GACGTGGTTC GCAAGGTCAA TGGCCAACCC ATCGTCTCCT CGGGCGACCT GGCGGCCCTC ATTGGCCTGG CCGCCCCTGG CGACACGGTC AAGCTGGACG TCTGGCGTCA AGGTTCGGCC AAGGAAATCA CCGCACGCCT TGCCAGTGCA GACGAGAAGT CGGCCCAGGC GGCCGGCAAG AAAGACTCGC CCAGCCAGGG CAAGCTGGGC CTGGCCCTGC GCCCGCTTCA GCCCGACGAA AGGCAGGAGG CGGGCCTGGA CAGCGGCCTG GTGGTGCAGC AAGCCAGTGG TCCGGCGGCG CTGGCCGGCG TGCAGGCTGG CGACGTACTG ATCGCCATCA ATGGCACACC GGTCAGGAAT GTCGAGCAGG TGCGCAGCGT GGTCGCCAAA GCGGACAAAT CGGTGGCGCT GCTCATCCAG CGTGGCGACA GCAAGATTTT TGTGCCGGTG AACCTGGGCT GA
|
Protein sequence | MTSAALKNTR LVVALLAAGA MGGASVSALN ALHGSAVAAP TSAAVATTGS VASTPVALPD FSRITERHGP AVVNISVTGT TKVSSGSPLA QGDGDDDDDA LAGDPFFEFF RRFQQGQGRQ GRGGQQEVPT RGQGSGFIVS SDGIILTNAH VVRDAREVTV KLTDRREFRA KVLGADPRTD VAVLRIAASN LPVVTLGKTS ELKVGEWVLA IGSPFGFENT VTAGVVSAKG RSLPDDSTVP FIQTDVAINP GNSGGPLFNA RGEVVGINSQ IYSRSGGYQG VSFAIPIDIA ARIQKQIVAN GKVEHARLGV AVQEVNQTFA DSFKLDKPEG ALVSTVEKGS PAEKAGLQSG DVVRKVNGQP IVSSGDLAAL IGLAAPGDTV KLDVWRQGSA KEITARLASA DEKSAQAAGK KDSPSQGKLG LALRPLQPDE RQEAGLDSGL VVQQASGPAA LAGVQAGDVL IAINGTPVRN VEQVRSVVAK ADKSVALLIQ RGDSKIFVPV NLG
|
| |