Gene Bpro_3676 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_3676 
Symbol 
ID4013665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp3887277 
End bp3888788 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content66% 
IMG OID637943331 
Productpeptidase S1C, Do 
Protein accessionYP_550475 
Protein GI91789523 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.769411 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCTG CAGCATTGAA AAATACCCGA CTGGTCGTTG CCCTGCTGGC CGCTGGCGCC 
ATGGGTGGCG CCAGCGTCAG CGCACTGAAC GCCCTGCATG GCAGCGCCGT TGCCGCCCCT
ACATCCGCAG CCGTTGCCAC TACCGGCAGC GTGGCCAGCA CGCCCGTGGC GCTACCCGAC
TTTTCCCGCA TCACCGAGCG CCACGGCCCG GCAGTCGTCA ACATCAGCGT CACCGGCACC
ACCAAGGTGT CCAGCGGATC ACCGCTGGCT CAGGGCGATG GCGATGATGA CGACGATGCT
CTGGCCGGTG ATCCATTTTT CGAGTTCTTC CGCCGCTTCC AGCAAGGGCA GGGCCGACAG
GGCCGCGGCG GCCAGCAGGA GGTCCCCACC CGGGGCCAGG GCTCCGGCTT CATCGTGAGC
AGCGACGGCA TCATCCTGAC CAATGCACAC GTGGTGCGCG ATGCCCGCGA AGTCACGGTC
AAGCTGACCG ACCGGCGCGA ATTCCGCGCC AAGGTGCTGG GCGCTGATCC GAGGACCGAC
GTCGCCGTGC TGCGGATTGC GGCCAGCAAC CTGCCGGTCG TGACCCTGGG CAAAACCAGC
GAACTGAAGG TCGGCGAGTG GGTGCTGGCG ATTGGCTCGC CCTTTGGTTT CGAAAACACC
GTGACGGCCG GCGTCGTCAG CGCCAAGGGC CGGTCCCTGC CCGACGACAG CACCGTGCCT
TTCATCCAGA CCGACGTCGC CATCAACCCC GGCAACTCGG GCGGCCCGCT GTTTAATGCC
CGCGGCGAGG TGGTCGGCAT CAATTCGCAG ATCTACTCCC GCAGCGGCGG CTATCAAGGT
GTGTCGTTTG CGATTCCGAT TGATATAGCG GCCAGAATCC AGAAGCAGAT CGTGGCAAAC
GGCAAGGTGG AGCATGCGCG TCTAGGCGTC GCGGTGCAGG AAGTGAATCA GACCTTTGCC
GACTCGTTCA AACTCGACAA ACCGGAAGGC GCCCTGGTGT CCACGGTTGA AAAAGGCAGC
CCGGCCGAGA AGGCGGGCCT GCAGTCGGGC GACGTGGTTC GCAAGGTCAA TGGCCAACCC
ATCGTCTCCT CGGGCGACCT GGCGGCCCTC ATTGGCCTGG CCGCCCCTGG CGACACGGTC
AAGCTGGACG TCTGGCGTCA AGGTTCGGCC AAGGAAATCA CCGCACGCCT TGCCAGTGCA
GACGAGAAGT CGGCCCAGGC GGCCGGCAAG AAAGACTCGC CCAGCCAGGG CAAGCTGGGC
CTGGCCCTGC GCCCGCTTCA GCCCGACGAA AGGCAGGAGG CGGGCCTGGA CAGCGGCCTG
GTGGTGCAGC AAGCCAGTGG TCCGGCGGCG CTGGCCGGCG TGCAGGCTGG CGACGTACTG
ATCGCCATCA ATGGCACACC GGTCAGGAAT GTCGAGCAGG TGCGCAGCGT GGTCGCCAAA
GCGGACAAAT CGGTGGCGCT GCTCATCCAG CGTGGCGACA GCAAGATTTT TGTGCCGGTG
AACCTGGGCT GA
 
Protein sequence
MTSAALKNTR LVVALLAAGA MGGASVSALN ALHGSAVAAP TSAAVATTGS VASTPVALPD 
FSRITERHGP AVVNISVTGT TKVSSGSPLA QGDGDDDDDA LAGDPFFEFF RRFQQGQGRQ
GRGGQQEVPT RGQGSGFIVS SDGIILTNAH VVRDAREVTV KLTDRREFRA KVLGADPRTD
VAVLRIAASN LPVVTLGKTS ELKVGEWVLA IGSPFGFENT VTAGVVSAKG RSLPDDSTVP
FIQTDVAINP GNSGGPLFNA RGEVVGINSQ IYSRSGGYQG VSFAIPIDIA ARIQKQIVAN
GKVEHARLGV AVQEVNQTFA DSFKLDKPEG ALVSTVEKGS PAEKAGLQSG DVVRKVNGQP
IVSSGDLAAL IGLAAPGDTV KLDVWRQGSA KEITARLASA DEKSAQAAGK KDSPSQGKLG
LALRPLQPDE RQEAGLDSGL VVQQASGPAA LAGVQAGDVL IAINGTPVRN VEQVRSVVAK
ADKSVALLIQ RGDSKIFVPV NLG