Gene Pnap_3107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_3107 
Symbol 
ID4687541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp3290336 
End bp3291850 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content68% 
IMG OID639836120 
Productprotease Do 
Protein accessionYP_983327 
Protein GI121605998 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.202807 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCCA CTGCTTTGAG CAATACCCGG CTGATCGCGG CGCTGCTGGC CGCAGGCGCC 
ATTGGCGGGG CCGGAGTCGG CGCATTCAAC ATGGCGCACA CACCGGCCAT CGCAGCCCTG
TCAGCCCCGG CCGTCAGCAC GGCCGCCACG CCCATGGCCC TGCCCGACTT CTCGCAGATC
ACCGAACGCT ACGGCCCGTC GGTGGTCAAC ATCAGCGTGA CCGGCAGCAC CAAGGTTTCC
AACGACTCCC CGCTGGCGCA GGGCGGCGGT GACGATGAAG AAGGCGACCC GCTGCTGGGC
AGCCCGCTGG GCGAATTGTT CCGCCGCTTC CAGCAGGGCC AGGGCCAGCG TCCCGGCGCG
GGCCGGGGCG GAGCGCCCGA GGAAATGCCG ACGCGCGGCC AGGGTTCGGG CTTCATCGTC
AGCGGCGACG GCATCATCCT GACCAATGCC CATGTGGTGC GCGGCGCCAA GGAAGTCACC
GTCAAGCTGA CCGACCGGCG TGAATTCCGC GCCAAGGTGC TGGGCGCCGA CGCCAGGACC
GACATCGCCG TGCTGAAGAT CGCCGCCAGC AACCTGCCGG TCGCCACGCT GGGCAAGACC
AGCGACCTCA AGGTCGGCGA ATGGGTGCTG GCCATCGGCT CACCCTTCGG TTTCGAAAAC
ACGGTGACCG CCGGCGTCGT CAGCGCCAAG GGCCGCTCGC TGCCCGACGA CAGCGCCGTG
CCCTTCATCC AGACCGATGT CGCCATCAAC CCCGGCAACT CGGGCGGCCC GCTGTTCAAC
GCGCGCGGCG AAGTGGTCGG CATCAACTCG CAAATCTACA GCCGCAGCGG CGGCTACCAG
GGCGTGTCGT TCGCCATCCC GATTGACGTG GCGACCAAGA TCAAGAACCA GATCGTCGCC
ACCGGCAAGG TCGAGCATGC GCGGCTGGGC GTGTCGGTGC AGGAAGTCAA CCAGGCGTTC
GCCGATTCCT TCAAGCTCGA CAAGCCCGAA GGCGCGCTGG TGTCAATGGT TGAAAAAGGC
AGCCCGGCCG ACAAGGCGGG CCTGCAGCCG GGCGACGTGA TCCGCCAAGT CAACGGCCAG
CCGATTGTCT CGTCGGGCGA CCTGCCGGCG GTGATTGGCC TGGCGGCGCC GGGCGACAGC
ATCAAGCTCG ATGTCTGGCG CCAGGGCGCG GCCAAGGAAA TCACGGCGCG GCTGGCCAAT
GCCGACGACA AGGCCGCGCA GGTCGCCAGC AAGAAAGAGG CGCCCGGCCA GGGCAAGCTG
GGCCTCGCGC TGCGTCCGCT GCAGCCCGAC GAACAGCAGG AAGCTGGCAT TGACAGCGGC
CTGCTGGTCC AGCAGGCCAG CGGCCCGGCG GCGCTGGCCG GCGTGCAGGC GGGCGACGTG
CTGCTGTCGA TCAACGGCGT TCCCGTGAAA AGCATCGACC AGGTGCGCGC CACCGTGGCC
AAGTCGCAAA AATCAGTGGC GCTGCTGATT TTGCGCGGCG ACACCCGGAT TTTCGTGCCG
GTGAATTTGG GCTGA
 
Protein sequence
MKSTALSNTR LIAALLAAGA IGGAGVGAFN MAHTPAIAAL SAPAVSTAAT PMALPDFSQI 
TERYGPSVVN ISVTGSTKVS NDSPLAQGGG DDEEGDPLLG SPLGELFRRF QQGQGQRPGA
GRGGAPEEMP TRGQGSGFIV SGDGIILTNA HVVRGAKEVT VKLTDRREFR AKVLGADART
DIAVLKIAAS NLPVATLGKT SDLKVGEWVL AIGSPFGFEN TVTAGVVSAK GRSLPDDSAV
PFIQTDVAIN PGNSGGPLFN ARGEVVGINS QIYSRSGGYQ GVSFAIPIDV ATKIKNQIVA
TGKVEHARLG VSVQEVNQAF ADSFKLDKPE GALVSMVEKG SPADKAGLQP GDVIRQVNGQ
PIVSSGDLPA VIGLAAPGDS IKLDVWRQGA AKEITARLAN ADDKAAQVAS KKEAPGQGKL
GLALRPLQPD EQQEAGIDSG LLVQQASGPA ALAGVQAGDV LLSINGVPVK SIDQVRATVA
KSQKSVALLI LRGDTRIFVP VNLG