Gene Pnap_3065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_3065 
Symbol 
ID4686807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp3238292 
End bp3239767 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content63% 
IMG OID639836078 
Productprotease Do 
Protein accessionYP_983285 
Protein GI121605956 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGAC TGATGAAGAT GCTTGAATTG AACCTGAAAC CCCTGCGCCC TTACCTGACC 
GCAGGCCTGA TGGCGATTGC GGCCACCACT GCCGTCCTGC CGGTGACGCC GGTGTGGGCG
CAGACCCGCA CGCTGCCTGA CTTTACCGAT CTGGTGGACC AGGTGGGGCC GTCGGTGGTC
AACATCCGGA CCCTGGAAAA AGTCAAGGCA TCGGCCGCAG GAAATATTGA CGAGCAGATG
CTGGAGTTTT TCAAGCGCTT CGGCATTCCC GTGCCGCCCA ACACACCGCG CGCGCCGCGC
CCTGATCCGA GCCAGCCGGA CGAAGACCAG CCGCGTGGCG TGGGCTCGGG ATTCATCCTC
ACGACGGACG GCTTTGTCAT GACCAATGCC CATGTGGTCG AAGGTGCGGA TGAAGTCCTC
GTCACCCTGA CGGACAAGCG CGAATTCAAG GCCAGGATCA TTGGCGCCGA CAAGCGCAGC
GATGTGGCTG TGGTCAAGAT CGAGGCAACC GGCCTGCCGG CCGTCAAGAT TGGCGACCTG
GGCCGTCTGC GGGTGGGCGA GTGGGTGATG GCCATCGGTT CGCCGTTCGG GCTTGAAAAC
ACGGTGACGG CCGGCATCGT GAGCGCCAAG CAGCGTGACA CCGGCGACTA TCTGCCTTTT
ATCCAGACCG ATGTGGCCAT CAATCCCGGC AACTCGGGCG GCCCGCTGAT CAACATGCGC
GGCGAGGTCG TCGGCATCAA CAGCCAGATC TATTCACGTT CCGGCGGCTT CCAGGGTATT
TCGTTCTCCA TCCCGATTGA CGAGGCGATG CGCGTGTCGG AACAACTGCG CATCAGCGGC
AAGGTGACGC GCGGTCGCAT CGGCGTGCAG ATTGACCAGG TGACCAAGGA CGTGGCCGAA
TCCATCGGCC TGGGCAAGGC GCAGGGCGCG CTCGTCAGGG GCGTGGAGAG TGACGCCCCT
GCCGAGAAAG CCGGCATCGA AGCGGGCGAC ATCATCACCA AGTTTGAAGG CCGGCCGATT
GACAAGGCCA GCGACCTTCC GCGCATGGTC GGCAATGTCA AGCCGGGCAC CAAGGTGACA
GTGACCGTGT TCCGGCGCGG CGCCACCAAA GACCTGTCAG TCACCATTGC CGAAGTCGAG
GCCGACAAGC CTGCCCGCCC GGCTGCCAAG TCCGAATCCA AGCCGCCTGT GGCCGGTCCC
GCGCAGGCAT TGGGCCTGGC GGTGAGCGAG ATCACGGATG CACAGAAAAA GGAACTCAAT
GTCAAGGGCG GCGTCAAGGT CGATACGGTC GATGGCGCGG CCGCAAGAGC GGGACTGCGC
GAAGGCGATG TGATTGTGTC GATTGCCAAC ACGGAGGTGA CCGGCGTCAA GGGATTCGAG
GCGGCGCTGG CAAAAATTGA CAAGTCCAAA AACATCACCG TGCTGGTCCG GCGCGGTGAA
CTGGCGCAAT TTGTCATCAT CAAGCCGGCG CGTTGA
 
Protein sequence
MMRLMKMLEL NLKPLRPYLT AGLMAIAATT AVLPVTPVWA QTRTLPDFTD LVDQVGPSVV 
NIRTLEKVKA SAAGNIDEQM LEFFKRFGIP VPPNTPRAPR PDPSQPDEDQ PRGVGSGFIL
TTDGFVMTNA HVVEGADEVL VTLTDKREFK ARIIGADKRS DVAVVKIEAT GLPAVKIGDL
GRLRVGEWVM AIGSPFGLEN TVTAGIVSAK QRDTGDYLPF IQTDVAINPG NSGGPLINMR
GEVVGINSQI YSRSGGFQGI SFSIPIDEAM RVSEQLRISG KVTRGRIGVQ IDQVTKDVAE
SIGLGKAQGA LVRGVESDAP AEKAGIEAGD IITKFEGRPI DKASDLPRMV GNVKPGTKVT
VTVFRRGATK DLSVTIAEVE ADKPARPAAK SESKPPVAGP AQALGLAVSE ITDAQKKELN
VKGGVKVDTV DGAAARAGLR EGDVIVSIAN TEVTGVKGFE AALAKIDKSK NITVLVRRGE
LAQFVIIKPA R