Gene Pnap_3919 details

Gene Information

Locus tag: Pnap_3919
Symbol:
ID: 4689646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 4181999
End bp: 4183648
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 70%
IMG OID: 639836937
Product: peptidase M48, Ste24p
Protein accession: YP_984136
Protein GI: 121606807
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
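
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function and constant names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 4181999
END_BP = 4183648
GENE_LENGTH_BP = 1650
PROTEIN_LENGTH_AA = 549


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # 1650 549
```

Under these assumptions the computed values agree with the listed gene length (1650 bp) and protein length (549 aa).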


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.749041
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000945115
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGACTTCTT CGAACCGCTC TGACAACACC CCGCCACAGC CGCGCCATGC CTGGCGCCGG 
CTGGTGCTGG TGCTGGCGCT GGCGCTGTGC CAGCTTGCGC CGCCCCTCTC GCTGGCCCAG
ACGGCGGGCG CCGCGCCTGG CCGGGCAACC GGCTCGCTGC CGTCGCTGGG CGACAACTCC
GAGCTGTCCG CCGCCGCCGA GCGCCGCATC GGCGACCGCA TTGCCGTCAG CATTTACCGC
GACCCCGACT ATGTCGATGA CCCGGTGCTG GTGGACTACC TGCAGGGCAT CTGGCAGCCG
CTGATGGCTG CCGCGCGCGC GCGGGGCGAG CTGCCGGCGG AACTCGACGA GCGCTTTGCC
TGGGAACTGT TCCTGATCCG CGACCGCAGC ATCAACGCAT TTGCCCTGCC CGGCGGCTAT
TTTGGCGTGC ACCTGGGCCT GATCGGCACC GTGGGCAGCG CCGATGAACT GGCCGCCGTG
CTGGCCCATG AAATGAGCCA TGTCACGCAG CGGCATATTT CGCGGCTGAT GACGCAGCAG
TCCCGGCAGG CGCCGTGGAT GATTGCGGCA ATGATTTTGG GGGTGCTGGC CGCCAACAAG
AACCCCAACG CGGGCAGCGC GGCCATCGTC GGCGGGCAGG CACTGGCGGC GCAGGGGCAG
CTGAATTTTT CGCGCGACAT GGAGCGCGAG GCCGACCGCG TCGGCTTTGG CGTGATGGAG
GGCGCCGGCT ATGCCAGCCG GGGCGTCTCG GGCATGTTCG AAAAGCTGCA GCAGGCCAAC
CGGCTCAATG ACAACGGCTC GTTTCCCTAC CTGCGCTCGC ACCCGCTGAC CACCGAGCGC
ATCGCCGAGG CGCAGGCCCG CGTGCAGCTG GCGTCGGCGG CAGCCAAGCC GTCACCCGAG
AAGCTGGCCG CCGAGAGCCG CACGCAGCTG CTGCACGCGA TGATGGCCCC CCGCGCCCGC
ATCCTGGCCG TTCCCGGCGT GGATGCGCTG CGCACCATGC TGGCCGAGGG CCAGCGCAGG
GCGGCTGCCT TGCCCGCGCC GCAGCCGGCG GCGCCAGCGG TGTCGGCGGC CACGGTCCGC
GATGCTGGCG CGCTGTATGG CGGCGCCTTC GCTGCCGCCC AGCTGCGTGA TTTTTCGGCG
GCGCGCAACC TGCTCGGCCG GCTCAAGCCG CTCACCGCCG ACATCGGGCC TGCGGCCAAG
GCCGCCGAAC TGCTGGCCAT CGAGGTCGAT TTGCTGGAGG GCAAAATACC GGCGTCCGCA
GCCTCGGCCG ACATCGGCAA GGCCGGCTCG CGGGCCGAGT TGCTGCTGCA GGCGAAGGCG
CTGATGGCGG CCAGCCGCGC GCCGGACGTG TCGCAGGCGC TGCAGACCTG GGTGGCCGTG
CATCCCCGCG ATGCGATGGC CTGGCAGTTG CTGGCCGTTG CCTGCGGCCA GCAAAACCAG
CCGGTGCGGG CGATCCGGGC CGATGCCGAA AGCCGCGCCG CCCAGCTCGA CTACGCCGCT
GCGCTGGACC GCTTCAAGGC GGCGCAGGGC TTGATGCGCA GCAGCCCGGC CAGCGCCGAT
TATGTGGAAG GCTCGATCAT CGACACCCGC ACCCGGCAGG TCGAGTCAAT CCTTAAAGAG
CAGGCGCTCG AGGACAAAGT CAATCGCTAG
 
Protein sequence
MTSSNRSDNT PPQPRHAWRR LVLVLALALC QLAPPLSLAQ TAGAAPGRAT GSLPSLGDNS 
ELSAAAERRI GDRIAVSIYR DPDYVDDPVL VDYLQGIWQP LMAAARARGE LPAELDERFA
WELFLIRDRS INAFALPGGY FGVHLGLIGT VGSADELAAV LAHEMSHVTQ RHISRLMTQQ
SRQAPWMIAA MILGVLAANK NPNAGSAAIV GGQALAAQGQ LNFSRDMERE ADRVGFGVME
GAGYASRGVS GMFEKLQQAN RLNDNGSFPY LRSHPLTTER IAEAQARVQL ASAAAKPSPE
KLAAESRTQL LHAMMAPRAR ILAVPGVDAL RTMLAEGQRR AAALPAPQPA APAVSAATVR
DAGALYGGAF AAAQLRDFSA ARNLLGRLKP LTADIGPAAK AAELLAIEVD LLEGKIPASA
ASADIGKAGS RAELLLQAKA LMAASRAPDV SQALQTWVAV HPRDAMAWQL LAVACGQQNQ
PVRAIRADAE SRAAQLDYAA ALDRFKAAQG LMRSSPASAD YVEGSIIDTR TRQVESILKE
QALEDKVNR
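
The gene sequence begins with TTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the alternative start codons and is read as methionine when it initiates translation. The sketch below illustrates this on the first 60 bases of the gene sequence; it is a hand-rolled translator written for illustration, not the tool used to generate this record, and all names in it are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Table 11 assigns the
# same amino acids as the standard code; it differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initiating start codon as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":           # stop codon ends translation
            break
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"            # initiator codon is read as methionine
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence above (60 bases).
GENE_PREFIX = "TTGACTTCTT CGAACCGCTC TGACAACACC CCGCCACAGC CGCGCCATGC CTGGCGCCGG"
print(translate_cds(GENE_PREFIX))  # MTSSNRSDNTPPQPRHAWRR
```

The output matches the first 20 residues of the protein sequence listed above.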