Gene Pnap_3840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_3840 
Symbol 
ID4687841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp4096811 
End bp4097818 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content69% 
IMG OID639836858 
Productpeptidase M19, renal dipeptidase 
Protein accessionYP_984057 
Protein GI121606728 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATACCC GATCATTGCC GATTCACTCC GCCAAAGCCA TAAATTGGGA AGCCCACGCC 
TGCCTGCCGC TGCATCCCGA CGCCGACTTT TCGCCGCTGG ACCGGCTGCG CGATGCCGGC
GTGCATTACG TGTCGGTCAA TGTCGGCATG GACATGAACC CGGTGACGCA AATCCTGTCG
GTGCTGGCGG CTTATCGCGC CCGGATTGCG GCCCATCCCG AGCGTTTCCG GCTGGTGACC
AGCGTGGCGG AAATCGAACA GGCGGCGGCG AATGGCGCGC TGGCTGTCGG CTTCGACCTC
GAAGGCGCGC TGCCACTGCT GGGCCAGCCC GACATGGTGG CGCTGTACCG CGACCTGGGC
GTGCGCCAGA TCCACTTCGC CTACAACCGC AACAACCCGG TCGCCGACGG CTGCCATGAC
GTGGAGCGCG GCCTCACGCC GCTGGGCCGG CGCATGGTCG AGGCGGTCAA CCGGGCCGGC
GTGCTGATGG ACTGCTCGCA CACCGGCCGG CGCTGCAGCC TGGACATCAT GGCCGCCTCC
AGCCAGCCGG TGATCTTCAG CCACGCCAAC CCGCTGGCGC TCATCCCGCA CGGGCGCAAT
GTCAGCGACG AGCAGATCAG GGCCTGCGCG GCCACGGGCG GCGTGGTCTG CATCTCTGGC
GTGTCGGCGT TTCTGGGCAC CGGCACGCCC ACGGCCATGG ATGTGGCGCG CCACGCGGCC
TACGTGGCCG ATCTGGTCGG CGCGCAGCAT GCCGGCATCG GGCTGGACAT CGGCTTTGGC
CAGCCCGGAC TCGACGACAA CCCGCCGGGC CATCACGACC CGGCCTACTG GTGGCCCGCT
GCCGCAGGCT ACCAGGGGGC GCTGGGCAGC ATCACCTACA CGCCGGTCGA AACCTGGCGG
CTGTTGCCCA AGGCGCTGCA AAAGGTCGGC ATGAACGCGG CTGAAGCCGC CGGCGTCATG
GGTAACAATA TGCTGCGCGT CGCCGGCCAG GTCTGGCAGC GCCCCTGA
 
Protein sequence
MHTRSLPIHS AKAINWEAHA CLPLHPDADF SPLDRLRDAG VHYVSVNVGM DMNPVTQILS 
VLAAYRARIA AHPERFRLVT SVAEIEQAAA NGALAVGFDL EGALPLLGQP DMVALYRDLG
VRQIHFAYNR NNPVADGCHD VERGLTPLGR RMVEAVNRAG VLMDCSHTGR RCSLDIMAAS
SQPVIFSHAN PLALIPHGRN VSDEQIRACA ATGGVVCISG VSAFLGTGTP TAMDVARHAA
YVADLVGAQH AGIGLDIGFG QPGLDDNPPG HHDPAYWWPA AAGYQGALGS ITYTPVETWR
LLPKALQKVG MNAAEAAGVM GNNMLRVAGQ VWQRP