Gene Pnap_1997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_1997 
Symbol 
ID4688763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp2122054 
End bp2123289 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content60% 
IMG OID639835005 
ProductCMP/dCMP deaminase, zinc-binding 
Protein accessionYP_982227 
Protein GI121604898 
COG category[F] Nucleotide transport and metabolism
[J] Translation, ribosomal structure and biogenesis
[R] General function prediction only 
COG ID[COG0590] Cytosine/adenosine deaminases
[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.382734 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATG AAGCCTTTAT GGAGATGGCC CTTGTGCAAG CCCGGGTAGC GGCTGCTTTC 
GGGGAAGTTC CGGTGGGTGC AGTGGTCGTC CGGCAGGGCA AGGTGATTGC CACTGGCCGC
AACGCACCGG TTGAGGCCCA TGATCCGACG GCCCATGCCG AGATCATGGC CTTGCGCGCC
GCAGCGCTGG CCTTGGGCAA CTACCGTCTT GACGAGTGCG AACTGTTTGT CACGCTGGAG
CCCTGTGCCA TGTGCAGCGG CGCCATGCTC AACGCCCGGC TCAAACGCGT GGTGTTTGGG
GCGTCTGAAC CCAAAACTGG AGCTGCCGGC TCGGTCATCA ACCTGTTTGC GCAAGCGCGG
CTGAACCACC AGACCGAATT GCAGGGCGGG GTGCTGGCCG AATCTAGCCG CGCATTGCTG
CAGGACTTTT TTCGCCAGCG CCGTGCTGAC CAGCGCGAGG CGGCCCGGCA GCGTCATCCG
CTGCGCGATG ATGCGCTTCG AACCCCTGAC GCGGCTTTTG ACGGCTTGAG CGCTTATCCA
TGGGCGCCGC GTTACCTGAG CGATCTGCCG GCCCTTGACG GCTTGCGGAT GCATTATCTT
GATGAACAGG CGCTTGGCGT GGAGGAGGGC GGCGGGCCGC GCCTGACCTA CCTTTGCCTG
CATGGCAGCC CGGACTGGAG CTATGCTTTT CGCCGGCTGA TCCCGTCCTT GCTGCAGACT
GGTCACCGGG TGGTTGCGCC GGACCTGATT GGTTTTGGCA AAAGCGACAA GCCCAAGAAG
GACAGTTTTC ATACCTTCAG CCGGCATCGC CAGATACTGC TTGAGCTGGT GGAAAAGCTG
GATCTGCAGA ACATCGTGCT GGTGCTGCCC AGACAGGGCA CTTTGCTTGG ATTGACCTTG
CCCCTGGCTG CGCCGCTGCG CTACCGAGGA TTGCGGCTCA TGAACCCCGA TCCATTGGCA
GAGAGTGAAT ACCTGCCGCT AAGCCAGGGG TTTTTGCTGT GGGAGCAAGC AGGCGGCATG
CCGGCAGGTA CTGAAACCGA TGTCGCTTGT GAAGCGCCAT TTCCCGGCAA CAGCTACCGC
GCCGGAGTGC GTGCGTTTGC TGCCATGGCA CCAGATTCCG GAAATCACGA CGGCATTGCA
TCTGAAGCCG GGCAGTTCTG GCGTGATCGC CGCGATGAAT GTAATTTACA CGTTAGTTTT
TTAGAAAAAT ATACGAACAA ACCGTATCAA GAATAG
 
Protein sequence
MSDEAFMEMA LVQARVAAAF GEVPVGAVVV RQGKVIATGR NAPVEAHDPT AHAEIMALRA 
AALALGNYRL DECELFVTLE PCAMCSGAML NARLKRVVFG ASEPKTGAAG SVINLFAQAR
LNHQTELQGG VLAESSRALL QDFFRQRRAD QREAARQRHP LRDDALRTPD AAFDGLSAYP
WAPRYLSDLP ALDGLRMHYL DEQALGVEEG GGPRLTYLCL HGSPDWSYAF RRLIPSLLQT
GHRVVAPDLI GFGKSDKPKK DSFHTFSRHR QILLELVEKL DLQNIVLVLP RQGTLLGLTL
PLAAPLRYRG LRLMNPDPLA ESEYLPLSQG FLLWEQAGGM PAGTETDVAC EAPFPGNSYR
AGVRAFAAMA PDSGNHDGIA SEAGQFWRDR RDECNLHVSF LEKYTNKPYQ E