Gene Pnap_1416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_1416 
Symbol 
ID4688505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp1503628 
End bp1504698 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content66% 
IMG OID639834418 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_981651 
Protein GI121604322 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.800159 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAGTCC TAGGAATTGA ATCAAGTTGC GATGAAACCG GAGTGGCGCT GGTCGATACC 
AGCGCCAGCC CGACTCCGTG CCTGCTCGCG CACGCTTTGT ACAGCCAGAT CGCCATGCAC
CAGCCCTATG GCGGCGTGGT GCCAGAGCTG GCCAGCCGCG ACCATATCCG GCGCGTGCTG
CCGCTGACGC AGGACGTCAT GAAGAGCGCA CAGCACACGC TGGCCGATGT CGATGTGATT
GCCTACACGC GCGGCCCTGG CCTGGCCGGC GCGCTGCTGG TCGGCGCTGG CGTTGCCTGC
TCGCTGGCGG CAGCTTTGGG CAAGCCGGCG ATGGGCGTGC ATCACCTCGA AGGGCATTTG
CTGTCGCCCT TTTTAAGTGC GGACCCGCCG GAATTTCCCT TTGTTGCGCT GCTGGTATCG
GGCGGCCACA CCCAGCTGAT GCGGGTTGAC CGGGTCGGCA GCTACGAGCT GCTCGGCGAA
ACCATCGACG ACGCGGCCGG CGAGGCGTTC GACAAGTCGG CCAAGCTGAT GGGCATGCCT
TACCCCGGCG GGCCGCATCT GGCCCGGCTG GCGCTGGGCG GCGACGGCGC AGCGTTCAAG
CTGCCCCGGC CCTTGCTGCA CAGCGGCAAC CTCGATTTTT CATTCGCCGG CCTGAAAACC
GCCGTGCTGA CGCAGGCGAA AAAGCTCGGC ACCGAGCTGG AAAGCCGCAA AGCCGATCTG
GCCGCTTCCA CGCAGGCGGC GATTGTCGAG GTGCTGGTGA AGAAAACGCT GGCCGCCCTG
TCACAAACCG CGCTCAAGCG GCTGGTGGTG GCCGGCGGCG TCGGCGCCAA CGCCTTGCTG
CGCAGCCAGC TCAATGCGGC CTGCCAGCAG CGCGGCATTC GCGTGCATTA CCCCGAACTG
GAGTTTTGCA CCGACAACGG CGCCATGATC GCCATGGCCG CCGGCATGCG GCTGCAGGCT
GGGCTGGTGA ACCTGGACGC CTTGCGCGGC AGCTACACCT TTGATGTGAA GCCGCGCTGG
AACCTTTCGG AATTCCAGTC CGAACCCGCC CTGGCCACGC AGAACGCCTG A
 
Protein sequence
MLVLGIESSC DETGVALVDT SASPTPCLLA HALYSQIAMH QPYGGVVPEL ASRDHIRRVL 
PLTQDVMKSA QHTLADVDVI AYTRGPGLAG ALLVGAGVAC SLAAALGKPA MGVHHLEGHL
LSPFLSADPP EFPFVALLVS GGHTQLMRVD RVGSYELLGE TIDDAAGEAF DKSAKLMGMP
YPGGPHLARL ALGGDGAAFK LPRPLLHSGN LDFSFAGLKT AVLTQAKKLG TELESRKADL
AASTQAAIVE VLVKKTLAAL SQTALKRLVV AGGVGANALL RSQLNAACQQ RGIRVHYPEL
EFCTDNGAMI AMAAGMRLQA GLVNLDALRG SYTFDVKPRW NLSEFQSEPA LATQNA