Gene Pnap_4334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_4334 
Symbol 
ID4685431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008757 
Strand
Start bp245524 
End bp246882 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content65% 
IMG OID639826193 
ProductHNH endonuclease 
Protein accessionYP_973358 
Protein GI121582916 
COG category[V] Defense mechanisms 
COG ID[COG1403] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.069794 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.702083 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTTT TTGTATTGGA TCGGAGCGGG CAGCCGGTGA TGCCCTGCAG CGAAAAGCGC 
GCCCGGCTGC TTCTGCAGAG CAAGCGCGCC AGGGTTCACC GGGTCATGCC GTTCACGATT
CGACTGATCG ACCGAAGCCA GGCCGACTGC CTGCTCCAGC CGCTTCGCCT CAAGCTCGAT
CCGGGCAGCC GGGCCACGGG CCTGGCACTG GTGCGGGATA TTGAAACCAT TGAGCCCGCC
ATGGGTGAGG TCACGCGCGG CGCGGCCGTG GTGAGCCTGC TTGAGCTGGA GCACCGCGGC
AGGCAGATAT CAGAAGCGCT CACTGCGCGT CGCCAGATGC GCCGCCGGCG CCGCAACCAG
CTGCGCTACC GCGCCCCGCG TTTTTTGAAC CGGGGCAACA AGCAAAAAGG TTGGCTCGCG
CCTTCGTTGC AGCACCGGGT CGATACGACG GCGGCCTGGG TAGCACGCAT CCAGCGCTGG
GCACCGGTGA CGGCGCTCAG CTCGGAACTG GTGCGCTTTG ACATGCAGCA ATTGCAAAAC
ACCGAGATCG AAGGCGCCGA GTACAGCCAA GGCACGCTGG CGGGCTACGA AGTGCGCGAG
TACCTGCTGG AGAAGTGGAA GCGCACCTGC GCCTACTGCG ATGCGCAAAA CACGCCGCTG
CAGATCGAGC ACATCGAGCC CCGGGCGCGG GGCGGCAGCC ACCGGATCTC CAACCTATGC
CTGGCCTGCC AGCCCTGCAA CCAGAAAAAA GCCGCGCGCA CGCTTCAAGA TTTCCTCAAG
AAAGACCCCA AGCGCCTGGC GCGCATCCTC GCGCAAGCCC AGCGGCCGCT GCGCGATGCC
GCAGCGGTCA ACGTCACGCG CTGGGCGCTG GCCAACGCAC TGAAGACCAC TGGCTTCCCG
CTGGAGCTGG CCTCGGGTGG CCGGACCAAA TTCAACCGAT GCACGTTGGA CGTGCCCAAG
ACGCACGCGC TGGATGCGGC CTGCGTGGGC CAGGTGGAGG CCATCACTGG CTGGCAGCAG
CCTGCGTTCT ACACCTTGAC CATCAAGGCC ATGGGCCGGG GCAGCTACCA GCGCACTCGG
CTGGACGCCT ACGGCTTTCC AAGAGGCTAC CTGATGCGTG CCAAGTCGGT CCACGGTTTC
CAGACCGGGG ACCGGGTCAA GGCCGTCGTG CCCCAGGGCA AGAAGGTTGG CACGCATGTG
GGGCGCGTGG CGATCCGCAA GACGGGCAGC TTCAACATCA CCACGCCGGC TGGCGTGGTT
CAGGGCATCA GTCACAAGCA CTGCCGCATC GTTCAGCGGA ACGACGGGTA TGGCTATTTC
TTCCACCGGG CCGATTTAAC ACAGGACGCG AACAGGTAA
 
Protein sequence
MAVFVLDRSG QPVMPCSEKR ARLLLQSKRA RVHRVMPFTI RLIDRSQADC LLQPLRLKLD 
PGSRATGLAL VRDIETIEPA MGEVTRGAAV VSLLELEHRG RQISEALTAR RQMRRRRRNQ
LRYRAPRFLN RGNKQKGWLA PSLQHRVDTT AAWVARIQRW APVTALSSEL VRFDMQQLQN
TEIEGAEYSQ GTLAGYEVRE YLLEKWKRTC AYCDAQNTPL QIEHIEPRAR GGSHRISNLC
LACQPCNQKK AARTLQDFLK KDPKRLARIL AQAQRPLRDA AAVNVTRWAL ANALKTTGFP
LELASGGRTK FNRCTLDVPK THALDAACVG QVEAITGWQQ PAFYTLTIKA MGRGSYQRTR
LDAYGFPRGY LMRAKSVHGF QTGDRVKAVV PQGKKVGTHV GRVAIRKTGS FNITTPAGVV
QGISHKHCRI VQRNDGYGYF FHRADLTQDA NR