Gene Pnap_4378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_4378 
Symbol 
ID4685368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008757 
Strand
Start bp284190 
End bp287387 
Gene Length3198 bp 
Protein Length1065 aa 
Translation table11 
GC content55% 
IMG OID639826233 
Productphage integrase family protein 
Protein accessionYP_973398 
Protein GI121582956 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.971134 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCGGC TCAAGGCTAC ACAGACGCCT GTTGATAAAA CAGACGAGGT TGCTGCTCGA 
CCGAAAAACA ACCGTGGCCG CAAGGCAGGG CAGCCGCCGA CACCACGAAC GGCGCTGCAA
AAGTGTATTT GGCAGAAAGA GCAGTTTGTC GCGCGCACAA GCGAACTGAC CGGCCGTACC
GTGTATGAAA ACCAGTCACA CGACCAATAC CTTATTTCTC CCGCTTGGAA TCAAGCGGCC
GTCCAGGTGG TCGTTGCCGA GTGGAATGCT CGTATCTACG CACTTCCAGG AAGCAACCGA
CCGCCAGCGG GCCAGCGGGC TGCGGTTGCG GCGCTGTCAG ATTGTGTTTC GCCACAAGCC
TTCGGAGAAG GCCTGAACGT ACTTTTGCTC CAAGAGTTCA AAGCAAAGCT CGACAATGAA
GGCTTCCACT TCTCCGGCCA GCAGTTTTTG ACGCAGTTGA TGCTAGCGAT GTGGTGCAAA
GGGCTGGTGG CGTGGCCACT GACCGTTCAA GAGCCGTTTC CGTCGGCTAT CTTGACCCTC
AATAATGCGG GATGGAGCGC CGAGTTCGTA GTCGTTCTGA ACTATGTGCG AAAGTACCTC
GCCCGCGACA CCCTTACCGA CGCTGTGACG TTTCGGTTCT TCGTGGACAT GGTGCTCTCC
AGGGCTGGAG TCGTAGAACT TGGCGACATA ACTCCCACCA CCATGCAGGT CCAGCCAGCG
CCGACGCGAA AGAGGAAACA GCGACCCATG GCGTTTACAG GTCTGCTGCA GGCGCTAAGG
GATGAATACT CCATCAAACA GGTTCCGTGG GTACTGGAAG ACTTCGGCTT TTATCAGGCC
AGGTCTGGTC ATCTGCGCCG CAAGGAAAAT TTCGAATGGG CTCTTGAGAT TGACCCAACA
ATGATCGAAT GGGTGGAATT GGCAAAGCAG CATCTTGCCG AAAATCCGAC GAACTATAGG
AAACGCCGGA GTGCGGTGAA TGTCTTTATC GAGCACATCG CCAAAAACTC GACTGTCAAC
CGAAGTCCTG CAGGCTACTG TGACATTAAG CGTCGTCCTG ATCCTCTCTT TCACATCGAT
GGCAATAAAG GGCGTCAGAC GATGGTGGTG GTGTACCAGT TCCTCAACGA GGTTCTGCAC
AAAGTCTGTA TCCAGGCTGA CGACAATGCA TTTCCTATTT TGATGCCGGG CTTTGCCAAC
CCGCTGGTCA AACAGACGTT TATCGGTGTC AACAAGGGAG AGACACACCG CGAGTCCATG
CCAACGCGTC TCATTCGCCA AGCGATGAGC ATACTGACCG AGAACGATTT CGCTTGGGCG
CGAGAGGTCG GAAAGCTCTC AGACAACTTC CGTTGGAAAA ATCCAGAAAC CCGAGAATTT
GAGAGCGTCT GGAGCCCTGT GCGTACATAC GCTCTTATGG CCAAGCTTAT CCTCCCAGCT
CGTACCTATC AAATTCGGCA TCTCGACAGC AGTGAAGGCG ATTCGCTCAG GTATGAAGAG
AACGGAACAT GGGGACCAAA TACGGGAAAA CACGCGCCAT CTAATGCGGG CGTCGAGCGT
GGTGTGTTCC GGCAATACAA GCGCAAGGAC GGTAGTCTCG GCGCTGTGCT GTACTTCAAC
ACCAATAAGA CTGGGGACAT CGATAAAGAC AAGGACAAGA CCGGTTTCGT CATGCCGTGG
GAGAAGCTGG ACGCACTTCA ACTGTTTGCA CGGATGCGCA ACTGGCAGGA AAAGTACAAC
CAGTTAGATG GCCCAACCAA TTGGACCGAC ATCAACGAAC TGAAGGCGGC AAAGCACGTT
GAGGACCTTC GCAAGCTCGG CACCAACTTA TTCCTCTTCC GTGACCCCTG CCATCAACAT
CGACCAGACC TTCCTGTGTC TGATGTTCGA CTGCGCAACC TCTGGTTGAG GCTCATGGAA
GAACTGGAGA AAAGGCTGGC GCTGGCTGGT GAAACCCTCG CAAACGGCGA GCCGATAAAA
CTTGTTATCA GCTCAACGAA AAGAAGCGCG CCATCGGCGG CACTGTTCGA CCTTCACACG
CTGCGGGTCA CAATGATTAC CGCTATGTAC GAAGAGGGCA TTCCGCCGGA AATCCTCATG
AAAATTGTCG GGCATGCCTC GATCATCATG ACGCTCTACT ACGTCAAGCT CAACGCCGAA
ACCATTTCGG TGCAGTTGGA TGCTGCAGTG CAGGAGCGTC AACGCAAGGA ACAGTCCGAG
ATGGCAGGGT TCATCCAGCG CGCCAGCAGA ATGGAGCTGG AGCGCGCCGT CGCCAGAACG
CACCCGTCGG CGCTCGATGC CATCACCAGC GGAACTGGCA CAGGTTTGGT CGTCATGGAC
CACGGCGTGT GTCCGGTTGC AGCAAGGCGT TGCCATGAAG GTCTCGCCTC GATGGACCCA
AGCTCTGGTT TCATCCGGTA TCTGGCTGTC CCGGGTGGAG CAAGCAATTG CGTCCGTTGC
CGATTCTTCG TCAGTGGTCC CGCGTTCCTG CTTGGGCTTG AGGCCCACGT CATTGACCTG
TCGTACAGAC TGCGCAAGGC ATCGGTGTCA TTTGAGAAGT CCCAACTGCG ATTCGATGCC
CTTTCGAACC AATTTGCCGA CGCTCTTGAA AACGTGACCC CGTTCTCGCA GCAAAGAGAC
CTGGAAATTG CCGAAACCGC GCTGGAGTCA GTGACCGCCG AGGTTGATGC GATTGCGCTC
AGCCTTCAGT GTGCTTACGC CCTCACGGAG CAAAGCATTC ACATCAACAA CCTTGGCGTC
AACAAGCCCG GTGGCAGCGG ACTGTCACTG GTGGCAGTGG GCGGCACAGG CGAGTTGGAA
GCGGTTCTCA CTGAAACGCA CGAGTTCGAG CAGTTGCATC GTATTTGTGA GAGCGCAATG
CTGTTTGACG GGCTGAAAAT CGACTGGCAA CACCCCAATC TGGAACGCGC CCGTCTGTTT
GACAGAATGC TGCGCGCATC CGGCTGCGAG CCGCACTTCT CGCTTCTAGA CGATGAGGAT
GCGCTCGGTG CGGCTAATGC GATGGGAAAG TTCTTGTACG CCAGGCTGGG GCCAAAAACA
GTCCATGATC TCATGGACGG CCGCACCACG CTGCGGGCAC TCGGTATGGA AAAGGCTTTT
GCAGACCAGC TTCAGGCCAT GACGCCCAAG TCCCTTGCCT TGAGCCGCAC AATTTTTATT
GAAGGCAACT CCTCATGA
 
Protein sequence
MTRLKATQTP VDKTDEVAAR PKNNRGRKAG QPPTPRTALQ KCIWQKEQFV ARTSELTGRT 
VYENQSHDQY LISPAWNQAA VQVVVAEWNA RIYALPGSNR PPAGQRAAVA ALSDCVSPQA
FGEGLNVLLL QEFKAKLDNE GFHFSGQQFL TQLMLAMWCK GLVAWPLTVQ EPFPSAILTL
NNAGWSAEFV VVLNYVRKYL ARDTLTDAVT FRFFVDMVLS RAGVVELGDI TPTTMQVQPA
PTRKRKQRPM AFTGLLQALR DEYSIKQVPW VLEDFGFYQA RSGHLRRKEN FEWALEIDPT
MIEWVELAKQ HLAENPTNYR KRRSAVNVFI EHIAKNSTVN RSPAGYCDIK RRPDPLFHID
GNKGRQTMVV VYQFLNEVLH KVCIQADDNA FPILMPGFAN PLVKQTFIGV NKGETHRESM
PTRLIRQAMS ILTENDFAWA REVGKLSDNF RWKNPETREF ESVWSPVRTY ALMAKLILPA
RTYQIRHLDS SEGDSLRYEE NGTWGPNTGK HAPSNAGVER GVFRQYKRKD GSLGAVLYFN
TNKTGDIDKD KDKTGFVMPW EKLDALQLFA RMRNWQEKYN QLDGPTNWTD INELKAAKHV
EDLRKLGTNL FLFRDPCHQH RPDLPVSDVR LRNLWLRLME ELEKRLALAG ETLANGEPIK
LVISSTKRSA PSAALFDLHT LRVTMITAMY EEGIPPEILM KIVGHASIIM TLYYVKLNAE
TISVQLDAAV QERQRKEQSE MAGFIQRASR MELERAVART HPSALDAITS GTGTGLVVMD
HGVCPVAARR CHEGLASMDP SSGFIRYLAV PGGASNCVRC RFFVSGPAFL LGLEAHVIDL
SYRLRKASVS FEKSQLRFDA LSNQFADALE NVTPFSQQRD LEIAETALES VTAEVDAIAL
SLQCAYALTE QSIHINNLGV NKPGGSGLSL VAVGGTGELE AVLTETHEFE QLHRICESAM
LFDGLKIDWQ HPNLERARLF DRMLRASGCE PHFSLLDDED ALGAANAMGK FLYARLGPKT
VHDLMDGRTT LRALGMEKAF ADQLQAMTPK SLALSRTIFI EGNSS