Gene Pnap_3827 details

Gene Information

Locus tag: Pnap_3827
Symbol:
ID: 4687707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 4080983
End bp: 4084057
Gene Length: 3075 bp
Protein Length: 1024 aa
Translation table: 11
GC content: 66%
IMG OID: 639836845
Product: excinuclease ABC, A subunit
Protein accession: YP_984044
Protein GI: 121606715
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
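
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4080983
end_bp = 4084057

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 3075
print(protein_length)  # 1024
```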


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.67305
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACTTTC CTAACGACAG CAAGTACCTC GCCAGCGTGC TGGCCGCCCA GCGCATCAGC 
ATTCGCGGCG CGCGCACCCA CAACCTCAAG AACATCGACC TGGACATTCC GCGCAACCAG
CTGGTGGTCA TCACCGGCCT GAGCGGCTCG GGCAAGTCGA GCCTGGCCTT TGACACGCTG
TACGCCGAAG GCCAGCGCCG CTATGTCGAA AGCCTGTCCA CCTATGCGCG GCAGTTTCTG
CAGCTCATGG ACAAGCCCGA CGTGGACATG ATCGAGGGCC TGTCGCCGGC GATTTCCATC
GAGCAGAAGG CGACCTCGCA CAACCCGCGC TCGACCGTCG GCACGGTGAC CGAGATCCAC
GACTACCTGC GCCTGCTGTT CGCCCGCGCC GGCACGCCGT ATTGCCCCGA ACACGATCTG
CCGCTGCAGG CGCAAAGCGT CGCCGAAATG GTCGATGCCG TGCTGGCGCT GCCCGAGGAC
ACGCGCCTCA TGATCCTGGC GCCGGTGGCG CGCGAGCGCA AGGGCGAGTT CGTTGACCTG
TTCGAAGAGA TGCAGGCCCA GGGCTATGTG CGCTTTCGGG TCGATGGCGC GACCTATGAA
TTCGACAACC TGCCCAAGCT CAAGAAAACC GAGAAGCACA CGATTGACGT GGTGATTGAC
CGCGTCAAGG TGCGCCTTGA CAAGGAAGTC CAGCCCGACG CCGAAGCCGC GACCGGAACC
GCCCCGCCGC CCGCGCTGCG CCAGCGCCTG GCCGAGAGCT TCGAGGCGGC GCTGCGGCTG
GCCGACGGCC GGGCGGTGGC GCTGGAGATG GACTCCGGCA AAGAGCACCT GTTCAGCGCC
AAGTTCTCCT GCCCGGTCTG CAGCTATTCG CTCAGCGAGA TGGAGCCGCG CCTGTTTTCG
TTCAACTCGC CGGTCGGCGC CTGCCCGTGC TGCGACGGCC TGGGGCATCG GGATTTTTTC
GATCCGGCGC GGGTCGTGGC CTTTCCGTCC CTGAGCCTGG CCAGCGGCGC GGTCAAGGGC
TGGGACCGGC GCAATGCCTA TTACTTCTCG ATGCTGGAGA GTCTGGCCAA ACACTACCAG
TTTGACATCG ACAAGGCGTT TGAAGACCTG CCCGAGAACG TGCGCCAGGT GGTGTTGCAT
GGCTCCGGCG AGGAAGAGAT CAAGTTCAGC TACCTCATGG AGTCGGGCGA GAAGGCCGGC
AAGAAGGTGA GCAAGAAGCA TCCATTCGAA GGCATCATCA CCAACTTCGA GCGCCGCTAC
CGCGAAACCG AATCCACGGT GGTGCGCGAA GAACTCGCGC GCTACCGCAG CCTGCAGCCC
TGCCCCGACT GCCGCGGCAC GCGGCTGCGG CTGGAAGCGC GCCATGTGTT TTTGGTCGGC
ACGGCGCAGG GCGAAGACGC AGCCGGCCCG CGCAAGACCA TCCATGAAAT CAGCCGCCTG
ACGCTGCGCG AGAGTTTCAG CTACTTCAGC ACCCTGAGCA TGCACGGCGC CAAGGCCGAG
ATTGCCGCCA AGGTGGTGCG CGAGATCGGC CTGCGGCTCA AATTCCTCAA CGACGTGGGC
CTGAATTACC TGAGCCTGGA CCGCAGCGCC GAGACGCTGT CGGGCGGCGA GTCGCAGCGC
ATCCGGCTGG CCAGTCAGAT CGGCTCGGGG CTGACCGGCG TGATGTATGT GCTCGACGAG
CCCAGCATCG GTCTGCACCA GCGCGACAAC GACCGCCTGA TCAGCACGCT CAAGCATTTG
CGCGACATCG GCAACAGCGT GCTGGTGGTC GAGCACGACG AGGACATGAT CCGCGCCGCC
GACCATGTGA TCGACATGGG GCCGGGCGCG GGCATCCACG GCGGGCGCGT GATGGCGCAG
GGCACGTTCG AGGAAGTTCA GGCGAACCCC GATTCGCTGA CCGGCCGCTA CCTGGCGGGA
ACACTCAGGA TCGCGGTGCC CCAGCACCGC ACCCCCTGGC TGCCGACGGT CAAAAATGCC
AACGCCAACG CCTTCGACAA AACCAAATCG CGCTTTGCGC CCAGCCCGGC GGCCGAGCGC
CGCGCGGCGC GCGAAGCCAT CCACCAGGCC ACGCTGGGCG ACATGCAGGC GCTGCGCGTG
ATCAACGCCA CCGGCCACAA TTTAAAAAAC GTGAGCGTGG AATTCCCCGT CGGCCTGCTG
ACCTGCGTGA CCGGCGTGTC CGGCTCGGGC AAATCGACAC TGGTCAACGA CACGCTGTAC
GCCGCCGTGG CGCGCACGCT GTACCGCGCG CATGAAGAAC CCGCGCCGCA CGAAAGCATC
GAGGGCATCG AGCATTTCGA CAAGGTCATC AACGTGGACC AGTCGCCGAT TGGCCGCACG
CCGCGCAGCA ACCCGGCGAC CTACACCGGC CTGTTCACGC CGATCCGCGA ACTGATGGCC
GAAATGAACA CGGCGCGCGA ACGCGGCTAT GGCGCCGGGC GCTTTTCCTT CAACGTCGCC
GGCGGGCGCT GCGAAGCCTG CCAGGGCGAC GGCGTGGTGA AGGTGGAAAT GCACTTCCTG
CCCGACGTGT ACGTGCCCTG CGATGTCTGC CACGGCCAGC GCTACAACCG CGAAACGCTC
GAAGTCCAGT ACAAGGGCAA AAACATCGCG CAGATCCTGG ACTTCACGGT CGAGACCGCC
GCCGAGTTCT TCAAGGCCGT GCCGACGATA GCGCGCAAGC TGCAGACGCT GCTCGACGTG
GGCCTGAGCT ACATCAAGCT CGGCCAGGCG GCGACCACGC TGTCGGGCGG CGAGGCGCAG
CGGGTCAAAC TCGCGCTGGA ACTGAGCAAG CGCGACACCG GCCGCACGCT CTACATCCTG
GACGAACCCA CCACCGGCCT GCATTTCGCC GACATCGACC TGCTGCTCAA GGTGCTGCAC
CAGTTGCGCG ACGCGGGCAA CACCATCGTC GTGATCGAGC ACAACCTGGA CGTGATCAAG
ACCGCCGACT GGCTGATCGA CATGGGACCG GAAGGCGGGG CCGGCGGCGG CCGGGTGGTG
GGCGTTGGCA CGCCCGAAGA CATTGCCGCC AACCCCGACA GCCATACGGG TCACTATCTG
GCGCGGCTGC TGTAG
 
Protein sequence
MNFPNDSKYL ASVLAAQRIS IRGARTHNLK NIDLDIPRNQ LVVITGLSGS GKSSLAFDTL 
YAEGQRRYVE SLSTYARQFL QLMDKPDVDM IEGLSPAISI EQKATSHNPR STVGTVTEIH
DYLRLLFARA GTPYCPEHDL PLQAQSVAEM VDAVLALPED TRLMILAPVA RERKGEFVDL
FEEMQAQGYV RFRVDGATYE FDNLPKLKKT EKHTIDVVID RVKVRLDKEV QPDAEAATGT
APPPALRQRL AESFEAALRL ADGRAVALEM DSGKEHLFSA KFSCPVCSYS LSEMEPRLFS
FNSPVGACPC CDGLGHRDFF DPARVVAFPS LSLASGAVKG WDRRNAYYFS MLESLAKHYQ
FDIDKAFEDL PENVRQVVLH GSGEEEIKFS YLMESGEKAG KKVSKKHPFE GIITNFERRY
RETESTVVRE ELARYRSLQP CPDCRGTRLR LEARHVFLVG TAQGEDAAGP RKTIHEISRL
TLRESFSYFS TLSMHGAKAE IAAKVVREIG LRLKFLNDVG LNYLSLDRSA ETLSGGESQR
IRLASQIGSG LTGVMYVLDE PSIGLHQRDN DRLISTLKHL RDIGNSVLVV EHDEDMIRAA
DHVIDMGPGA GIHGGRVMAQ GTFEEVQANP DSLTGRYLAG TLRIAVPQHR TPWLPTVKNA
NANAFDKTKS RFAPSPAAER RAAREAIHQA TLGDMQALRV INATGHNLKN VSVEFPVGLL
TCVTGVSGSG KSTLVNDTLY AAVARTLYRA HEEPAPHESI EGIEHFDKVI NVDQSPIGRT
PRSNPATYTG LFTPIRELMA EMNTARERGY GAGRFSFNVA GGRCEACQGD GVVKVEMHFL
PDVYVPCDVC HGQRYNRETL EVQYKGKNIA QILDFTVETA AEFFKAVPTI ARKLQTLLDV
GLSYIKLGQA ATTLSGGEAQ RVKLALELSK RDTGRTLYIL DEPTTGLHFA DIDLLLKVLH
QLRDAGNTIV VIEHNLDVIK TADWLIDMGP EGGAGGGRVV GVGTPEDIAA NPDSHTGHYL
ARLL
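
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11 uses the standard codon assignments and that an alternative start codon such as GTG is read as methionine when it initiates the CDS. The function name `translate`, the `first_is_start` parameter, and the reduced start-codon set are hypothetical choices for this example, not part of the record or of any library.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small, non-exhaustive set of initiation codons read as Met.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, first_is_start=True):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("GTGAACTTTC CTAACGACAG C"))  # MNFPNDS
# Last five codons, ending in the TAG stop codon.
print(translate("GCGCGGCTGC TGTAG", first_is_start=False))  # ARLL
```

Both fragments agree with the start (MNFPNDS...) and end (...ARLL) of the listed protein sequence.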