Gene Pnap_3963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_3963 
Symbol 
ID4687732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp4229131 
End bp4232103 
Gene Length2973 bp 
Protein Length990 aa 
Translation table11 
GC content65% 
IMG OID639836980 
ProductDNA topoisomerase III 
Protein accessionYP_984179 
Protein GI121606850 
COG category[B] Chromatin structure and dynamics
[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG5531] SWIB-domain-containing proteins implicated in chromatin remodeling 
TIGRFAM ID[TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.107223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0620528 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAC TGGTAATTGC CGAAAAACCA TCGGTCGCGC AAGACATCGT GCGCGCCCTC 
ACGCCCACGG CGGGCAAGTT TGAAAAGCAC GACGAGTACT TTGAAAGCGA CGACTGGGTC
ATCACCTCCG CCGTCGGCCA CCTGGTCGAA ATCCAGGCGC CCGAAGAGTT CGACGTGAAG
CGCGGCAAGT GGAGCTTTGC GAACCTGCCG GTGATTCCGC CGTACTTCGA CTTGAAGCCG
ATGGACAAGA CCAAGACGCG GCTCAACGCC ATCGTCAAGC TGACCAAGCG AAAAGACATA
GGCGCGCTGG TCAACGCCTG CGACGCGGGC CGCGAGGGCG AACTGATCTT CCGGCTGATC
CAGCAATACG CCAAGAGCAA GCACCCGGTC AGGCGGCTGT GGCTGCAGTC GATGACGCCG
GCGGCCATCC GCGACGGCTT TGCCGCGCTG CGCAGCGACG CGCAGATGCT GCCGCTGGCC
GACGCCGCGC GCTGCCGCTC CGAGGCCGAC TGGCTGGTCG GCATCAACGG CACGCGCGCC
ATGACGGCCT TCAATTCGCG CGACGGCGGC TTCTTCCTGA CCACCGTCGG CCGGGTGCAG
ACGCCGACGC TGTCGGTGGT CGTCGAGCGC GAGGAAAAAA TCCGCAAGTT CGTCAGCCGC
GACTACTGGG AAATCCACGC CCTGTTTTCG GCCGAAGCCG GCGAATACCC GGGCAAATGG
TTCGACCCGA AGTTCAAGCG CGCCGCCGCC AATGCAGACA ATGCCGAGCC CGACCCCGAG
CTGAAGGCCG ACCGGCTGTG GACCGAGCGC GAGGCGCGCG CGATTGCCGA CGCGGTGCGC
GGCCAGACCG CCACGGTGAC CGAGGAATCG AAACCCACCA CGCAGGCGCC GGGGCTGCTG
TTCGATTTGA CTTCTTTGCA GCGCGAAGCC AACGGCAAGT TCGGCTTTTC GGCCAAGACC
ACGCTGTCGA TTGCGCAAAG CCTGTACGAG CGCCACAAGG CGCTGACCTA CCCGCGCACC
GACTCGCGCG CGCTGCCCGA GGACTATGTG CCGGTGGTGA AGCAGACCTT CGAGATGCTG
GCCACGAGCG GCATGAACCA CCTCGCGCCG CATGCCGCGA CGGCCCTCAA GGGCAATTAC
ATCAAGCCGA CCAAGCGCAT CTTCGACAAC GCCAAGGTCA GCGACCACTT CGCCATCATC
CCGACACTGC AGGCGCCCAG CGGCCTGAGC GAGGCCGAGC AAAAGCTCTA TGACTTGGTG
GTGCGCCGCT TCATGGCGGT GTTTTTCCCG AACGCCGAAT ACCTGGTGAC GACGCGGATT
TCGGTGTCGG TCGGCCACAG CTTCAAGACC GAAGGCAAGG TGCTGGTCAA GCCGGGCTGG
CTGGCGATCT ACGGCAAGGA AGCTGCCGCC GAAGTGCCCG AGGCCAAGGA AGGCGACAAG
GGCCAGAGCC TCGTGCCGGT CAAACCCGGC GAGCGCGTCG GCGTCGAGGC CGCCGATGCC
AAGGCCCTGA AAACCCGCGC GCCCGCCCGC TACAGCGAAG CCACGCTGCT CGGCGCGATG
GAAGGCGCGG GCAAGACGAT TGACGACGAC GAGCTGCGCG AAGCCATGCA GGAAAAAGGC
CTGGGCACGC CGGCCACCCG CGCCGCCACG ATTGAAGGCC TGATCGCCGA AAAATACATG
CTGCGCGAAG GCCGCGAGCT GATCCCGACG GCCAAGGCCT TCCAGCTCAT GACGCTGCTG
CGCGGGCTGG ATGTGCAGGA GCTGTCCAAG GCCGAGCTGA CCGGCGAGTG GGAGTTCAAG
CTCGCGCAGA TGGAACACGG CAAGCTCAGC CGCGAGACCT TCATGGCCGA GATTGCCGCC
ATGACGAAGA ACCTGGTCGC CAAGGCCAAG GGTTACAGCC GCGACAGCGT GCCCGGCGAC
TATGCGACGC TGCAGGCGCG CTGCCCGAAC TGCGGCGGCG TGATCAAGGA AAACTACCGC
CGCTATACCT GCACCGGCGC CACCGGCGAC GGCGAAGGCT GCGGCTTCAG CTTTGGCAAA
ACGCCGGCCG GCCGGACCTT TGAGCTGGCG GAAGTCGAGC AGTTTTTGCG CGACAAGAAA
ATCGGCCCGC TGGAGGGTTT TCGCTCCAAG GCGGGCTGGC CGTTCACCGC CGAGATGGTG
CTCAAGTTCG ACGAGGAAAC GAAAAACTAC AAGCTCGAAT TCGATTTCGG CGACGACAAA
AATGCCGACA CCGGCGAGAT CGTCGATTTT GGCGACCAGG AATCCCTGGG CGCGTGCCCG
AAATGCGCCG CTAACGTGTA CGAGCTGGGC AAGAACTACG TCTGCGAAAA ATCCGTGCCC
ACGCTGGAGC AGCCGACGCC AAGCTGCGAC TTCAAGACCG GCCAGGTGAT CCTGCAGCAG
CCCATCGAGC GCGAGCAGAT GAGCAAACTG CTGGCCACCG GCAAGACCGA CCTGCTCGAC
AAGTTCGTCT CGATGCGCAC GCGCCGCGCC TTCAAGGCCA TGCTGGTGTG GGACGCGGAA
GCGGGCAAGG TGAATTTTGA ATTCGCCCCG TCCAAGTTCC CGCCCAAGCC GGGCGCGGCG
CCCAGGGCCG GCGCTGGCAC GATCAAGACG CCGTTCGGCA AGACCGTGGC GGTCAAGGCT
GCGGCCGCGC CGGCTGCGAA GAAAGCCGCG GTCAAGAAAG TGGCCGCCAA GAAATCAGCC
GCCGCCAGCG ACAAGCCCAA GGCGGTTCGC AAGGCGGCCG CGCCCGGCGC CGGCCTCAAG
CCCAGCGACG CGCTGGCCGC CATCATCGGC AGCGAACAGG TCGCGCGTCC GCAGGTCATC
AAGAAGCTGT GGGACTACAT CAAGGACCAG AACCTGCAGG ACCCGGCCAA CAAGCGCGCC
ATCAACGCCG ACGCCAAGCT GCTGCCGGTG TTCGGCAAGC CGCAGGTGAC GATGTTCGAG
CTGGCAGGCA TCGTGGGCAA GCATTTGAGC TGA
 
Protein sequence
MKTLVIAEKP SVAQDIVRAL TPTAGKFEKH DEYFESDDWV ITSAVGHLVE IQAPEEFDVK 
RGKWSFANLP VIPPYFDLKP MDKTKTRLNA IVKLTKRKDI GALVNACDAG REGELIFRLI
QQYAKSKHPV RRLWLQSMTP AAIRDGFAAL RSDAQMLPLA DAARCRSEAD WLVGINGTRA
MTAFNSRDGG FFLTTVGRVQ TPTLSVVVER EEKIRKFVSR DYWEIHALFS AEAGEYPGKW
FDPKFKRAAA NADNAEPDPE LKADRLWTER EARAIADAVR GQTATVTEES KPTTQAPGLL
FDLTSLQREA NGKFGFSAKT TLSIAQSLYE RHKALTYPRT DSRALPEDYV PVVKQTFEML
ATSGMNHLAP HAATALKGNY IKPTKRIFDN AKVSDHFAII PTLQAPSGLS EAEQKLYDLV
VRRFMAVFFP NAEYLVTTRI SVSVGHSFKT EGKVLVKPGW LAIYGKEAAA EVPEAKEGDK
GQSLVPVKPG ERVGVEAADA KALKTRAPAR YSEATLLGAM EGAGKTIDDD ELREAMQEKG
LGTPATRAAT IEGLIAEKYM LREGRELIPT AKAFQLMTLL RGLDVQELSK AELTGEWEFK
LAQMEHGKLS RETFMAEIAA MTKNLVAKAK GYSRDSVPGD YATLQARCPN CGGVIKENYR
RYTCTGATGD GEGCGFSFGK TPAGRTFELA EVEQFLRDKK IGPLEGFRSK AGWPFTAEMV
LKFDEETKNY KLEFDFGDDK NADTGEIVDF GDQESLGACP KCAANVYELG KNYVCEKSVP
TLEQPTPSCD FKTGQVILQQ PIEREQMSKL LATGKTDLLD KFVSMRTRRA FKAMLVWDAE
AGKVNFEFAP SKFPPKPGAA PRAGAGTIKT PFGKTVAVKA AAAPAAKKAA VKKVAAKKSA
AASDKPKAVR KAAAPGAGLK PSDALAAIIG SEQVARPQVI KKLWDYIKDQ NLQDPANKRA
INADAKLLPV FGKPQVTMFE LAGIVGKHLS