Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnap_3963 |
Symbol | |
ID | 4687732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas naphthalenivorans CJ2 |
Kingdom | Bacteria |
Replicon accession | NC_008781 |
Strand | - |
Start bp | 4229131 |
End bp | 4232103 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639836980 |
Product | DNA topoisomerase III |
Protein accession | YP_984179 |
Protein GI | 121606850 |
COG category | [B] Chromatin structure and dynamics [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA [COG5531] SWIB-domain-containing proteins implicated in chromatin remodeling |
TIGRFAM ID | [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.107223 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0620528 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAC TGGTAATTGC CGAAAAACCA TCGGTCGCGC AAGACATCGT GCGCGCCCTC ACGCCCACGG CGGGCAAGTT TGAAAAGCAC GACGAGTACT TTGAAAGCGA CGACTGGGTC ATCACCTCCG CCGTCGGCCA CCTGGTCGAA ATCCAGGCGC CCGAAGAGTT CGACGTGAAG CGCGGCAAGT GGAGCTTTGC GAACCTGCCG GTGATTCCGC CGTACTTCGA CTTGAAGCCG ATGGACAAGA CCAAGACGCG GCTCAACGCC ATCGTCAAGC TGACCAAGCG AAAAGACATA GGCGCGCTGG TCAACGCCTG CGACGCGGGC CGCGAGGGCG AACTGATCTT CCGGCTGATC CAGCAATACG CCAAGAGCAA GCACCCGGTC AGGCGGCTGT GGCTGCAGTC GATGACGCCG GCGGCCATCC GCGACGGCTT TGCCGCGCTG CGCAGCGACG CGCAGATGCT GCCGCTGGCC GACGCCGCGC GCTGCCGCTC CGAGGCCGAC TGGCTGGTCG GCATCAACGG CACGCGCGCC ATGACGGCCT TCAATTCGCG CGACGGCGGC TTCTTCCTGA CCACCGTCGG CCGGGTGCAG ACGCCGACGC TGTCGGTGGT CGTCGAGCGC GAGGAAAAAA TCCGCAAGTT CGTCAGCCGC GACTACTGGG AAATCCACGC CCTGTTTTCG GCCGAAGCCG GCGAATACCC GGGCAAATGG TTCGACCCGA AGTTCAAGCG CGCCGCCGCC AATGCAGACA ATGCCGAGCC CGACCCCGAG CTGAAGGCCG ACCGGCTGTG GACCGAGCGC GAGGCGCGCG CGATTGCCGA CGCGGTGCGC GGCCAGACCG CCACGGTGAC CGAGGAATCG AAACCCACCA CGCAGGCGCC GGGGCTGCTG TTCGATTTGA CTTCTTTGCA GCGCGAAGCC AACGGCAAGT TCGGCTTTTC GGCCAAGACC ACGCTGTCGA TTGCGCAAAG CCTGTACGAG CGCCACAAGG CGCTGACCTA CCCGCGCACC GACTCGCGCG CGCTGCCCGA GGACTATGTG CCGGTGGTGA AGCAGACCTT CGAGATGCTG GCCACGAGCG GCATGAACCA CCTCGCGCCG CATGCCGCGA CGGCCCTCAA GGGCAATTAC ATCAAGCCGA CCAAGCGCAT CTTCGACAAC GCCAAGGTCA GCGACCACTT CGCCATCATC CCGACACTGC AGGCGCCCAG CGGCCTGAGC GAGGCCGAGC AAAAGCTCTA TGACTTGGTG GTGCGCCGCT TCATGGCGGT GTTTTTCCCG AACGCCGAAT ACCTGGTGAC GACGCGGATT TCGGTGTCGG TCGGCCACAG CTTCAAGACC GAAGGCAAGG TGCTGGTCAA GCCGGGCTGG CTGGCGATCT ACGGCAAGGA AGCTGCCGCC GAAGTGCCCG AGGCCAAGGA AGGCGACAAG GGCCAGAGCC TCGTGCCGGT CAAACCCGGC GAGCGCGTCG GCGTCGAGGC CGCCGATGCC AAGGCCCTGA AAACCCGCGC GCCCGCCCGC TACAGCGAAG CCACGCTGCT CGGCGCGATG GAAGGCGCGG GCAAGACGAT TGACGACGAC GAGCTGCGCG AAGCCATGCA GGAAAAAGGC CTGGGCACGC CGGCCACCCG CGCCGCCACG ATTGAAGGCC TGATCGCCGA AAAATACATG CTGCGCGAAG GCCGCGAGCT GATCCCGACG GCCAAGGCCT TCCAGCTCAT GACGCTGCTG CGCGGGCTGG ATGTGCAGGA GCTGTCCAAG GCCGAGCTGA CCGGCGAGTG GGAGTTCAAG CTCGCGCAGA TGGAACACGG CAAGCTCAGC CGCGAGACCT TCATGGCCGA GATTGCCGCC ATGACGAAGA ACCTGGTCGC CAAGGCCAAG GGTTACAGCC GCGACAGCGT GCCCGGCGAC TATGCGACGC TGCAGGCGCG CTGCCCGAAC TGCGGCGGCG TGATCAAGGA AAACTACCGC CGCTATACCT GCACCGGCGC CACCGGCGAC GGCGAAGGCT GCGGCTTCAG CTTTGGCAAA ACGCCGGCCG GCCGGACCTT TGAGCTGGCG GAAGTCGAGC AGTTTTTGCG CGACAAGAAA ATCGGCCCGC TGGAGGGTTT TCGCTCCAAG GCGGGCTGGC CGTTCACCGC CGAGATGGTG CTCAAGTTCG ACGAGGAAAC GAAAAACTAC AAGCTCGAAT TCGATTTCGG CGACGACAAA AATGCCGACA CCGGCGAGAT CGTCGATTTT GGCGACCAGG AATCCCTGGG CGCGTGCCCG AAATGCGCCG CTAACGTGTA CGAGCTGGGC AAGAACTACG TCTGCGAAAA ATCCGTGCCC ACGCTGGAGC AGCCGACGCC AAGCTGCGAC TTCAAGACCG GCCAGGTGAT CCTGCAGCAG CCCATCGAGC GCGAGCAGAT GAGCAAACTG CTGGCCACCG GCAAGACCGA CCTGCTCGAC AAGTTCGTCT CGATGCGCAC GCGCCGCGCC TTCAAGGCCA TGCTGGTGTG GGACGCGGAA GCGGGCAAGG TGAATTTTGA ATTCGCCCCG TCCAAGTTCC CGCCCAAGCC GGGCGCGGCG CCCAGGGCCG GCGCTGGCAC GATCAAGACG CCGTTCGGCA AGACCGTGGC GGTCAAGGCT GCGGCCGCGC CGGCTGCGAA GAAAGCCGCG GTCAAGAAAG TGGCCGCCAA GAAATCAGCC GCCGCCAGCG ACAAGCCCAA GGCGGTTCGC AAGGCGGCCG CGCCCGGCGC CGGCCTCAAG CCCAGCGACG CGCTGGCCGC CATCATCGGC AGCGAACAGG TCGCGCGTCC GCAGGTCATC AAGAAGCTGT GGGACTACAT CAAGGACCAG AACCTGCAGG ACCCGGCCAA CAAGCGCGCC ATCAACGCCG ACGCCAAGCT GCTGCCGGTG TTCGGCAAGC CGCAGGTGAC GATGTTCGAG CTGGCAGGCA TCGTGGGCAA GCATTTGAGC TGA
|
Protein sequence | MKTLVIAEKP SVAQDIVRAL TPTAGKFEKH DEYFESDDWV ITSAVGHLVE IQAPEEFDVK RGKWSFANLP VIPPYFDLKP MDKTKTRLNA IVKLTKRKDI GALVNACDAG REGELIFRLI QQYAKSKHPV RRLWLQSMTP AAIRDGFAAL RSDAQMLPLA DAARCRSEAD WLVGINGTRA MTAFNSRDGG FFLTTVGRVQ TPTLSVVVER EEKIRKFVSR DYWEIHALFS AEAGEYPGKW FDPKFKRAAA NADNAEPDPE LKADRLWTER EARAIADAVR GQTATVTEES KPTTQAPGLL FDLTSLQREA NGKFGFSAKT TLSIAQSLYE RHKALTYPRT DSRALPEDYV PVVKQTFEML ATSGMNHLAP HAATALKGNY IKPTKRIFDN AKVSDHFAII PTLQAPSGLS EAEQKLYDLV VRRFMAVFFP NAEYLVTTRI SVSVGHSFKT EGKVLVKPGW LAIYGKEAAA EVPEAKEGDK GQSLVPVKPG ERVGVEAADA KALKTRAPAR YSEATLLGAM EGAGKTIDDD ELREAMQEKG LGTPATRAAT IEGLIAEKYM LREGRELIPT AKAFQLMTLL RGLDVQELSK AELTGEWEFK LAQMEHGKLS RETFMAEIAA MTKNLVAKAK GYSRDSVPGD YATLQARCPN CGGVIKENYR RYTCTGATGD GEGCGFSFGK TPAGRTFELA EVEQFLRDKK IGPLEGFRSK AGWPFTAEMV LKFDEETKNY KLEFDFGDDK NADTGEIVDF GDQESLGACP KCAANVYELG KNYVCEKSVP TLEQPTPSCD FKTGQVILQQ PIEREQMSKL LATGKTDLLD KFVSMRTRRA FKAMLVWDAE AGKVNFEFAP SKFPPKPGAA PRAGAGTIKT PFGKTVAVKA AAAPAAKKAA VKKVAAKKSA AASDKPKAVR KAAAPGAGLK PSDALAAIIG SEQVARPQVI KKLWDYIKDQ NLQDPANKRA INADAKLLPV FGKPQVTMFE LAGIVGKHLS
|
| |