Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_5202 |
Symbol | |
ID | 7969861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 5523766 |
End bp | 5526708 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644795796 |
Product | DNA topoisomerase III |
Protein accession | YP_002947070 |
Protein GI | 239818160 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.221506 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACCT TGGTAATCGC AGAAAAGCCG TCGGTGGCAC AGGACATCGT CCGTGCACTC ACGCCCGTGG CCGGCAAGTT CGACAAGCAT GACGAGCATT TCGAGAACGA CAGCTACGTG GTAACCAGCG CGGTCGGCCA CCTGGTCGAA ATCCAGGCCC CGGAGGAGTT CGACGTCAAG CGGGGCAAGT GGAGCTTCGC GAACCTGCCG GTGATTCCGC CGCATTTCGA CCTGAAGCCG GTCGACAAGA CCAAGACGCG CCTGAACGCC GTGGTCAAGC AGGCCAAGCG CAAGGACGTG ACGCAACTCA TCAACGCCTG CGACGCGGGC CGCGAGGGCG AGCTGATCTT CCGCCTGATC GAGCAGTACG CGGGCGGCAA GACCGGCCTG AACAAGCCCG TGAAGCGCCT GTGGCTGCAG TCGATGACGC CGCAGGCCAT CCGCGACGGC TTCGACGCGC TGCGCACCGA AAAGCAGATG CAGGGCCTGG CCGACGCCGC GCGCTCGCGC TCGGAGGCCG ACTGGCTGGT GGGCATCAAC GGCACGCGCG CCATGACGGC CTTCAATTCG CGCGACGGCG GCTTCTTTTT GACGACCGTG GGCCGCGTGC AGACGCCCAC GCTGTCGGTG GTGGTCGAGC GCGAGGAAAA GATCCGCAAG TTCGTGAGCC GCGACTACTG GGAAATCCAC GGTGTGTTCC AGGCCGAGGC CGGCCAGTAC CCGGGCAAGT GGTTCGATGC CAACTTCAAG AAGCCGCCGC CCGGCCCCGA CGGCACGGCC GACGCCGAGA TCCGCGCCGA CCGCGTGTGG AGCGAGCGCG AGGCGCGCGA GATCGCCGAT GCGGCGCGCG GCAAGCCCGC CAGCGTCACC GAGGAAAGCA AGCCCACCAC GCAGGCTTCG CCGATGCTGT TCGACCTCAC CTCGCTGCAG CGCGAGGCCA ACGGCCGCTT CGGCTTCTCG GCCAAGACCA CGCTGGCGCT GGCGCAGAGC CTGTACGAGC GCCACAAGGC GCTGACCTAT CCGCGGACCG ACTCGCGCGC ACTGCCCGAG GACTACCTGC CGGTGGTCAA GGACACCATG AAGATGCTCG CCGAGAGCGG CATGAAGCAC TTGGCGCCCT TCGCGCAGCA GGCGGTCGAC GGCAGCTACG TGAAGCCGAA CAAGCGCATC TTCGACAACG CCAAGGTGTC GGATCACTTC GCGATCATTC CCACGCTGCA GGCGCCGAGC GGCCTGTCCG ACGCCGAGCA GAAGCTCTAC GACTTCGTGG TGCGCCGCTT CATGTCGGTG TTCTTCCCGA GCGCCGAGTT CCAGGTGACC ACCCGGATCA GCACGGTGGA GGCCGGCGGC AAGAAGTACC CGTTCCGCAG CGACGGCAAG GTGCTGGTCA AGCCGGGCTG GCTCGCGATC TGGGGCAAGG AAGCCATCAG CGACGACGAC GAGAAGGACG GCAAGAACCT GGTGGTGGTG AAGCCGGGCG AAACGGTGAA GACCGAATCG GCCGACCTGA AGGCGCTGAA GACCCGGCCG CCCGCGCGCT ATTCGGAAGC CACGCTGCTG GGCGCGATGG AAGGCGCCGG CAAGACCATC GACGACGACG AGCTGCGCGA GGCCATGCAG GAAAAAGGCC TGGGCACGCC AGCCACGCGC GCGGCCACCA TCGAAGGCCT GATCACCGAG AAATACATGC TGCGCGAAGG CCGCGAGCTG ATCCCCACGG CCAAGGCCTT CCAGCTCATG ACGCTGCTGC GCGGCCTGGG CGTGGAAGAA CTCTCCAAGG CCGAGCTCAC GGGCGAGTGG GAATACAAGC TCGCGCAGAT GGAGAAGGGC GCGCTGAGCC GCGACGCCTT CATGCGCGAG ATCGCCGAGA TGACGCAGCA CATCGTCAAG AAGGCCAAGG AATACGACCG CGACACCGTG CCGGGCGACT ACGCCACGCT GTCGACCCCA TGCCCCAACT GCGGCGGCGT GGTGAAGGAA AACTACCGCC GCTACAGCTG CACCGGCAAG GCCGGGCAGG AGCCCTGCGG CTTCTCGTTC GGCAAGTCGC CGGCGGGGCG CACCTTCGAG GTGGCCGAGG CGGAAGTGCT GCTGCGCGAC AAGCACATCG GCCCGCTGGA GGGCTTCCGC TCCAAGGCGG GCTGGCCCTT CACGTCCGAG ATCGTCCTCA AGTACGACGA AGAGGCGAAG AACTGGAAGC TGGAATTCGA CTTCGGCGAC GACAAGAACG CCGACACCGG CGAGATCGTC GATTTCAGCG AGCAGGACAC GGTGGGCCCG TGCCCGATCT GCGGCGCGCC GGTGTTCGAG CACGGCAGCA ACTACGTCTG CGAGAAGTCG GTGCCCACCA CTGCGCAGCC GACGCCCAGC TGCACCTTCA AGACCGGCAA GATCATCCTG CAGCAGCCGG TGGAGCGCGC GCAGATGGAA AAGCTGCTGG CCACGGGCAA GACCGACCTG CTCGACAAGT TCGTGAGCAT GCGTACGCGC CGCGCCTTCA AGGCCTTCCT GACCTGGAAC GCCGAGGAGG GCAAGGTGAC CTTCGAATTC GCGCCGCGCG AAGGCGGCAG CAAGTTCCCG CCGCGCAAGA CCTTCGGCAA GGCCGCGCCG GCGGGCAAGA CCGCGGCGGC CAAGAAGGTG GCGGCCAAGA AGACGCCCGC CGCCAAGAAG GCACCGGCCG CGAAGAAGGC TGCCGCGCCG CGCAAGCCGG GCGCGGGCCT CAAGCCCAGC GACTCGCTGG CCGCGGTGAT CGGCGCCGAG CCGGTGGCGC GCACCGAGGT CATCAAGAAG CTCTGGGACT ACATCAAGGC CAACGGCCTG CAGGACGCGG CCAACAAGCG CGCGATCAAT GCCGACGCCA AGCTCAAGCC GGTGTTCGGC AAAGACCAGG TGACGATGTT CGAGCTCGCG GGCATCGTGG GCAAGCACCT GTCGGCGCCA TGA
|
Protein sequence | MKTLVIAEKP SVAQDIVRAL TPVAGKFDKH DEHFENDSYV VTSAVGHLVE IQAPEEFDVK RGKWSFANLP VIPPHFDLKP VDKTKTRLNA VVKQAKRKDV TQLINACDAG REGELIFRLI EQYAGGKTGL NKPVKRLWLQ SMTPQAIRDG FDALRTEKQM QGLADAARSR SEADWLVGIN GTRAMTAFNS RDGGFFLTTV GRVQTPTLSV VVEREEKIRK FVSRDYWEIH GVFQAEAGQY PGKWFDANFK KPPPGPDGTA DAEIRADRVW SEREAREIAD AARGKPASVT EESKPTTQAS PMLFDLTSLQ REANGRFGFS AKTTLALAQS LYERHKALTY PRTDSRALPE DYLPVVKDTM KMLAESGMKH LAPFAQQAVD GSYVKPNKRI FDNAKVSDHF AIIPTLQAPS GLSDAEQKLY DFVVRRFMSV FFPSAEFQVT TRISTVEAGG KKYPFRSDGK VLVKPGWLAI WGKEAISDDD EKDGKNLVVV KPGETVKTES ADLKALKTRP PARYSEATLL GAMEGAGKTI DDDELREAMQ EKGLGTPATR AATIEGLITE KYMLREGREL IPTAKAFQLM TLLRGLGVEE LSKAELTGEW EYKLAQMEKG ALSRDAFMRE IAEMTQHIVK KAKEYDRDTV PGDYATLSTP CPNCGGVVKE NYRRYSCTGK AGQEPCGFSF GKSPAGRTFE VAEAEVLLRD KHIGPLEGFR SKAGWPFTSE IVLKYDEEAK NWKLEFDFGD DKNADTGEIV DFSEQDTVGP CPICGAPVFE HGSNYVCEKS VPTTAQPTPS CTFKTGKIIL QQPVERAQME KLLATGKTDL LDKFVSMRTR RAFKAFLTWN AEEGKVTFEF APREGGSKFP PRKTFGKAAP AGKTAAAKKV AAKKTPAAKK APAAKKAAAP RKPGAGLKPS DSLAAVIGAE PVARTEVIKK LWDYIKANGL QDAANKRAIN ADAKLKPVFG KDQVTMFELA GIVGKHLSAP
|
| |