Gene Vapar_5202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_5202 
Symbol 
ID7969861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp5523766 
End bp5526708 
Gene Length2943 bp 
Protein Length980 aa 
Translation table11 
GC content67% 
IMG OID644795796 
ProductDNA topoisomerase III 
Protein accessionYP_002947070 
Protein GI239818160 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.221506 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGACCT TGGTAATCGC AGAAAAGCCG TCGGTGGCAC AGGACATCGT CCGTGCACTC 
ACGCCCGTGG CCGGCAAGTT CGACAAGCAT GACGAGCATT TCGAGAACGA CAGCTACGTG
GTAACCAGCG CGGTCGGCCA CCTGGTCGAA ATCCAGGCCC CGGAGGAGTT CGACGTCAAG
CGGGGCAAGT GGAGCTTCGC GAACCTGCCG GTGATTCCGC CGCATTTCGA CCTGAAGCCG
GTCGACAAGA CCAAGACGCG CCTGAACGCC GTGGTCAAGC AGGCCAAGCG CAAGGACGTG
ACGCAACTCA TCAACGCCTG CGACGCGGGC CGCGAGGGCG AGCTGATCTT CCGCCTGATC
GAGCAGTACG CGGGCGGCAA GACCGGCCTG AACAAGCCCG TGAAGCGCCT GTGGCTGCAG
TCGATGACGC CGCAGGCCAT CCGCGACGGC TTCGACGCGC TGCGCACCGA AAAGCAGATG
CAGGGCCTGG CCGACGCCGC GCGCTCGCGC TCGGAGGCCG ACTGGCTGGT GGGCATCAAC
GGCACGCGCG CCATGACGGC CTTCAATTCG CGCGACGGCG GCTTCTTTTT GACGACCGTG
GGCCGCGTGC AGACGCCCAC GCTGTCGGTG GTGGTCGAGC GCGAGGAAAA GATCCGCAAG
TTCGTGAGCC GCGACTACTG GGAAATCCAC GGTGTGTTCC AGGCCGAGGC CGGCCAGTAC
CCGGGCAAGT GGTTCGATGC CAACTTCAAG AAGCCGCCGC CCGGCCCCGA CGGCACGGCC
GACGCCGAGA TCCGCGCCGA CCGCGTGTGG AGCGAGCGCG AGGCGCGCGA GATCGCCGAT
GCGGCGCGCG GCAAGCCCGC CAGCGTCACC GAGGAAAGCA AGCCCACCAC GCAGGCTTCG
CCGATGCTGT TCGACCTCAC CTCGCTGCAG CGCGAGGCCA ACGGCCGCTT CGGCTTCTCG
GCCAAGACCA CGCTGGCGCT GGCGCAGAGC CTGTACGAGC GCCACAAGGC GCTGACCTAT
CCGCGGACCG ACTCGCGCGC ACTGCCCGAG GACTACCTGC CGGTGGTCAA GGACACCATG
AAGATGCTCG CCGAGAGCGG CATGAAGCAC TTGGCGCCCT TCGCGCAGCA GGCGGTCGAC
GGCAGCTACG TGAAGCCGAA CAAGCGCATC TTCGACAACG CCAAGGTGTC GGATCACTTC
GCGATCATTC CCACGCTGCA GGCGCCGAGC GGCCTGTCCG ACGCCGAGCA GAAGCTCTAC
GACTTCGTGG TGCGCCGCTT CATGTCGGTG TTCTTCCCGA GCGCCGAGTT CCAGGTGACC
ACCCGGATCA GCACGGTGGA GGCCGGCGGC AAGAAGTACC CGTTCCGCAG CGACGGCAAG
GTGCTGGTCA AGCCGGGCTG GCTCGCGATC TGGGGCAAGG AAGCCATCAG CGACGACGAC
GAGAAGGACG GCAAGAACCT GGTGGTGGTG AAGCCGGGCG AAACGGTGAA GACCGAATCG
GCCGACCTGA AGGCGCTGAA GACCCGGCCG CCCGCGCGCT ATTCGGAAGC CACGCTGCTG
GGCGCGATGG AAGGCGCCGG CAAGACCATC GACGACGACG AGCTGCGCGA GGCCATGCAG
GAAAAAGGCC TGGGCACGCC AGCCACGCGC GCGGCCACCA TCGAAGGCCT GATCACCGAG
AAATACATGC TGCGCGAAGG CCGCGAGCTG ATCCCCACGG CCAAGGCCTT CCAGCTCATG
ACGCTGCTGC GCGGCCTGGG CGTGGAAGAA CTCTCCAAGG CCGAGCTCAC GGGCGAGTGG
GAATACAAGC TCGCGCAGAT GGAGAAGGGC GCGCTGAGCC GCGACGCCTT CATGCGCGAG
ATCGCCGAGA TGACGCAGCA CATCGTCAAG AAGGCCAAGG AATACGACCG CGACACCGTG
CCGGGCGACT ACGCCACGCT GTCGACCCCA TGCCCCAACT GCGGCGGCGT GGTGAAGGAA
AACTACCGCC GCTACAGCTG CACCGGCAAG GCCGGGCAGG AGCCCTGCGG CTTCTCGTTC
GGCAAGTCGC CGGCGGGGCG CACCTTCGAG GTGGCCGAGG CGGAAGTGCT GCTGCGCGAC
AAGCACATCG GCCCGCTGGA GGGCTTCCGC TCCAAGGCGG GCTGGCCCTT CACGTCCGAG
ATCGTCCTCA AGTACGACGA AGAGGCGAAG AACTGGAAGC TGGAATTCGA CTTCGGCGAC
GACAAGAACG CCGACACCGG CGAGATCGTC GATTTCAGCG AGCAGGACAC GGTGGGCCCG
TGCCCGATCT GCGGCGCGCC GGTGTTCGAG CACGGCAGCA ACTACGTCTG CGAGAAGTCG
GTGCCCACCA CTGCGCAGCC GACGCCCAGC TGCACCTTCA AGACCGGCAA GATCATCCTG
CAGCAGCCGG TGGAGCGCGC GCAGATGGAA AAGCTGCTGG CCACGGGCAA GACCGACCTG
CTCGACAAGT TCGTGAGCAT GCGTACGCGC CGCGCCTTCA AGGCCTTCCT GACCTGGAAC
GCCGAGGAGG GCAAGGTGAC CTTCGAATTC GCGCCGCGCG AAGGCGGCAG CAAGTTCCCG
CCGCGCAAGA CCTTCGGCAA GGCCGCGCCG GCGGGCAAGA CCGCGGCGGC CAAGAAGGTG
GCGGCCAAGA AGACGCCCGC CGCCAAGAAG GCACCGGCCG CGAAGAAGGC TGCCGCGCCG
CGCAAGCCGG GCGCGGGCCT CAAGCCCAGC GACTCGCTGG CCGCGGTGAT CGGCGCCGAG
CCGGTGGCGC GCACCGAGGT CATCAAGAAG CTCTGGGACT ACATCAAGGC CAACGGCCTG
CAGGACGCGG CCAACAAGCG CGCGATCAAT GCCGACGCCA AGCTCAAGCC GGTGTTCGGC
AAAGACCAGG TGACGATGTT CGAGCTCGCG GGCATCGTGG GCAAGCACCT GTCGGCGCCA
TGA
 
Protein sequence
MKTLVIAEKP SVAQDIVRAL TPVAGKFDKH DEHFENDSYV VTSAVGHLVE IQAPEEFDVK 
RGKWSFANLP VIPPHFDLKP VDKTKTRLNA VVKQAKRKDV TQLINACDAG REGELIFRLI
EQYAGGKTGL NKPVKRLWLQ SMTPQAIRDG FDALRTEKQM QGLADAARSR SEADWLVGIN
GTRAMTAFNS RDGGFFLTTV GRVQTPTLSV VVEREEKIRK FVSRDYWEIH GVFQAEAGQY
PGKWFDANFK KPPPGPDGTA DAEIRADRVW SEREAREIAD AARGKPASVT EESKPTTQAS
PMLFDLTSLQ REANGRFGFS AKTTLALAQS LYERHKALTY PRTDSRALPE DYLPVVKDTM
KMLAESGMKH LAPFAQQAVD GSYVKPNKRI FDNAKVSDHF AIIPTLQAPS GLSDAEQKLY
DFVVRRFMSV FFPSAEFQVT TRISTVEAGG KKYPFRSDGK VLVKPGWLAI WGKEAISDDD
EKDGKNLVVV KPGETVKTES ADLKALKTRP PARYSEATLL GAMEGAGKTI DDDELREAMQ
EKGLGTPATR AATIEGLITE KYMLREGREL IPTAKAFQLM TLLRGLGVEE LSKAELTGEW
EYKLAQMEKG ALSRDAFMRE IAEMTQHIVK KAKEYDRDTV PGDYATLSTP CPNCGGVVKE
NYRRYSCTGK AGQEPCGFSF GKSPAGRTFE VAEAEVLLRD KHIGPLEGFR SKAGWPFTSE
IVLKYDEEAK NWKLEFDFGD DKNADTGEIV DFSEQDTVGP CPICGAPVFE HGSNYVCEKS
VPTTAQPTPS CTFKTGKIIL QQPVERAQME KLLATGKTDL LDKFVSMRTR RAFKAFLTWN
AEEGKVTFEF APREGGSKFP PRKTFGKAAP AGKTAAAKKV AAKKTPAAKK APAAKKAAAP
RKPGAGLKPS DSLAAVIGAE PVARTEVIKK LWDYIKANGL QDAANKRAIN ADAKLKPVFG
KDQVTMFELA GIVGKHLSAP