Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_0155 |
Symbol | topB |
ID | 4899330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 146666 |
End bp | 149386 |
Gene Length | 2721 bp |
Protein Length | 906 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640133385 |
Product | DNA topoisomerase III |
Protein accession | YP_001064439 |
Protein GI | 126453699 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGACCC GCTTTTTTGC TCACGCCGGC CGCCATTTGG GCGTCGTGTG CTATAAAGCC GTCGTAAAAG GGCCCGCATC GGGCTCGCCC GGTCTATCAA TACACACTGT CATGTCGAAA GCACTCATCA TTGCGGAAAA GCCTTCGGTC GCGAACGACA TCGCGCGTGC TTTGGGCGGC TTTACCAAGC ATGACGAATA CTACGAGAGC GACGAATACG TGCTGTCGTC GGCGGTCGGC CACCTGCTCG AAATCGCCGC GCCCGAGGAA TACGAAGTCA AGCGCGGCAA ATGGAGCTTC GCGCATCTGC CCGTCATCCC TCCCCATTTC GATCTGAATC CGATCGCGAA AAGCGAGTCG CGCCTGAAGG TGCTGACGAA GCTGATCAAG CGCAAGGATG TCGACCGTCT GATCAACGCA TGCGACGCGG GGCGCGAGGG CGAGCTGATC TTCCGCCTGA TCGCGCAGCA CGCGAAGGCC AAGCAGCCGG TCCAGCGCCT GTGGCTGCAG TCGATGACGC CCGCCGCGAT CCGCGACGGC TTCGCGCACC TGCGCACCGA CATGGACATG CAGCCGCTCG CCGATGCCGC GCGTTGCCGC TCGGAGGCCG ACTGGCTCGT CGGCATCAAC GGCACGCGCG CGATGACGGC GTTCAACAGC AAGGGCGGCG GCTTCTTCCT GACGACCGTC GGCCGCGTGC AGACGCCCAC GCTGTCGATC GTCGTCGAGC GCGAGGAGAA GATTCGCCGC TTCGTGCCGC GCGACTACTG GGAAGTGCGC GCGGAATTCG TCTGCGCCGG CGGTTTCTAC GAAGGCCGCT GGTTCGATCC GAAGTTCAAG AAGGACGAAT TCGACCCTGA AAAACGCGAT TCGCGCCTCT GGAGCCTGCC TGCCGCAGAG ACGATCGTCG CCGCGTGCCG CGACCACCTC GGCACGGTCA CCGAGGAATC GAAGCCGTCG ACGCAGCTTT CGCCGCTGCT GTACGACCTG ACGAGCCTGC AGCGCGAGGC GAACAGCCGC TTCGGCTTCT CCGCGAAGAA CACGCTCGGC CTGGCGCAGG CGCTGTACGA GAAGCACAAG GTGCTCACCT ATCCGCGTAC CGACGCGCGC GCGCTGCCGG AGGACTACCT CGGCACGGTG AAGTCGACGC TCGAGATGCT CAAGGAGAGC AACAACTACC TGCCGCACGC GAAGCAGGTG CTCGACAAGA ACTGGGTGAA GCCGAACAAG CGCATCTTCG ACAACTCGAA GATCAGCGAT CACTTCGCGA TCATCCCGAC GCTGCAGGCG CCGAAATCGC TGTCCGAGCC GGAACAGAAG CTCTACGACC TCGTCGTCAA GCGCTTCCTC GCGGTGTTCT TCCCGGCCGC CGAATTCAAG GTGACCACGC GGATCACCGA GGTTGCCGGC CATCACTTCA AGACGGAAGG CAAGGTGCTC GTCGAGCCCG GCTGGCTGCA GGTCTACGGC CGTGACGCCG AGGGCGCCGA CGCGAATCTC GTGCCGGTGC AGAAGGGCGA GAAGGTCAAG ACCGACAAGA TCGCCGCGCA CGGCCTCACG ACGAAGCCGC CCGCACGCTA TTCGGAAGCG ACGCTGCTGT CGGCGATGGA AGGCGCGGGC AAGCTCGTCG AAGACGACGA ACTGCGCGAG GCGATGGCCG CGAAGGGCCT CGGCACGCCG GCCACGCGCG CGGCGATCAT CGAAGGCCTG CTCGGCGAGA AGTACCTCGT GCGCGAAGGC CGCGAGCTGA TTCCGACCGC GAAGGCGTTC CAGCTGATGA CCCTGTTGCG CGGCCTCGGC GTGAAGGAGC TGACCGCGCC CGAGCTGACG GGCGAATGGG AATACAAGCT GTCGCAGATG GAGCGCGGCA ACCTGCAGCG CGACGCGTTC ATGCAGGAAA TCGCGCGGAT GACGCAGACG ATCGTCAAGC GCGCGAAGGA ATACGACTCC GACACGATCC CGGGCGACTA CGCGACGCTC GAAACGCCGT GCCCGAACTG CGGCGGCCAG GTCAAGGAGA ACTATCGGCG CTTCGCGTGC ACGAAATGCG AATTCTCGAT CTCGAAGATT CCGGGCAGCC GGCAGTTCGA GATCGCCGAA GTCGAGGAAC TGCTGCGGAA GAAGGAGATC GGGCCGCTGT CGGGCTTCCG CAGCAAGATG GGCCGACCGT TCTCGGCGAT CCTCAAGCTC ACGTTCGACG ACGAGACGAA GAATTACAAG CTCGAATTCG ACTTCGGCCA GGAGCAAGGC GGCGAGGAAG GCGAAGCGCC CGATTTCTCC GCGCAGGAGC CGGTCGGCGC GTGCCCGAAG TGCAAAGGCC GCGTGTTCGA GCACGGCATG AGCTATGTCT GCGAGCACGC GGTCGCGAAC CCAAAGACCT GCGACTTCCG CTCCGGCAAG GTGATCCTGC AGCAGGAGAT CACCCGCGAG CAGATGGCGA AACTCCTCGA GAACGGCCGC ACCGATCTGC TGCCGAACTT CAAGTCGTCG CGCACCGGGC GCAACTTCAA GGCGTATCTC GTCAAGCAGC CGGACGGCAA GATCGGCTTC GAGTTCGAGA AGAAGGAGCC GAAGCCCGCA GCCGCGAAGA AGACCGCGGC CAGATCGTCG GCCGCGGCCG ACGATGCGGC AGCCGACAGC GGGGAGAAAG CCGAGAAGAA GGCGGCACCT GCGCGCAAGA CGGCCGCGCG CAAGACGCCG GCCCGCAAGA CGGGCTCGTG A
|
Protein sequence | MMTRFFAHAG RHLGVVCYKA VVKGPASGSP GLSIHTVMSK ALIIAEKPSV ANDIARALGG FTKHDEYYES DEYVLSSAVG HLLEIAAPEE YEVKRGKWSF AHLPVIPPHF DLNPIAKSES RLKVLTKLIK RKDVDRLINA CDAGREGELI FRLIAQHAKA KQPVQRLWLQ SMTPAAIRDG FAHLRTDMDM QPLADAARCR SEADWLVGIN GTRAMTAFNS KGGGFFLTTV GRVQTPTLSI VVEREEKIRR FVPRDYWEVR AEFVCAGGFY EGRWFDPKFK KDEFDPEKRD SRLWSLPAAE TIVAACRDHL GTVTEESKPS TQLSPLLYDL TSLQREANSR FGFSAKNTLG LAQALYEKHK VLTYPRTDAR ALPEDYLGTV KSTLEMLKES NNYLPHAKQV LDKNWVKPNK RIFDNSKISD HFAIIPTLQA PKSLSEPEQK LYDLVVKRFL AVFFPAAEFK VTTRITEVAG HHFKTEGKVL VEPGWLQVYG RDAEGADANL VPVQKGEKVK TDKIAAHGLT TKPPARYSEA TLLSAMEGAG KLVEDDELRE AMAAKGLGTP ATRAAIIEGL LGEKYLVREG RELIPTAKAF QLMTLLRGLG VKELTAPELT GEWEYKLSQM ERGNLQRDAF MQEIARMTQT IVKRAKEYDS DTIPGDYATL ETPCPNCGGQ VKENYRRFAC TKCEFSISKI PGSRQFEIAE VEELLRKKEI GPLSGFRSKM GRPFSAILKL TFDDETKNYK LEFDFGQEQG GEEGEAPDFS AQEPVGACPK CKGRVFEHGM SYVCEHAVAN PKTCDFRSGK VILQQEITRE QMAKLLENGR TDLLPNFKSS RTGRNFKAYL VKQPDGKIGF EFEKKEPKPA AAKKTAARSS AAADDAAADS GEKAEKKAAP ARKTAARKTP ARKTGS
|
| |