Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2018 |
Symbol | topB |
ID | 6271493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1836243 |
End bp | 1838204 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641726066 |
Product | DNA topoisomerase III |
Protein accession | YP_001880560 |
Protein GI | 187731958 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000256516 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGTTGT TTATTGCCGA AAAACCGAGT CTGGCGCGCG CCATTGCTGA TGTCCTGCCC AAACCGCACC GGAAAGGCGA TGGCTTTATC GAGTGCGGTA ATGGTCAGGT GGTGACCTGG TGTATCGGTC ACCTGCTTGA GCAGGCGCAG CCAGACGCCT ACGACAGCCG CTATGCGCGC TGGAATCTTG CGGATTTGCC GATTGTCCCG GAAAAGTGGC AATTACAGCC CCGACCCTCC GTGACCAAAC AACTTAACGT CATCAAACGC TTCCTGCATG AAGCCAGCGA AATCGTTCAC GCCGGGGACC CGGATCGTGA AGGGCAACTG CTGGTGGATG AAGTGCTGGA CTATCTGCAA CTGGCACCGG AAAAGCGCCA GCAGGTGCAG CGTTGCTTGA TAAACGACCT GAACCCGCAG GCGGTTGAGC GGGCGATCGA CCGTCTTCGG TCCAACAGTG AGTTTGTGCC ATTGTGCGTT TCTGCGCTGG CGCGAGCGCG TGCCGACTGG CTGTACGGCA TCAATATGAC CCGTGCGTAT ACCATTCTCG GTCGCAATGC CGGTTATCAG GGCGTACTTT CCGTGGGACG CGTGCAGACA CCCGTGCTCG GGCTGGTGGT GCGCCGCGAT GAAGAGATTG AAAACTTCGT GGCGAAAGAC TTCTTTGAAG TCAAAGCACA TATCGTGACA CCTGCCGATG AGCGGTTTAC CGCTATCTGG CAACCGAGCG AAGCGTGTGA ACCGTACCAG GATGAAGAAG GGCGCTTGTT ACATCGTCCA CTGGCGGAGC ATGTGGTTAA CCGCATTAGT GGTCAACCAG CAATTGTCAC CAGCTATAAC GATAAACGGG AATCAGAATC CGCGCCGCTG CCTTTTTCGC TTTCGGCGTT GCAGATTGAA GCGGCAAAAC GCTTTGGTCT GAGCGCGCAG AACGTGCTTG ATATCTGCCA GAAGCTGTAC GAAACGCACA AGCTAATCAC TTATCCGCGT TCCGATTGTC GCTATTTGCC AGAAGAACAT TTTGCCGGAC GCCACGCGGT GATGAATGCC ATCAGCGTTC ATGCACCAGA TCTGTTGCCG CAACCAGTGG TAGATCCAGA TATACGCAAC CGCTGTTGGG ATGACAAAAA GGTCGATGCA CACCACGCCA TCATCCCGAC CGCACGGAGT TCTGAGATCA ACCTGACGGA GAACGAAGCG AAGGTCTATA ACCTGATTGC CCGTCAGTAC TTGATGCAGT TCTGCCCGGA TGCGGTGTTC CGCAAGTGTG TTATCGAACT GGACATTGCC AAAGGCAAAT TTGTCGCTAA AGCGCGTTTT CTTGCTGAAG CAGGCTGGCG CACGCTGTTA GGCAGCAAAG AGCGCGATGA AGAAAACGAC GGTACGCCAT TGCCTGTGGT GGCGAAAGGC GATGAGTTGC TGTGTGAAAA AGGTGAAGTG GTAGAGCGGC AAACCCAGCC GCCGCGCCAT TTTACCGATG CAACACTGCT TTCGGCGATG ACCGGGATCG CGCGCTTTGT GCAGGACAAA GATCTGAAAA AGATCCTTCG TGCGACCGAT GGTCTGGGGA CAGAAGCAAC CCGTGCCGGG ATTATTGAAC TGCTGTTCAA GCGTGGTTTT CTGACCAAAA AAGGGCGCTA TATCCACTCT ACCGACGCCG GAAAAGCGTT ATTCCATTCG CTGCCAGAAA TGGCGACGCG ACCGGACATG ACCGCGCACT GGGAATCGGT GCTGACGCAA ATCAGCGAAA AGCAGTGTCG CTATCAGGAC TTTATGCAGC CGCTGGTGGG GACGCTATAT CAGCTTATTG ATCAAGCCAA ACGTACGCCG GTGCGGCAGT TTCGCGGCAT TGTGGCTCCG GGCAGTGGTG GCAGTGCTGA TAAGAAAAAG GCTGCACCGC GTAAACGTAG TGCGAAAAAA AGTCCGCCAG CAGATGAAGT CGGAAGCGGG GCGATAGCGT AA
|
Protein sequence | MRLFIAEKPS LARAIADVLP KPHRKGDGFI ECGNGQVVTW CIGHLLEQAQ PDAYDSRYAR WNLADLPIVP EKWQLQPRPS VTKQLNVIKR FLHEASEIVH AGDPDREGQL LVDEVLDYLQ LAPEKRQQVQ RCLINDLNPQ AVERAIDRLR SNSEFVPLCV SALARARADW LYGINMTRAY TILGRNAGYQ GVLSVGRVQT PVLGLVVRRD EEIENFVAKD FFEVKAHIVT PADERFTAIW QPSEACEPYQ DEEGRLLHRP LAEHVVNRIS GQPAIVTSYN DKRESESAPL PFSLSALQIE AAKRFGLSAQ NVLDICQKLY ETHKLITYPR SDCRYLPEEH FAGRHAVMNA ISVHAPDLLP QPVVDPDIRN RCWDDKKVDA HHAIIPTARS SEINLTENEA KVYNLIARQY LMQFCPDAVF RKCVIELDIA KGKFVAKARF LAEAGWRTLL GSKERDEEND GTPLPVVAKG DELLCEKGEV VERQTQPPRH FTDATLLSAM TGIARFVQDK DLKKILRATD GLGTEATRAG IIELLFKRGF LTKKGRYIHS TDAGKALFHS LPEMATRPDM TAHWESVLTQ ISEKQCRYQD FMQPLVGTLY QLIDQAKRTP VRQFRGIVAP GSGGSADKKK AAPRKRSAKK SPPADEVGSG AIA
|
| |