Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2483 |
Symbol | topB |
ID | 6970094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2350805 |
End bp | 2352766 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643386352 |
Product | DNA topoisomerase III |
Protein accession | YP_002270834 |
Protein GI | 209398550 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000298229 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.566589 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTTGT TTATTGCCGA AAAACCGAGT CTGGCGCGCG CCATTGCTGA TGTCCTGCCC AAACCGCACC GGAAAGGCGA TGGCTTTATC GAGTGCGGTA ATGGTCAGGT GGTGACCTGG TGTATCGGTC ACCTGCTTGA GCAGGCGCAG CCAGACGCCT ACGACAGCCG CTATGCGCGC TGGAATCTTG CGGATTTGCC GATTGTCCCG GAAAAGTGGC AATTACAGCC CCGACCCTCC GTGACCAAAC AACTTAACGT CATCAAACGG TACCTGCATG AAGCCAGCGA AATCGTTCAC GCCGGGGACC CGGATCGCGA AGGGCAACTG CTGGTGGATG AAGTGCTGGA CTATCTACAA CTGGCACCGG AAAAGCGCCA GCAGGTGCAG CGTTGCTTGA TAAACGACCT GAACCCGCAG GCGGTTGAGC GGGCGATCGA CCGTCTTCGT TCCAACAGTG AGTTTGTACC GCTGTGCGTT TCTGCGCTGG CGCGAGCGCG TGCCGACTGG CTGTACGGCA TCAATATGAC CCGTGCGTAT ACCATTCTCG GTCGCAATGC CGGTTATCAG GGCGTGCTTT CCGTGGGACG CGTGCAGACG CCCGTGCTTG GGCTGGTGGT GCGCCGCGAT GAAGAGATTG AAAACTTCGT GGCGAAAGAC TTCTTTGAAG TCAAAGCACA TATCGTGACA CCTGCCGATG AGCGGTTTAC CGCTATCTGG CAACCGAGCG AAGCGTGTGA ACCGTACCAG GATGAAGAAG GGCGCTTGTT ACATCGTCCA CTGGCGGAGC ATGTGGTTAA CCGCATTAGT GGTCAACCGG CTATTGTCAC CAGCTATAAC GATAAACGGG AATCAGAATC CGCGCCGCTG CCGTTTTCGC TTTCGGCGTT GCAGATTGAA GCGGCAAAAC GCTTTGGTCT GAGCGCGCAG AACGTGCTTG ATATCTGCCA GAAGCTGTAC GAAACGCACA AGCTAATCAC TTATCCGCGT TCCGATTGTC GCTATTTGCC AGAAGAACAT TTTGCCGGAC GCCACGCGGT GATGAATGCC ATCAGCGTTC ATGCACCAGA TCTGTTGCCG CAGCCAGTGG TAGATCCAGA TATACGCAAC CGCTGTTGGG ATGACAAAAA GGTCGATGCG CACCACGCCA TCATTCCGAC CGCACGGAGT TCTGCGATCA ACCTGACGGA GAACGAAGCG AAGGTCTATA ACCTGATTTC CCGTCAGTAT CTGATGCAGT TCTGCCCGGA TGCGGTGTTC CGCAAGTGTG TTATCGAACT GGACATTGCC AAAGGCAAAT TTGTCGCTAA AGCGCGTTTT CTTGCTGAAG CAGGCTGGCG CACGCTGTTA GGCAGCAAAG AGCGCGATGA AGAAAACGAC GGTACGCCAT TGCCTGTGGT GGCGAAAGGC GATGAGTTGC TGTGTGAAAA AGGTGAAGTG GTAGAGCGGC AAACCCAGCC GCCGCGCCAT TTTACCGATG CAACACTGCT TTCGGCGATG ACCGGGATCG CGCGCTTTGT GCAGGACAAA GATCTGAAAA AGATCCTTCG TGCGACCGAT GGTCTGGGGA CAGAAGCAAC GCGTGCCGGG ATTATTGAAC TGTTGTTCAA GCGTGGTTTC CTCATCAAAA AAGGGCGCTA TATCCACTCC ACCGACGCCG GAAAAGCGTT ATTCCATTCG CTGCCAGAAA TGGCTACGCG ACCGGACATG ACCGCGCACT GGGAATCGGT GCTGACGCAA ATCAGCGAAA AGCAGTGTCG CTATCAGGAC TTTATGCAGC CGCTGGTGGG GACGCTATAT CAGCTTATTG ATCAAGCCAA ACGTACTCCG GTGCGGCAGT TTCGCGGCAT TGTGGCTCCG GGCAGTGGTG GCAGTGCTGA TAAGAAAAAG GCTGCACCGC GTAAACGTAG TGCGAAAAAA AGTCCGCCAG CAGATGAAGC CGGAAGCGGG GCGATAGCGT AA
|
Protein sequence | MRLFIAEKPS LARAIADVLP KPHRKGDGFI ECGNGQVVTW CIGHLLEQAQ PDAYDSRYAR WNLADLPIVP EKWQLQPRPS VTKQLNVIKR YLHEASEIVH AGDPDREGQL LVDEVLDYLQ LAPEKRQQVQ RCLINDLNPQ AVERAIDRLR SNSEFVPLCV SALARARADW LYGINMTRAY TILGRNAGYQ GVLSVGRVQT PVLGLVVRRD EEIENFVAKD FFEVKAHIVT PADERFTAIW QPSEACEPYQ DEEGRLLHRP LAEHVVNRIS GQPAIVTSYN DKRESESAPL PFSLSALQIE AAKRFGLSAQ NVLDICQKLY ETHKLITYPR SDCRYLPEEH FAGRHAVMNA ISVHAPDLLP QPVVDPDIRN RCWDDKKVDA HHAIIPTARS SAINLTENEA KVYNLISRQY LMQFCPDAVF RKCVIELDIA KGKFVAKARF LAEAGWRTLL GSKERDEEND GTPLPVVAKG DELLCEKGEV VERQTQPPRH FTDATLLSAM TGIARFVQDK DLKKILRATD GLGTEATRAG IIELLFKRGF LIKKGRYIHS TDAGKALFHS LPEMATRPDM TAHWESVLTQ ISEKQCRYQD FMQPLVGTLY QLIDQAKRTP VRQFRGIVAP GSGGSADKKK AAPRKRSAKK SPPADEAGSG AIA
|
| |