Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1428 |
Symbol | topB |
ID | 6146168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1413353 |
End bp | 1415308 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616306 |
Product | DNA topoisomerase III |
Protein accession | YP_001743486 |
Protein GI | 170682846 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0037378 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.804884 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTTGT TTATTGCCGA AAAACCGAGT CTGGCGCGCG CCATTGCTGA TGTCCTGCCT AAACCGCACC GGAAAGGCGA TGGCTTTATC GAGTGCGGTA ATGGTCAGGT GGTGACCTGG TGTATCGGTC ACCTGCTTGA GCAGGCGCAG CCAGACGCCT ACGACAGCCG CTATGCGCGC TGGAATCTTG CGGATTTGCC AATTGTCCCG GAAAAGTGGC AATTACAGCC CCGACCCTCC GTGACTAAAC AACTTAATGT CATCAAACGC TTCCTGCATG AAGCCAGCGA AATCGTTCAC GCCGGAGACC CGGATCGCGA AGGGCAACTG CTGGTGGATG AAGTGCTGGA CTACCTGCAA CTGGCACCGG AAAAGCGCCA GCAGGTACAG CGTTGTTTGA TAAACGACCT GAACCCGCAG GCGGTTGAGC GGGCGATCGA TCGTCTTCGC TCCAATAGCG AGTTTGTACC GCTGTGCGTT TCTGCACTGG CGCGAGCCCG AGCTGACTGG CTGTACGGCA TCAATATGAC CCGCGCTTAC ACCATTCTCG GTCGCAATGC CGGTTATCAG GGCGTACTTT CCGTGGGACG CGTGCAGACG CCCGTACTCG GACTGGTGGT GCGCCGCGAC GAAGAGATTG AAAACTTCGT GGCGAAAGAT TTCTTTGAAG TCAAAGCACA TATCGTGACA CCTGCCGATG AGCGGTTTAC CGCTATCTGG CAACCGAGCG AAGCGTGTGA ACCGTACCAG GATGAAGAAG GGCGCTTGTT ACATCGTCCA CTGGCGGAGC ATGTGGTTAA CCGCATTAGT GGTCAACCAG CTATTGTCAC CAGCTATAAC GATAAACGGG AATCAGAATC CGCGCCGCTT CCTTTTTCGC TTTCGGCGTT GCAGATTGAA GCGGCTAAAC GTTTTGGTCT GAGCGCGCAG AACGTGCTTG ATATCTGCCA GAAGCTGTAC GAAACGCACA AGCTAATCAC TTATCCGCGT TCCGATTGCC GCTATTTGCC AGAAGAACAT TTTGCCGGAC GCCACGCGGT GATGAATGCT ATCAGCGTTC ATGCACCGGA TCTGTTGCCG CAGCCAGTGG TAGATCCAGA TATACGCAAC CGCTGTTGGG ATGACAAAAA GGTCGATGCG CACCACGCCA TCATTCCGAC CGCACGGAGT TCTGCGATCA ACCTGACGGA GAACGAAGCG AAGGTCTATA ACCTGATTGC CCGTCAGTAC TTGATGCAGT TCTGCCCGGA TGCGGTGTTC CGCAAGTGTG TTATCGAACT GGACATTGCC AAAGGCAAAT TTGTCGCTAA AGCCCGTTTT CTTGCTGAAG CAGGCTGGCG CACGCTGTTA GGCAGCAAAG AGCGCGATGA AGAAAACGAC GGCATGCCGC TGCCTGTGGT GGCGAAAGGC GATGAATTGT TGTGCGAAAA AGGTGAAGTG GTAGAGCGGC AAACCCAGCC GCCGCGCCAT TTTACCGATG CAACACTGCT TTCGGCGATG ACCGGGATCG CACGCTTTGT GCAGGACAAA GATCTGAAAA AGATCCTTCG TGCGACCGAT GGTCTGGGGA CAGAAGCAAC CCGTGCCGGG ATTATTGAAC TGTTGTTCAA GCGTGGTTTC CTGACCAAAA AAGGGCGCTA TATCCACTCC ACCGACGCCG GAAAAGCGCT ATTCCATTCG CTGCCAGAAA TGGCGACGCG ACCGGACATG ACCGCGCACT GGGAATCGGT GCTGACGCAA ATCAGTGAAA AGCAGTGTCG CTATCAGGAC TTTATGCAGC CGCTGGTGGG GACATTGTAC CAGTTGATTG ATCAGGCAAA GCGCACGTCG GTAAGGCAGT TTCGGGGAAT AATGGCTCCC GGCGGTGGAG AAGGGAAGAA AAAGGATTCG CCACGCAAGA GAGCGCCGAA AAAAAGCCCG TCATCAGAAG AGGCGGGCAA TGGAGTGATA ATATAA
|
Protein sequence | MRLFIAEKPS LARAIADVLP KPHRKGDGFI ECGNGQVVTW CIGHLLEQAQ PDAYDSRYAR WNLADLPIVP EKWQLQPRPS VTKQLNVIKR FLHEASEIVH AGDPDREGQL LVDEVLDYLQ LAPEKRQQVQ RCLINDLNPQ AVERAIDRLR SNSEFVPLCV SALARARADW LYGINMTRAY TILGRNAGYQ GVLSVGRVQT PVLGLVVRRD EEIENFVAKD FFEVKAHIVT PADERFTAIW QPSEACEPYQ DEEGRLLHRP LAEHVVNRIS GQPAIVTSYN DKRESESAPL PFSLSALQIE AAKRFGLSAQ NVLDICQKLY ETHKLITYPR SDCRYLPEEH FAGRHAVMNA ISVHAPDLLP QPVVDPDIRN RCWDDKKVDA HHAIIPTARS SAINLTENEA KVYNLIARQY LMQFCPDAVF RKCVIELDIA KGKFVAKARF LAEAGWRTLL GSKERDEEND GMPLPVVAKG DELLCEKGEV VERQTQPPRH FTDATLLSAM TGIARFVQDK DLKKILRATD GLGTEATRAG IIELLFKRGF LTKKGRYIHS TDAGKALFHS LPEMATRPDM TAHWESVLTQ ISEKQCRYQD FMQPLVGTLY QLIDQAKRTS VRQFRGIMAP GGGEGKKKDS PRKRAPKKSP SSEEAGNGVI I
|
| |