Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_1986 |
Symbol | topB |
ID | 5587755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 1970621 |
End bp | 1972582 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640925658 |
Product | DNA topoisomerase III |
Protein accession | YP_001463061 |
Protein GI | 157157904 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000000136291 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGTTGT TTATTGCCGA AAAACCGAGT CTGGCGCGCG CCATTGCTGA TGTCCTGCCC AAACCGCACC GTAAAGGCGA TGGCTTTATC GAGTGCGGTA ATGGTCAGGT GGTGACCTGG TGTATCGGTC ACCTGCTTGA GCAGGCGCAG CCAGACGCCT ACGACAGCCG CTATGCGCGC TGGAATCTTG CGGATTTGCC GATTGTCCCG GAAAAGTGGC AATTACAGCC CCGACCCTCC GTGACCAAAC AACTTAACGT CATCAAACGC TTCCTGCATG AAGCCAGCGA AATCGTTCAC GCCGGGGACC CGGATCGTGA AGGGCAATTG CTGGTGGATG AAGTGCTGGA CTATCTGCAA CTGGCACCGG AAAAGCGCCA GCAGGTGCAG CGTTGCTTGA TAAACGACCT GAACCCGCAG GCGGTTGAGC GGGCGATCGA CCGTCTTCGT TCCAACAGTG AGTTTGTGCC ATTGTGCGTT TCTGCGCTGG CGCGAGCGCG TGCCGACTGG CTGTACGGCA TCAATATGAC CCGTGCGTAT ACCATTCTCG GTCGCAATGC CGGTTATCAG GGCGTACTTT CCGTGGGACG CGTGCAGACA CCCGTGCTCG GGCTGGTGGT GCGCCGCGAT GAAGAGATTG AAAACTTCGT GGCGAAAGAC TTCTTTGAAG TCAAAGCGCA TATCGTGACA CCTGCCGATG AGCGGTTTAC CGCTATCTGG CAACCGAGCG AAGCGTGTGA ACCGTACCAG GATGAAGAAG GGCGCTTGTT ACATCGTCCA CTGGCGGAGC ATGTGGTTAA CCGCATTAGT GGTCAACCGG CTATTGTCAC CAGCTATAAC GATAAACGGG AATCAGAATC CGCGCCGCTG CCGTTTTCGC TTTCGGCGTT GCAGATTGAA GCGGCAAAAC GCTTTGGTCT GAGCGCGCAG AACGTGCTTG ATATCTGCCA GAAGCTGTAC GAAACACACA AGTTAATCAC TTATCCGCGT TCCGATTGTC GCTATTTGCC AGAAGAACAT TTTGCCGGAC GCCACGCGGT GATGAATGCC ATCAGCGTTC ATGCACCAGA TCTGTTGCCG CAGCCAGTGG TAGATCCAGA TATACGCAAC CGCTGTTGGG ATGACAAAAA GGTCGATGCG CACCACGCCA TCATTCCGAC CGCACGGAGT TCTGCGATCA ACCTGACGGA GAACGAAGCG AAGGTCTATA ACCTGATTGC CCGTCAGTAT CTGATGCAGT TCTGCCCGGA TGCGGTGTTC CGCAAGTGTG TTATCGAACT GGACATTGCC AAAGGCAAAT TTGTCGCTAA AGCGCGTTTT CTTGCTGAAG CAGGTTGGCG CGCGCTGTTA GGCAGCAAAG AGCGCGATGA AGAAAACGAC GGCACGCCAT TGCCTGTGGT GGCGAAAGGC GATGAGTTAC TGTGTGAAAA AGGTGAAGTG GTAGAGCGGC AAACCCAGCC GCCGCGCCAT TTTACCGATG CAACACTGCT TTCGGCGATG ACCGGGATTG CGCGCTTTGT GCAGGACAAA GATCTGAAAA AGATCCTTCG TGCGACCGAT GGTCTGGGGA CAGAAGCAAC GCGTGCCGGG ATTATTGAAC TGTTGTTCAA GCGTGGTTTC CTCACCAAAA AAGGGCGCTA TATCCACTCC ACCGACGCCG GAAAAGCGTT ATTCCATTCG CTGCCAGAAA TGGCTACGCG ACCGGACATG ACCGCGCACT GGGAATCGGT GCTGACGCAA ATCAGCGAAA AGCAGTGTCG CTATCAGGAC TTTATGCAGC CGCTGGTGGG GACGCTATAT CAGCTTATTG ATCAAGCCAA ACGTACTCCG GTGCGGCAGT TTCGCGGCAT TGTGGCTCCG GGCAGTGGTG GCAGTGCTGA TAAGAAAAAG GCTGCACCGC GTAAACGTAG TGCGAAAAAA AGTCCGCCAG CAGATGAAGC CGGAAGCGGG GCGATAGCGT AA
|
Protein sequence | MRLFIAEKPS LARAIADVLP KPHRKGDGFI ECGNGQVVTW CIGHLLEQAQ PDAYDSRYAR WNLADLPIVP EKWQLQPRPS VTKQLNVIKR FLHEASEIVH AGDPDREGQL LVDEVLDYLQ LAPEKRQQVQ RCLINDLNPQ AVERAIDRLR SNSEFVPLCV SALARARADW LYGINMTRAY TILGRNAGYQ GVLSVGRVQT PVLGLVVRRD EEIENFVAKD FFEVKAHIVT PADERFTAIW QPSEACEPYQ DEEGRLLHRP LAEHVVNRIS GQPAIVTSYN DKRESESAPL PFSLSALQIE AAKRFGLSAQ NVLDICQKLY ETHKLITYPR SDCRYLPEEH FAGRHAVMNA ISVHAPDLLP QPVVDPDIRN RCWDDKKVDA HHAIIPTARS SAINLTENEA KVYNLIARQY LMQFCPDAVF RKCVIELDIA KGKFVAKARF LAEAGWRALL GSKERDEEND GTPLPVVAKG DELLCEKGEV VERQTQPPRH FTDATLLSAM TGIARFVQDK DLKKILRATD GLGTEATRAG IIELLFKRGF LTKKGRYIHS TDAGKALFHS LPEMATRPDM TAHWESVLTQ ISEKQCRYQD FMQPLVGTLY QLIDQAKRTP VRQFRGIVAP GSGGSADKKK AAPRKRSAKK SPPADEAGSG AIA
|
| |