Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_A0022 |
Symbol | topB |
ID | 6966544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011351 |
Strand | + |
Start bp | 16875 |
End bp | 19031 |
Gene Length | 2157 bp |
Protein Length | 718 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643384053 |
Product | DNA topoisomerase III |
Protein accession | YP_002268532 |
Protein GI | 209395643 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.270283 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.995092 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACTTT TTATTGCAGA AAAACCCGCA GTAGCGAATG ATATTGTTAA GGCACTTGGT GGCAATTTTA CTCGACATGA TGGCTGGTTC GAAAGTGATA ACGCAATTGT GACTAACTGT TTTGGTCATA TTATCGAATC ACAGCCACCG GAAAACTATA ATCCTGAATA TAAAGTCTGG AAAGTTGAAA CGCTTCCTTT ACGTCTTTAT CCCGTGAAGT ATCAGCCTGT CGAAAGTGCC GCAAAACAGG TTAAAACGAT TCTCGAACTT ATCAGCCGTG GAGACGTGAC TGAAATTGTT CACGCTGGCG ATCCTGATGA TGAGGGACAG CTACTGGTTG ATGAAGTCCT GGAATATGCA GGCAACACAA AACCCGTAAA GCGCGTTCTG ATTAACGACA ACACGCTTCC GGCAGTGAAA AAGGCACTGG CAAATCTTAA AGACAATCGT GATTTCAAAG GACTTTACCT TAAGGCGCTG GCGCGTTCAG TTGCCGATGC CGTCTATGGC TTCTCTATGA CACGTGCGTA CACCATTCCG GCAAAAGCCA GAGGATATCA GGGCGTTCTG TCTGTCGGGC GCGTCCAGAC TCCCGTTCTT GGCCTCATTG TGAATCGTAC CCGTGCTAAC CAGAACCATA AATCCAGTTT TTACTACACC ATGACCGGAG TCTTTCAGCG TGGTGCTGAT GTTCTCAGTG CAAACTGGAA ACCAGGGGAA TTTGCTCCGC TGACAGATCG TAAATTGCTT GATAAGGCAT GGGCAAACGG AACGGCGGCA TCTCTTGCGG GAAAACCAGC CACCGTTGAA GCGGCAGCAA CTGATGATAA AAAAACTGCC GCACCGTTGC CATTTAACCT GGTCAGGCTC CAGCAATACA TGAACAAAAA GTTCAAAATG ACAGCGCAAA AAACGCTGGA TGTTACGCAA CAACTACGCG AAAAATACAA AGCGATCACT TATAACCGCT CTGATTGCTC ATATCTTTCT GATGAACAAT TCAGCGAAGC GCCACAGGTT ATCGATGCCC TGAAATCAGT ATTTCCTCAG TCGTTGGATA TTGATTCCGC ACGTAAAAGC AAGGCGTTTA ACAGTGCAAA GGTGACTGCG CATACTGCGA TAATCCCGAC AGCCAGTGTG CCTGATGTTA ACGCACTCAG CACCGACGAG CGCAATGTTT ACCTTGCGAT CGCACAACAC TATCTTGTTC AGTTCATGCC TGAAAAAGCA TACCAGGAAG TATCGGTTGC CATTCAGTGT GGTGATGAGT CGTTCTATGC TCGTGCCAGA AAAACAACTG ACAGCGGATT TGAGGCATTT CTTGGTGCGG AAACCACAGA TGAAGGTGAA TCAGAAGATA ACGATGATTC CGCTTTTGAA CTGCTCTGTA AAATTCGCAC AGGAGAAACA CTGACGACAA AAGAAGTTGT AGTTAATGAG AAGAAAACGA CACCGCCGCC ATTATTTACA GAAGCCTCCT TGCTTGCTGC GCTTGTTCGT GTCGCGGATT TTGTCACTGA CCCAACGATT AAAAAATTGT TGAAGGATAA AGATAAAGAC AAAAAAGATG AACATGGCGG CATTGGTACG CCAGCTACCC GCGCAGCGAT TCTGGAAACG TTGAAGAAGA GAAATTATAT CACGCTGGAA AAAGGAAAAC TCATTCCGAC AGATACCGGA TATGCGCTTA TTGATGCTCT GCCGGATATA GCTGTTAATC CAGATATGAC AGCATTATGG GCTGAAAAGC AGGCAGCTAT TGAAAATGGT GATCTGACGG TTGAACAGTT TATTAATGAG CTGTACGGTG AACTGACAGG CATGATTTCT GATGTTGACC TGGGCGCGAT GAAGATTGAA GCAGCAGCGC CAGCAGCCCA ATCTCAACGC CTGAATGCTC CCTGTCCCTC CTGTGGTAAG CAGATTGCTA TCAGGCCAAA AGGTTATTTC TGTACAGGAT GTGAATTTAA AATCTGGAAG AACTTCTCTG GCAAGGTTCT TTCTGATAAG CAAGTAGAAT CCTTGCTGAC AAAAGGTATT ACAGGGGAGC TAAAAGGGTT TGTTAGTTCC AGGACGAATA AAGAATTTTC GGCTAAAGTT AAATTGATTG ATAAAACAAC CGGAAAGTTA GGGTTTGAAT TTCCCCCTAA AAAGTAA
|
Protein sequence | MRLFIAEKPA VANDIVKALG GNFTRHDGWF ESDNAIVTNC FGHIIESQPP ENYNPEYKVW KVETLPLRLY PVKYQPVESA AKQVKTILEL ISRGDVTEIV HAGDPDDEGQ LLVDEVLEYA GNTKPVKRVL INDNTLPAVK KALANLKDNR DFKGLYLKAL ARSVADAVYG FSMTRAYTIP AKARGYQGVL SVGRVQTPVL GLIVNRTRAN QNHKSSFYYT MTGVFQRGAD VLSANWKPGE FAPLTDRKLL DKAWANGTAA SLAGKPATVE AAATDDKKTA APLPFNLVRL QQYMNKKFKM TAQKTLDVTQ QLREKYKAIT YNRSDCSYLS DEQFSEAPQV IDALKSVFPQ SLDIDSARKS KAFNSAKVTA HTAIIPTASV PDVNALSTDE RNVYLAIAQH YLVQFMPEKA YQEVSVAIQC GDESFYARAR KTTDSGFEAF LGAETTDEGE SEDNDDSAFE LLCKIRTGET LTTKEVVVNE KKTTPPPLFT EASLLAALVR VADFVTDPTI KKLLKDKDKD KKDEHGGIGT PATRAAILET LKKRNYITLE KGKLIPTDTG YALIDALPDI AVNPDMTALW AEKQAAIENG DLTVEQFINE LYGELTGMIS DVDLGAMKIE AAAPAAQSQR LNAPCPSCGK QIAIRPKGYF CTGCEFKIWK NFSGKVLSDK QVESLLTKGI TGELKGFVSS RTNKEFSAKV KLIDKTTGKL GFEFPPKK
|
| |