Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1906 |
Symbol | topA |
ID | 6969066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1798769 |
End bp | 1801366 |
Gene Length | 2598 bp |
Protein Length | 865 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643385839 |
Product | DNA topoisomerase I |
Protein accession | YP_002270328 |
Protein GI | 209398456 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA [COG0551] Zn-finger domain associated with topoisomerase type I |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.176274 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000000000000101765 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGTAAAG CTCTTGTCAT CGTTGAGTCC CCGGCAAAAG CCAAAACGAT CAACAAGTAT CTGGGTAGTG ACTACGTGGT GAAATCCAGC GTCGGTCACA TCCGCGATTT GCCGACCAGT GGCTCAGCTG CCAAAAAGAG TGCCGACTCT ACCTCCACCA AGACGGCTAA AAAGCCTAAA AAGGATGAAC GTGGCGCTCT CGTCAACCGT ATGGGGGTTG ACCCGTGGCA CAATTGGGAG GCGCACTATG AAGTGTTGCC TGGTAAAGAG AAGGTCGTCT CTGAACTGAA ACAACTGGCT GAAAAAGCCG ACCACATCTA TCTCGCAACC GACCTTGACC GCGAAGGGGA AGCCATTGCA TGGCACCTGC GGGAAGTGAT TGGTGGTGAT GATGCGCGCT ATAGCCGAGT GGTGTTTAAC GAAATTACTA AAAACGCGAT CCGCCAGGCA TTTAACAAAC CGGGTGAGCT GAATATTGAT CGTGTTAATG CCCAGCAGGC GCGTCGCTTT ATGGACCGCG TGGTGGGGTA TATGGTTTCG CCGCTGCTAT GGAAAAAGAT CGCTCGTGGT CTGTCTGCCG GTCGTGTGCA GTCGGTGGCA GTCCGCCTGG TGGTCGAGCG TGAGCGTGAA ATTAAAGCGT TCGTGCCGGA AGAGTTCTGG GAAGTCGATG CCAGCACGAC CACGCCATCT GGTGAAGCGT TGGCGTTGCA GGTGACTCAT CAGAACGACA AACCGTTCCG TCCGGTCAAC AAAGAACAAA CTCAGGCTGC GGTAAGTCTG CTGGAAAAAG CGCGCTACAG CGTGCTGGAA CGTGAAGACA AACCGACAAC CAGTAAACCT GGCGCTCCTT TTATTACCTC TACGCTGCAA CAAGCTGCCA GCACCCGTCT TGGATTTGGC GTGAAAAAAA CCATGATGAT GGCGCAGCGT TTGTATGAAG CAGGCTATAT CACTTACATG CGTACCGACT CCACTAACCT GAGTCAGGAC GCGGTAAATA TGGTTCGCGG TTATATCAGC GATAATTTTG GTAAGAAATA TCTGCCGGAA AGTCCGAATC AGTACGCCAG CAAAGAAAAC TCACAGGAAG CGCACGAAGC GATTCGTCCT TCTGACGTCA ATGTGATGGC GGAATCGCTG AAGGATATGG AAGCAGATGC GCAGAAACTG TACCAGTTAA TCTGGCGTCA GTTCGTTGCC TGCCAGATGA CCCCAGCGAA ATATGACTCC ACGACGCTGA CCGTTGGTGC GGGCGATTTC CGCCTGAAAG CACGCGGTCG TATTTTGCGC TTTGATGGCT GGACGAAAGT GATGCCTGCA CTGCGTAAAG GCGATGAAGA TCGTATCTTA CCTGCAGTCG ATAAAGGCGA TGCTCTGACG CTCGTTGAAC TGACACCAGC CCAGCACTTT ACCAAGCCGC CAGCCCGTTT CAGTGAAGCA TCGCTGGTTA AAGAACTGGA AAAACGTGGT ATCGGTCGTC CGTCTACCTA TGCGTCGATC ATTTCGACCA TTCAGGATCG TGGCTATGTG CGAGTAGAAA ATCGTCGTTT CTATGCGGAA AAAATGGGCG AAATCGTCAC CGATCGCCTG GAAGAGAATT TCCGCGAGTT AATGAACTAC GACTTCACCG CGCAGATGGA AAACAGCCTT GACCAGGTGG CAAATCACGA AGCAGAGTGG AAAGCTGTAC TGGATCACTT CTTCTCGGAT TTCACTCAGC AGTTAGATAA AGCTGAAAAA GATCCGGAAG AGGGGGGTAT GCGTCCGAAC CAGATGGTTC TGACCAGCAT CGACTGCCCG ACCTGTGGTC GCAAAATGGG GATTCGCACA GCGAGCACCG GGGTATTCCT TGGCTGTTCT GGCTATGCGC TGCCGCCGAA AGAGCGTTGC AAAACAACCA TTAACCTGGT GCCGGAAAAC GAAGTGCTGA ACGTGCTGGA AGGCGAAGAC GCTGAAACCA ACGCGCTGCG CGCAAAACGT CGTTGCCCCA AATGCGGCAC GGCGATGGAC AGCTATCTCA TCGATCCGAA ACGTAAGTTG CATGTCTGTG GTAATAACCC AACCTGCGAC GGTTACGAGA TCGAAGAGGG CGAATTCCGC ATTAAAGGTT ATGACGGCCC GATCGTTGAG TGTGAAAAAT GTGGTTCTGA AATGCACCTG AAAATGGGGC GTTTCGGTAA ATATATGGCC TGCACCAACG AAGAGTGTAA AAACACGCGT AAGATTTTAC GTAACGGCGA AGTGGCTCCA CCGAAAGAAG ATCCGGTACC ATTACCGGAG CTGCCGTGCG AAAAATCAGA TGCCTATTTC GTGCTGCGTG ACGGTGCTGC CGGTGTGTTC CTGGCGGCCA ATACCTTCCC GAAATCGCGT GAAACGCGTG CGCCGCTGGT GGAAGAGCTG TATCGCTTCC GCGATCGTCT GCCGGAAAAA CTGCGTTATC TGGCCGATGC GCCGCAGCAG GATCCGGAAG GTAATAAGAC TATGGTTCGC TTTAGCCGTA AAACCAAACA GCAATATGTC TCTTCGGAAA AAGACGGAAA GGCGACTGGC TGGTCAGCAT TTTATGTTGA TGGCAAATGG GTTGAAGGGA AAAAATAA
|
Protein sequence | MGKALVIVES PAKAKTINKY LGSDYVVKSS VGHIRDLPTS GSAAKKSADS TSTKTAKKPK KDERGALVNR MGVDPWHNWE AHYEVLPGKE KVVSELKQLA EKADHIYLAT DLDREGEAIA WHLREVIGGD DARYSRVVFN EITKNAIRQA FNKPGELNID RVNAQQARRF MDRVVGYMVS PLLWKKIARG LSAGRVQSVA VRLVVERERE IKAFVPEEFW EVDASTTTPS GEALALQVTH QNDKPFRPVN KEQTQAAVSL LEKARYSVLE REDKPTTSKP GAPFITSTLQ QAASTRLGFG VKKTMMMAQR LYEAGYITYM RTDSTNLSQD AVNMVRGYIS DNFGKKYLPE SPNQYASKEN SQEAHEAIRP SDVNVMAESL KDMEADAQKL YQLIWRQFVA CQMTPAKYDS TTLTVGAGDF RLKARGRILR FDGWTKVMPA LRKGDEDRIL PAVDKGDALT LVELTPAQHF TKPPARFSEA SLVKELEKRG IGRPSTYASI ISTIQDRGYV RVENRRFYAE KMGEIVTDRL EENFRELMNY DFTAQMENSL DQVANHEAEW KAVLDHFFSD FTQQLDKAEK DPEEGGMRPN QMVLTSIDCP TCGRKMGIRT ASTGVFLGCS GYALPPKERC KTTINLVPEN EVLNVLEGED AETNALRAKR RCPKCGTAMD SYLIDPKRKL HVCGNNPTCD GYEIEEGEFR IKGYDGPIVE CEKCGSEMHL KMGRFGKYMA CTNEECKNTR KILRNGEVAP PKEDPVPLPE LPCEKSDAYF VLRDGAAGVF LAANTFPKSR ETRAPLVEEL YRFRDRLPEK LRYLADAPQQ DPEGNKTMVR FSRKTKQQYV SSEKDGKATG WSAFYVDGKW VEGKK
|
| |