Gene ECH74115_1906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1906 
SymboltopA 
ID6969066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1798769 
End bp1801366 
Gene Length2598 bp 
Protein Length865 aa 
Translation table11 
GC content52% 
IMG OID643385839 
ProductDNA topoisomerase I 
Protein accessionYP_002270328 
Protein GI209398456 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.176274 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000000101765 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGTAAAG CTCTTGTCAT CGTTGAGTCC CCGGCAAAAG CCAAAACGAT CAACAAGTAT 
CTGGGTAGTG ACTACGTGGT GAAATCCAGC GTCGGTCACA TCCGCGATTT GCCGACCAGT
GGCTCAGCTG CCAAAAAGAG TGCCGACTCT ACCTCCACCA AGACGGCTAA AAAGCCTAAA
AAGGATGAAC GTGGCGCTCT CGTCAACCGT ATGGGGGTTG ACCCGTGGCA CAATTGGGAG
GCGCACTATG AAGTGTTGCC TGGTAAAGAG AAGGTCGTCT CTGAACTGAA ACAACTGGCT
GAAAAAGCCG ACCACATCTA TCTCGCAACC GACCTTGACC GCGAAGGGGA AGCCATTGCA
TGGCACCTGC GGGAAGTGAT TGGTGGTGAT GATGCGCGCT ATAGCCGAGT GGTGTTTAAC
GAAATTACTA AAAACGCGAT CCGCCAGGCA TTTAACAAAC CGGGTGAGCT GAATATTGAT
CGTGTTAATG CCCAGCAGGC GCGTCGCTTT ATGGACCGCG TGGTGGGGTA TATGGTTTCG
CCGCTGCTAT GGAAAAAGAT CGCTCGTGGT CTGTCTGCCG GTCGTGTGCA GTCGGTGGCA
GTCCGCCTGG TGGTCGAGCG TGAGCGTGAA ATTAAAGCGT TCGTGCCGGA AGAGTTCTGG
GAAGTCGATG CCAGCACGAC CACGCCATCT GGTGAAGCGT TGGCGTTGCA GGTGACTCAT
CAGAACGACA AACCGTTCCG TCCGGTCAAC AAAGAACAAA CTCAGGCTGC GGTAAGTCTG
CTGGAAAAAG CGCGCTACAG CGTGCTGGAA CGTGAAGACA AACCGACAAC CAGTAAACCT
GGCGCTCCTT TTATTACCTC TACGCTGCAA CAAGCTGCCA GCACCCGTCT TGGATTTGGC
GTGAAAAAAA CCATGATGAT GGCGCAGCGT TTGTATGAAG CAGGCTATAT CACTTACATG
CGTACCGACT CCACTAACCT GAGTCAGGAC GCGGTAAATA TGGTTCGCGG TTATATCAGC
GATAATTTTG GTAAGAAATA TCTGCCGGAA AGTCCGAATC AGTACGCCAG CAAAGAAAAC
TCACAGGAAG CGCACGAAGC GATTCGTCCT TCTGACGTCA ATGTGATGGC GGAATCGCTG
AAGGATATGG AAGCAGATGC GCAGAAACTG TACCAGTTAA TCTGGCGTCA GTTCGTTGCC
TGCCAGATGA CCCCAGCGAA ATATGACTCC ACGACGCTGA CCGTTGGTGC GGGCGATTTC
CGCCTGAAAG CACGCGGTCG TATTTTGCGC TTTGATGGCT GGACGAAAGT GATGCCTGCA
CTGCGTAAAG GCGATGAAGA TCGTATCTTA CCTGCAGTCG ATAAAGGCGA TGCTCTGACG
CTCGTTGAAC TGACACCAGC CCAGCACTTT ACCAAGCCGC CAGCCCGTTT CAGTGAAGCA
TCGCTGGTTA AAGAACTGGA AAAACGTGGT ATCGGTCGTC CGTCTACCTA TGCGTCGATC
ATTTCGACCA TTCAGGATCG TGGCTATGTG CGAGTAGAAA ATCGTCGTTT CTATGCGGAA
AAAATGGGCG AAATCGTCAC CGATCGCCTG GAAGAGAATT TCCGCGAGTT AATGAACTAC
GACTTCACCG CGCAGATGGA AAACAGCCTT GACCAGGTGG CAAATCACGA AGCAGAGTGG
AAAGCTGTAC TGGATCACTT CTTCTCGGAT TTCACTCAGC AGTTAGATAA AGCTGAAAAA
GATCCGGAAG AGGGGGGTAT GCGTCCGAAC CAGATGGTTC TGACCAGCAT CGACTGCCCG
ACCTGTGGTC GCAAAATGGG GATTCGCACA GCGAGCACCG GGGTATTCCT TGGCTGTTCT
GGCTATGCGC TGCCGCCGAA AGAGCGTTGC AAAACAACCA TTAACCTGGT GCCGGAAAAC
GAAGTGCTGA ACGTGCTGGA AGGCGAAGAC GCTGAAACCA ACGCGCTGCG CGCAAAACGT
CGTTGCCCCA AATGCGGCAC GGCGATGGAC AGCTATCTCA TCGATCCGAA ACGTAAGTTG
CATGTCTGTG GTAATAACCC AACCTGCGAC GGTTACGAGA TCGAAGAGGG CGAATTCCGC
ATTAAAGGTT ATGACGGCCC GATCGTTGAG TGTGAAAAAT GTGGTTCTGA AATGCACCTG
AAAATGGGGC GTTTCGGTAA ATATATGGCC TGCACCAACG AAGAGTGTAA AAACACGCGT
AAGATTTTAC GTAACGGCGA AGTGGCTCCA CCGAAAGAAG ATCCGGTACC ATTACCGGAG
CTGCCGTGCG AAAAATCAGA TGCCTATTTC GTGCTGCGTG ACGGTGCTGC CGGTGTGTTC
CTGGCGGCCA ATACCTTCCC GAAATCGCGT GAAACGCGTG CGCCGCTGGT GGAAGAGCTG
TATCGCTTCC GCGATCGTCT GCCGGAAAAA CTGCGTTATC TGGCCGATGC GCCGCAGCAG
GATCCGGAAG GTAATAAGAC TATGGTTCGC TTTAGCCGTA AAACCAAACA GCAATATGTC
TCTTCGGAAA AAGACGGAAA GGCGACTGGC TGGTCAGCAT TTTATGTTGA TGGCAAATGG
GTTGAAGGGA AAAAATAA
 
Protein sequence
MGKALVIVES PAKAKTINKY LGSDYVVKSS VGHIRDLPTS GSAAKKSADS TSTKTAKKPK 
KDERGALVNR MGVDPWHNWE AHYEVLPGKE KVVSELKQLA EKADHIYLAT DLDREGEAIA
WHLREVIGGD DARYSRVVFN EITKNAIRQA FNKPGELNID RVNAQQARRF MDRVVGYMVS
PLLWKKIARG LSAGRVQSVA VRLVVERERE IKAFVPEEFW EVDASTTTPS GEALALQVTH
QNDKPFRPVN KEQTQAAVSL LEKARYSVLE REDKPTTSKP GAPFITSTLQ QAASTRLGFG
VKKTMMMAQR LYEAGYITYM RTDSTNLSQD AVNMVRGYIS DNFGKKYLPE SPNQYASKEN
SQEAHEAIRP SDVNVMAESL KDMEADAQKL YQLIWRQFVA CQMTPAKYDS TTLTVGAGDF
RLKARGRILR FDGWTKVMPA LRKGDEDRIL PAVDKGDALT LVELTPAQHF TKPPARFSEA
SLVKELEKRG IGRPSTYASI ISTIQDRGYV RVENRRFYAE KMGEIVTDRL EENFRELMNY
DFTAQMENSL DQVANHEAEW KAVLDHFFSD FTQQLDKAEK DPEEGGMRPN QMVLTSIDCP
TCGRKMGIRT ASTGVFLGCS GYALPPKERC KTTINLVPEN EVLNVLEGED AETNALRAKR
RCPKCGTAMD SYLIDPKRKL HVCGNNPTCD GYEIEEGEFR IKGYDGPIVE CEKCGSEMHL
KMGRFGKYMA CTNEECKNTR KILRNGEVAP PKEDPVPLPE LPCEKSDAYF VLRDGAAGVF
LAANTFPKSR ETRAPLVEEL YRFRDRLPEK LRYLADAPQQ DPEGNKTMVR FSRKTKQQYV
SSEKDGKATG WSAFYVDGKW VEGKK