Gene ECH74115_2483 details


Gene Information

Locus tag: ECH74115_2483
Symbol: topB
ID: 6970094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2350805
End bp: 2352766
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 55%
IMG OID: 643386352
Product: DNA topoisomerase III
Protein accession: YP_002270834
Protein GI: 209398550
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid
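
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2350805
end_bp = 2352766
reported_gene_length = 1962      # bp
reported_protein_length = 653    # aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; a non-zero remainder would signal a problem.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)  # prints: 1962 654 653
```

Under those assumptions the computed values match the reported Gene Length (1962 bp) and Protein Length (653 aa).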


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000298229
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value: 0.566589
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGTTGT TTATTGCCGA AAAACCGAGT CTGGCGCGCG CCATTGCTGA TGTCCTGCCC 
AAACCGCACC GGAAAGGCGA TGGCTTTATC GAGTGCGGTA ATGGTCAGGT GGTGACCTGG
TGTATCGGTC ACCTGCTTGA GCAGGCGCAG CCAGACGCCT ACGACAGCCG CTATGCGCGC
TGGAATCTTG CGGATTTGCC GATTGTCCCG GAAAAGTGGC AATTACAGCC CCGACCCTCC
GTGACCAAAC AACTTAACGT CATCAAACGG TACCTGCATG AAGCCAGCGA AATCGTTCAC
GCCGGGGACC CGGATCGCGA AGGGCAACTG CTGGTGGATG AAGTGCTGGA CTATCTACAA
CTGGCACCGG AAAAGCGCCA GCAGGTGCAG CGTTGCTTGA TAAACGACCT GAACCCGCAG
GCGGTTGAGC GGGCGATCGA CCGTCTTCGT TCCAACAGTG AGTTTGTACC GCTGTGCGTT
TCTGCGCTGG CGCGAGCGCG TGCCGACTGG CTGTACGGCA TCAATATGAC CCGTGCGTAT
ACCATTCTCG GTCGCAATGC CGGTTATCAG GGCGTGCTTT CCGTGGGACG CGTGCAGACG
CCCGTGCTTG GGCTGGTGGT GCGCCGCGAT GAAGAGATTG AAAACTTCGT GGCGAAAGAC
TTCTTTGAAG TCAAAGCACA TATCGTGACA CCTGCCGATG AGCGGTTTAC CGCTATCTGG
CAACCGAGCG AAGCGTGTGA ACCGTACCAG GATGAAGAAG GGCGCTTGTT ACATCGTCCA
CTGGCGGAGC ATGTGGTTAA CCGCATTAGT GGTCAACCGG CTATTGTCAC CAGCTATAAC
GATAAACGGG AATCAGAATC CGCGCCGCTG CCGTTTTCGC TTTCGGCGTT GCAGATTGAA
GCGGCAAAAC GCTTTGGTCT GAGCGCGCAG AACGTGCTTG ATATCTGCCA GAAGCTGTAC
GAAACGCACA AGCTAATCAC TTATCCGCGT TCCGATTGTC GCTATTTGCC AGAAGAACAT
TTTGCCGGAC GCCACGCGGT GATGAATGCC ATCAGCGTTC ATGCACCAGA TCTGTTGCCG
CAGCCAGTGG TAGATCCAGA TATACGCAAC CGCTGTTGGG ATGACAAAAA GGTCGATGCG
CACCACGCCA TCATTCCGAC CGCACGGAGT TCTGCGATCA ACCTGACGGA GAACGAAGCG
AAGGTCTATA ACCTGATTTC CCGTCAGTAT CTGATGCAGT TCTGCCCGGA TGCGGTGTTC
CGCAAGTGTG TTATCGAACT GGACATTGCC AAAGGCAAAT TTGTCGCTAA AGCGCGTTTT
CTTGCTGAAG CAGGCTGGCG CACGCTGTTA GGCAGCAAAG AGCGCGATGA AGAAAACGAC
GGTACGCCAT TGCCTGTGGT GGCGAAAGGC GATGAGTTGC TGTGTGAAAA AGGTGAAGTG
GTAGAGCGGC AAACCCAGCC GCCGCGCCAT TTTACCGATG CAACACTGCT TTCGGCGATG
ACCGGGATCG CGCGCTTTGT GCAGGACAAA GATCTGAAAA AGATCCTTCG TGCGACCGAT
GGTCTGGGGA CAGAAGCAAC GCGTGCCGGG ATTATTGAAC TGTTGTTCAA GCGTGGTTTC
CTCATCAAAA AAGGGCGCTA TATCCACTCC ACCGACGCCG GAAAAGCGTT ATTCCATTCG
CTGCCAGAAA TGGCTACGCG ACCGGACATG ACCGCGCACT GGGAATCGGT GCTGACGCAA
ATCAGCGAAA AGCAGTGTCG CTATCAGGAC TTTATGCAGC CGCTGGTGGG GACGCTATAT
CAGCTTATTG ATCAAGCCAA ACGTACTCCG GTGCGGCAGT TTCGCGGCAT TGTGGCTCCG
GGCAGTGGTG GCAGTGCTGA TAAGAAAAAG GCTGCACCGC GTAAACGTAG TGCGAAAAAA
AGTCCGCCAG CAGATGAAGC CGGAAGCGGG GCGATAGCGT AA
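
The gene sequence above is printed in space-separated blocks of 10 bases, so whitespace has to be removed before any analysis. The following is a minimal sketch of how the GC content field could be recomputed from such a block-formatted sequence. The function name `gc_fraction` is hypothetical, and the record does not say how its own 55% figure was calculated or rounded.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a block-formatted DNA sequence.

    Spaces and line breaks between the 10-base blocks are ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# Usage with only the first two blocks of the gene sequence, to keep the
# example short; paste the full sequence to check the whole gene.
first_blocks = "ATGCGGTTGT TTATTGCCGA"
print(f"{gc_fraction(first_blocks):.2f}")
```

Passing the complete gene sequence instead of the two-block excerpt gives a value that can be compared with the reported GC content.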
 
Protein sequence
MRLFIAEKPS LARAIADVLP KPHRKGDGFI ECGNGQVVTW CIGHLLEQAQ PDAYDSRYAR 
WNLADLPIVP EKWQLQPRPS VTKQLNVIKR YLHEASEIVH AGDPDREGQL LVDEVLDYLQ
LAPEKRQQVQ RCLINDLNPQ AVERAIDRLR SNSEFVPLCV SALARARADW LYGINMTRAY
TILGRNAGYQ GVLSVGRVQT PVLGLVVRRD EEIENFVAKD FFEVKAHIVT PADERFTAIW
QPSEACEPYQ DEEGRLLHRP LAEHVVNRIS GQPAIVTSYN DKRESESAPL PFSLSALQIE
AAKRFGLSAQ NVLDICQKLY ETHKLITYPR SDCRYLPEEH FAGRHAVMNA ISVHAPDLLP
QPVVDPDIRN RCWDDKKVDA HHAIIPTARS SAINLTENEA KVYNLISRQY LMQFCPDAVF
RKCVIELDIA KGKFVAKARF LAEAGWRTLL GSKERDEEND GTPLPVVAKG DELLCEKGEV
VERQTQPPRH FTDATLLSAM TGIARFVQDK DLKKILRATD GLGTEATRAG IIELLFKRGF
LIKKGRYIHS TDAGKALFHS LPEMATRPDM TAHWESVLTQ ISEKQCRYQD FMQPLVGTLY
QLIDQAKRTP VRQFRGIVAP GSGGSADKKK AAPRKRSAKK SPPADEAGSG AIA
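
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in the set of permitted start codons). This gene starts with ATG, so no special handling of alternative start codons is included. The names `CODON_TABLE` and `translate_cds` are illustrative only.

```python
# Codon assignments in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a block-formatted coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# Usage: the first three blocks of the gene sequence give the first
# ten residues of the protein sequence.
print(translate_cds("ATGCGGTTGT TTATTGCCGA AAAACCGAGT"))  # prints: MRLFIAEKPS
```

The last four codons of the gene, GCG ATA GCG TAA, translate the same way to the final residues AIA followed by a stop.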