Gene ECH74115_A0022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_A0022 
SymboltopB 
ID6966544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011351 
Strand
Start bp16875 
End bp19031 
Gene Length2157 bp 
Protein Length718 aa 
Translation table11 
GC content45% 
IMG OID643384053 
ProductDNA topoisomerase III 
Protein accessionYP_002268532 
Protein GI209395643 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.270283 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.995092 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTTT TTATTGCAGA AAAACCCGCA GTAGCGAATG ATATTGTTAA GGCACTTGGT 
GGCAATTTTA CTCGACATGA TGGCTGGTTC GAAAGTGATA ACGCAATTGT GACTAACTGT
TTTGGTCATA TTATCGAATC ACAGCCACCG GAAAACTATA ATCCTGAATA TAAAGTCTGG
AAAGTTGAAA CGCTTCCTTT ACGTCTTTAT CCCGTGAAGT ATCAGCCTGT CGAAAGTGCC
GCAAAACAGG TTAAAACGAT TCTCGAACTT ATCAGCCGTG GAGACGTGAC TGAAATTGTT
CACGCTGGCG ATCCTGATGA TGAGGGACAG CTACTGGTTG ATGAAGTCCT GGAATATGCA
GGCAACACAA AACCCGTAAA GCGCGTTCTG ATTAACGACA ACACGCTTCC GGCAGTGAAA
AAGGCACTGG CAAATCTTAA AGACAATCGT GATTTCAAAG GACTTTACCT TAAGGCGCTG
GCGCGTTCAG TTGCCGATGC CGTCTATGGC TTCTCTATGA CACGTGCGTA CACCATTCCG
GCAAAAGCCA GAGGATATCA GGGCGTTCTG TCTGTCGGGC GCGTCCAGAC TCCCGTTCTT
GGCCTCATTG TGAATCGTAC CCGTGCTAAC CAGAACCATA AATCCAGTTT TTACTACACC
ATGACCGGAG TCTTTCAGCG TGGTGCTGAT GTTCTCAGTG CAAACTGGAA ACCAGGGGAA
TTTGCTCCGC TGACAGATCG TAAATTGCTT GATAAGGCAT GGGCAAACGG AACGGCGGCA
TCTCTTGCGG GAAAACCAGC CACCGTTGAA GCGGCAGCAA CTGATGATAA AAAAACTGCC
GCACCGTTGC CATTTAACCT GGTCAGGCTC CAGCAATACA TGAACAAAAA GTTCAAAATG
ACAGCGCAAA AAACGCTGGA TGTTACGCAA CAACTACGCG AAAAATACAA AGCGATCACT
TATAACCGCT CTGATTGCTC ATATCTTTCT GATGAACAAT TCAGCGAAGC GCCACAGGTT
ATCGATGCCC TGAAATCAGT ATTTCCTCAG TCGTTGGATA TTGATTCCGC ACGTAAAAGC
AAGGCGTTTA ACAGTGCAAA GGTGACTGCG CATACTGCGA TAATCCCGAC AGCCAGTGTG
CCTGATGTTA ACGCACTCAG CACCGACGAG CGCAATGTTT ACCTTGCGAT CGCACAACAC
TATCTTGTTC AGTTCATGCC TGAAAAAGCA TACCAGGAAG TATCGGTTGC CATTCAGTGT
GGTGATGAGT CGTTCTATGC TCGTGCCAGA AAAACAACTG ACAGCGGATT TGAGGCATTT
CTTGGTGCGG AAACCACAGA TGAAGGTGAA TCAGAAGATA ACGATGATTC CGCTTTTGAA
CTGCTCTGTA AAATTCGCAC AGGAGAAACA CTGACGACAA AAGAAGTTGT AGTTAATGAG
AAGAAAACGA CACCGCCGCC ATTATTTACA GAAGCCTCCT TGCTTGCTGC GCTTGTTCGT
GTCGCGGATT TTGTCACTGA CCCAACGATT AAAAAATTGT TGAAGGATAA AGATAAAGAC
AAAAAAGATG AACATGGCGG CATTGGTACG CCAGCTACCC GCGCAGCGAT TCTGGAAACG
TTGAAGAAGA GAAATTATAT CACGCTGGAA AAAGGAAAAC TCATTCCGAC AGATACCGGA
TATGCGCTTA TTGATGCTCT GCCGGATATA GCTGTTAATC CAGATATGAC AGCATTATGG
GCTGAAAAGC AGGCAGCTAT TGAAAATGGT GATCTGACGG TTGAACAGTT TATTAATGAG
CTGTACGGTG AACTGACAGG CATGATTTCT GATGTTGACC TGGGCGCGAT GAAGATTGAA
GCAGCAGCGC CAGCAGCCCA ATCTCAACGC CTGAATGCTC CCTGTCCCTC CTGTGGTAAG
CAGATTGCTA TCAGGCCAAA AGGTTATTTC TGTACAGGAT GTGAATTTAA AATCTGGAAG
AACTTCTCTG GCAAGGTTCT TTCTGATAAG CAAGTAGAAT CCTTGCTGAC AAAAGGTATT
ACAGGGGAGC TAAAAGGGTT TGTTAGTTCC AGGACGAATA AAGAATTTTC GGCTAAAGTT
AAATTGATTG ATAAAACAAC CGGAAAGTTA GGGTTTGAAT TTCCCCCTAA AAAGTAA
 
Protein sequence
MRLFIAEKPA VANDIVKALG GNFTRHDGWF ESDNAIVTNC FGHIIESQPP ENYNPEYKVW 
KVETLPLRLY PVKYQPVESA AKQVKTILEL ISRGDVTEIV HAGDPDDEGQ LLVDEVLEYA
GNTKPVKRVL INDNTLPAVK KALANLKDNR DFKGLYLKAL ARSVADAVYG FSMTRAYTIP
AKARGYQGVL SVGRVQTPVL GLIVNRTRAN QNHKSSFYYT MTGVFQRGAD VLSANWKPGE
FAPLTDRKLL DKAWANGTAA SLAGKPATVE AAATDDKKTA APLPFNLVRL QQYMNKKFKM
TAQKTLDVTQ QLREKYKAIT YNRSDCSYLS DEQFSEAPQV IDALKSVFPQ SLDIDSARKS
KAFNSAKVTA HTAIIPTASV PDVNALSTDE RNVYLAIAQH YLVQFMPEKA YQEVSVAIQC
GDESFYARAR KTTDSGFEAF LGAETTDEGE SEDNDDSAFE LLCKIRTGET LTTKEVVVNE
KKTTPPPLFT EASLLAALVR VADFVTDPTI KKLLKDKDKD KKDEHGGIGT PATRAAILET
LKKRNYITLE KGKLIPTDTG YALIDALPDI AVNPDMTALW AEKQAAIENG DLTVEQFINE
LYGELTGMIS DVDLGAMKIE AAAPAAQSQR LNAPCPSCGK QIAIRPKGYF CTGCEFKIWK
NFSGKVLSDK QVESLLTKGI TGELKGFVSS RTNKEFSAKV KLIDKTTGKL GFEFPPKK