Gene EcSMS35_1428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1428 
SymboltopB 
ID6146168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1413353 
End bp1415308 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content54% 
IMG OID641616306 
ProductDNA topoisomerase III 
Protein accessionYP_001743486 
Protein GI170682846 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0037378 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.804884 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTTGT TTATTGCCGA AAAACCGAGT CTGGCGCGCG CCATTGCTGA TGTCCTGCCT 
AAACCGCACC GGAAAGGCGA TGGCTTTATC GAGTGCGGTA ATGGTCAGGT GGTGACCTGG
TGTATCGGTC ACCTGCTTGA GCAGGCGCAG CCAGACGCCT ACGACAGCCG CTATGCGCGC
TGGAATCTTG CGGATTTGCC AATTGTCCCG GAAAAGTGGC AATTACAGCC CCGACCCTCC
GTGACTAAAC AACTTAATGT CATCAAACGC TTCCTGCATG AAGCCAGCGA AATCGTTCAC
GCCGGAGACC CGGATCGCGA AGGGCAACTG CTGGTGGATG AAGTGCTGGA CTACCTGCAA
CTGGCACCGG AAAAGCGCCA GCAGGTACAG CGTTGTTTGA TAAACGACCT GAACCCGCAG
GCGGTTGAGC GGGCGATCGA TCGTCTTCGC TCCAATAGCG AGTTTGTACC GCTGTGCGTT
TCTGCACTGG CGCGAGCCCG AGCTGACTGG CTGTACGGCA TCAATATGAC CCGCGCTTAC
ACCATTCTCG GTCGCAATGC CGGTTATCAG GGCGTACTTT CCGTGGGACG CGTGCAGACG
CCCGTACTCG GACTGGTGGT GCGCCGCGAC GAAGAGATTG AAAACTTCGT GGCGAAAGAT
TTCTTTGAAG TCAAAGCACA TATCGTGACA CCTGCCGATG AGCGGTTTAC CGCTATCTGG
CAACCGAGCG AAGCGTGTGA ACCGTACCAG GATGAAGAAG GGCGCTTGTT ACATCGTCCA
CTGGCGGAGC ATGTGGTTAA CCGCATTAGT GGTCAACCAG CTATTGTCAC CAGCTATAAC
GATAAACGGG AATCAGAATC CGCGCCGCTT CCTTTTTCGC TTTCGGCGTT GCAGATTGAA
GCGGCTAAAC GTTTTGGTCT GAGCGCGCAG AACGTGCTTG ATATCTGCCA GAAGCTGTAC
GAAACGCACA AGCTAATCAC TTATCCGCGT TCCGATTGCC GCTATTTGCC AGAAGAACAT
TTTGCCGGAC GCCACGCGGT GATGAATGCT ATCAGCGTTC ATGCACCGGA TCTGTTGCCG
CAGCCAGTGG TAGATCCAGA TATACGCAAC CGCTGTTGGG ATGACAAAAA GGTCGATGCG
CACCACGCCA TCATTCCGAC CGCACGGAGT TCTGCGATCA ACCTGACGGA GAACGAAGCG
AAGGTCTATA ACCTGATTGC CCGTCAGTAC TTGATGCAGT TCTGCCCGGA TGCGGTGTTC
CGCAAGTGTG TTATCGAACT GGACATTGCC AAAGGCAAAT TTGTCGCTAA AGCCCGTTTT
CTTGCTGAAG CAGGCTGGCG CACGCTGTTA GGCAGCAAAG AGCGCGATGA AGAAAACGAC
GGCATGCCGC TGCCTGTGGT GGCGAAAGGC GATGAATTGT TGTGCGAAAA AGGTGAAGTG
GTAGAGCGGC AAACCCAGCC GCCGCGCCAT TTTACCGATG CAACACTGCT TTCGGCGATG
ACCGGGATCG CACGCTTTGT GCAGGACAAA GATCTGAAAA AGATCCTTCG TGCGACCGAT
GGTCTGGGGA CAGAAGCAAC CCGTGCCGGG ATTATTGAAC TGTTGTTCAA GCGTGGTTTC
CTGACCAAAA AAGGGCGCTA TATCCACTCC ACCGACGCCG GAAAAGCGCT ATTCCATTCG
CTGCCAGAAA TGGCGACGCG ACCGGACATG ACCGCGCACT GGGAATCGGT GCTGACGCAA
ATCAGTGAAA AGCAGTGTCG CTATCAGGAC TTTATGCAGC CGCTGGTGGG GACATTGTAC
CAGTTGATTG ATCAGGCAAA GCGCACGTCG GTAAGGCAGT TTCGGGGAAT AATGGCTCCC
GGCGGTGGAG AAGGGAAGAA AAAGGATTCG CCACGCAAGA GAGCGCCGAA AAAAAGCCCG
TCATCAGAAG AGGCGGGCAA TGGAGTGATA ATATAA
 
Protein sequence
MRLFIAEKPS LARAIADVLP KPHRKGDGFI ECGNGQVVTW CIGHLLEQAQ PDAYDSRYAR 
WNLADLPIVP EKWQLQPRPS VTKQLNVIKR FLHEASEIVH AGDPDREGQL LVDEVLDYLQ
LAPEKRQQVQ RCLINDLNPQ AVERAIDRLR SNSEFVPLCV SALARARADW LYGINMTRAY
TILGRNAGYQ GVLSVGRVQT PVLGLVVRRD EEIENFVAKD FFEVKAHIVT PADERFTAIW
QPSEACEPYQ DEEGRLLHRP LAEHVVNRIS GQPAIVTSYN DKRESESAPL PFSLSALQIE
AAKRFGLSAQ NVLDICQKLY ETHKLITYPR SDCRYLPEEH FAGRHAVMNA ISVHAPDLLP
QPVVDPDIRN RCWDDKKVDA HHAIIPTARS SAINLTENEA KVYNLIARQY LMQFCPDAVF
RKCVIELDIA KGKFVAKARF LAEAGWRTLL GSKERDEEND GMPLPVVAKG DELLCEKGEV
VERQTQPPRH FTDATLLSAM TGIARFVQDK DLKKILRATD GLGTEATRAG IIELLFKRGF
LTKKGRYIHS TDAGKALFHS LPEMATRPDM TAHWESVLTQ ISEKQCRYQD FMQPLVGTLY
QLIDQAKRTS VRQFRGIMAP GGGEGKKKDS PRKRAPKKSP SSEEAGNGVI I