Gene EcHS_A1847 details

Gene Information

Locus tag: EcHS_A1847
Symbol: topB
ID: 5591031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1863081
End bp: 1865042
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 56%
IMG OID: 640920991
Product: DNA topoisomerase III
Protein accession: YP_001458543
Protein GI: 157161225
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid
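
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, which can be checked with a minimal sketch like the one below. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1863081
END_BP = 1865042
GENE_LENGTH_BP = 1962
PROTEIN_LENGTH_AA = 653


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the stop codon encodes no residue


# The record is internally consistent under these assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for this record: the span 1863081 to 1865042 covers 1962 bp, which is 654 codons, or 653 residues plus one stop codon.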


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00000137821
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGTTGT TTATTGCCGA AAAACCGAGT CTGGCGCGCG CCATTGCTGA TGTCCTGCCC 
AAACCGCACC GGAAAGGCGA TGGCTTTATC GAGTGCGGTA ATGGTCAGGT GGTGACCTGG
TGTATCGGTC ACCTGCTTGA GCAGGCGCAG CCAGACGCCT ACGACAGCCG CTATGCGCGC
TGGAATCTTG CGGATTTGCC GATTGTCCCG GAAAAGTGGC AATTACAGCC CCGACCCTCC
GTGACCAAAC AACTTAACGT CATCAAACGC TTCCTGCATG AAGCCAGCGA AATCGTTCAC
GCCGGGGACC CGGATCGTGA AGGGCAATTG CTGGTGGATG AAGTGCTGGA CTATCTGCAA
CTGGCACCGG AAAAGCGCCA GCAGGTGCAG CGTTGCTTGA TAAACGACCT GAACCCGCAG
GCGGTTGAGC GGGCGATCGA CCGTCTTCGT TCCAACAGTG AGTTTGTACC GCTGTGCGTT
TCTGCGCTGG CGCGAGCGCG TGCCGACTGG CTGTACGGCA TCAATATGAC CCGTGCTTAC
ACCATTCTCG GTCGCAATGC TGGTTATCAG GGCGTGCTTT CCGTGGGGCG CGTGCAGACA
CCCGTGCTCG GGCTGGTGGT GCGCCGCGAT GAAGAGATTG AAAACTTCGT GGCGAAAGAC
TTCTTTGAAG TCAAAGCGCA TATCGTGACA CCTGCCGATG AGCGGTTTAC CGCTATCTGG
CAACCGAGCG AAGCGTGTGA ACCGTACCAG GATGAAGAAG GGCGCTTGTT ACATCGTCCA
CTGGCGGAGC ATGTGGTTAA CCGCATTAGT GGTCAACCGG CTATTGTCAC CAGCTATAAC
GATAAACGGG AATCAGAATC CGCGCCGCTG CCGTTTTCGC TTTCGGCGTT GCAGATTGAA
GCGGCAAAAC GCTTTGGTCT GAGCGCGCAG AACGTGCTTG ATATCTGCCA GAAACTGTAC
GAAACGCACA AGCTAATCAC TTATCCGCGT TCTGATTGTC GCTATTTGCC AGAAGAACAT
TTTGCCGGAC GCCACGCGGT GATGAATGCC ATCAGTGTTC ATGCACCGGA TCTGCTGCCG
CAGCCAGTGG TAGATCCAGA TATACGCAAC CGCTGTTGGG ATGACAAAAA GGTCGATGCG
CACCACGCCA TCATTCCGAC CGCACGGAGT TCTGCGATCA ACCTGACGGA GAACGAAGCG
AAGGTCTATA ACCTGATTGC CCGTCAGTAT CTGATGCAAT TCTGCCCGGA TGCGGTGTTC
CGCAAGTGTG TTATCGAACT GGACATTGCC AAAGGCAAAT TTGTCGCTAA AGCGCGTTTT
CTTGCTGAAG CAGGCTGGCG CACGCTGTTA GGCAGCAAAG AGCGCGATGA AGAAAACGAC
GGCACGCCAC TGCCTGTGGT GGCGAAAGGC GATGAGTTGC TGTGTGAAAA AGGTGAAGTG
GTAGAGCGGC AAACCCAGCC GCCGCGCCAT TTTACCGATG CAACACTGCT TTCGGCGATG
ACCGGGATCG CGCGCTTTGT GCAGGATAAA GATCTGAAAA AGATCCTCCG TGCGACCGAT
GGTCTGGGGA CAGAGGCAAC GCGTGCCGGG ATTATTGAAC TGTTGTTCAA GCGTGGTTTC
CTGACCAAAA AAGGGCGCTA TATCCACTCC ACCGACGCCG GAAAAGCGCT ATTCCATTCG
CTGCCGGAGA TGGCGACGCG ACCGGACATG ACCGCGCACT GGGAATCGGT GCTGACGCAA
ATCAGCGAAA AGCAGTGTCG CTATCAGGAC TTTATGCAGC CGCTGGTGGG GACGCTATAT
CAGCTTATTG ATCAAGCCAA ACGTACGCCG GTGCGGCAGT TTCGCGGCAT TGTGGCTCCG
GGCAGTGGTG GCAGTGCTGA TAAGAAAAAG GCTGCACCGC GTAAACGTAG TGCGAAAAAA
AGTCCGCCAG CAGATGAAGT CGGAAGCGGG GCGATAGCGT AA
 
Protein sequence
MRLFIAEKPS LARAIADVLP KPHRKGDGFI ECGNGQVVTW CIGHLLEQAQ PDAYDSRYAR 
WNLADLPIVP EKWQLQPRPS VTKQLNVIKR FLHEASEIVH AGDPDREGQL LVDEVLDYLQ
LAPEKRQQVQ RCLINDLNPQ AVERAIDRLR SNSEFVPLCV SALARARADW LYGINMTRAY
TILGRNAGYQ GVLSVGRVQT PVLGLVVRRD EEIENFVAKD FFEVKAHIVT PADERFTAIW
QPSEACEPYQ DEEGRLLHRP LAEHVVNRIS GQPAIVTSYN DKRESESAPL PFSLSALQIE
AAKRFGLSAQ NVLDICQKLY ETHKLITYPR SDCRYLPEEH FAGRHAVMNA ISVHAPDLLP
QPVVDPDIRN RCWDDKKVDA HHAIIPTARS SAINLTENEA KVYNLIARQY LMQFCPDAVF
RKCVIELDIA KGKFVAKARF LAEAGWRTLL GSKERDEEND GTPLPVVAKG DELLCEKGEV
VERQTQPPRH FTDATLLSAM TGIARFVQDK DLKKILRATD GLGTEATRAG IIELLFKRGF
LTKKGRYIHS TDAGKALFHS LPEMATRPDM TAHWESVLTQ ISEKQCRYQD FMQPLVGTLY
QLIDQAKRTP VRQFRGIVAP GSGGSADKKK AAPRKRSAKK SPPADEVGSG AIA
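
The sequences above are printed in blocks of ten characters, so they need the spaces and line breaks removed before any analysis. The sketch below is one way to do that, then compute a GC fraction and translate the gene with the amino-acid assignments of translation table 11. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Note that table 11 also allows alternative start codons to be read as methionine at the initiation position; that rule is not implemented here, which does not matter for this gene because it starts with ATG.

```python
# Amino-acid assignments of NCBI translation table 11, with codons
# ordered by base T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises ValueError on any character other than A, C, G or T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        first, second, third = (BASES.index(base) for base in seq[i:i + 3])
        residue = AMINO_ACIDS[16 * first + 4 * second + third]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGCGGTTGT TTATTGCCGA AAAACCGAGT"))  # MRLFIAEKPS
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and protein sequence reported in this record.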