Gene SeD_A2048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2048 
SymboltopB 
ID6875680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1979021 
End bp1980970 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content56% 
IMG OID642785162 
ProductDNA topoisomerase III 
Protein accessionYP_002215828 
Protein GI198245120 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.177023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.939846 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTTGT TTATTGCCGA AAAACCGAGT CTGGGGCGCG CCATTGCGGA TGTGCTGCCA 
AAACCGCACC GTAAAGGCGA TGGTTTTATT GAGTGCGGAA ACGGGCAGGT CGTCACCTGG
TGTATTGGTC ATTTGCTGGA ACAGGCGCAG CCGGATGCGT ATGACAGCCG TTATGCGCGC
TGGAATCTGG CTGACCTGCC TATTGTGCCG GAAAAATGGC AGCTTCAGCC TCGTCCTTCC
GTCACCAAAC AGCTCAATGT GATTAAGCGC TTTTTGCATC AAGCCGGTGA AATTATTCAC
GCCGGCGACC CGGATCGCGA AGGGCAGCTT CTGGTTGATG AAGTGCTGGA TTATCTCCAG
CTTCCTGCCG AAAAGCGCCA GCAGGTGCGG CGTTGTCTGA TAAACGACCT TAACCCGCAG
GCGGTCGAGC GTGCTATTGA CAGGCTGCGG GCGAACAGCG ACTTCGTGCC GCTGTGCGTC
TCCGCGCTGG CGCGAGCGCG AGCGGACTGG CTGTATGGCA TTAATATGAC CCGCGCTTAC
ACGATCCTGG GCCGTAATGC CGGCTATCAG GGCGTGTTAT CCGTGGGACG CGTACAGACG
CCGGTACTGG GGCTGGTGGT GCGGAGAGAC GAAGAGATTG AGAACTTCGT CGCCAAAGAC
TTCTTTGAAG TAAAGGCGCA CATCGTTACG CCTGCCGACG AGCGTTTTAC CGCTATCTGG
CAGCCGAGCG AGGCGTGCGA ACCTTATCAG GATGAAGAGG GGCGCTTGCT TCATCGTCCG
CTGGCGGAGC ATGTGGTGAA CCGAATCAAC GGTCAGCCCG CGCTGGTAAC CAGTTATAAT
GATAAACGGG AATCAGAATC CGCGCCGCTG CCGTTTTCGC TCTCGACGCT ACAGATTGAA
GCCGCCAAAC GCTTTGGCCT GAGCGCGCAA AACGTGCTTG ATATTTGTCA GAAGCTATAT
GAAACCCACA AACTGATTAC CTATCCGCGT TCCGACTGCC GTTATCTGCC GGAAGAACAC
TTTGCCGGAC GGCAGGCGGT CATGAACGCG ATTAGCGTCC ACGCCCCGGA TTTACTGCCG
CAGCCTGTGG TTAATCCTGA TACGCGCAAT CGCTGCTGGG ATGACAAAAA AGTGGATGCG
CACCACGCGA TTATCCCGAC GGCGCGCAGT TCTTCTGTCC ATCTGACGGA AAACGAAGCG
AAAGTGTACA CCCTGATTGC GCGTCAGTAT CTGATGCAGT TCTGCCCGGA CGCGGTGTTT
CGTAAATGCG TTATTGAACT GGAAATCGCC AAAGGGAAAT TTGTCGCCAA AGCGCGTTTT
CTGGCGGAGG CCGGTTGGCG GACGTTACTG GGCAGTAAAG AGCGCGACGA GGAAAACGAC
GGTACGCCGC TGCCGGTTGT CGCCAAAGGT GATGAGTTGC TGTGTGAAAA GGGGGAAGTG
GTCGAGCGCC AAACCCAGCC GCCGCGTCAT TTCACTGATG CGACATTGCT TTCCGCGATG
ACCGGAATTG CCCGCTTCGT GCAGGATAAA GATCTGAAAA AGATCCTGCG CGCGACCGAT
GGGCTGGGGA CGGAAGCCAC GCGCGCCGGG ATTATCGAGC TGCTGTTCAA ACGTAGCTTT
CTGACCAAAA AAGGGCGCTA CATTCATTCT ACCGATGCTG GCAAAGCGTT AATACATTCG
CTGCCGGAAA TGGCGGCCCG TCCAGATATG ACCGCGCACT GGGAATCTGT TTTGACGCAA
ATCAGCGAAA AGCAGTGCCG TTACCAGGAT TTCATGCAAC CGCTGGTCGG CACGTTATAT
CAGCTGATCG AGCAGGCTAA GCGCACGCCG GTGAAGCGCT TCAGAGGGAT AGTCGCGCCA
GGCGGTGGAG ACAAGAAAAA GAGCGCGCCG CGTAAGCGAG CGGGCAAAAA AAGCCCGCCT
GCTGAGGAGA CAGGCCGTCA GACCGAATAA
 
Protein sequence
MRLFIAEKPS LGRAIADVLP KPHRKGDGFI ECGNGQVVTW CIGHLLEQAQ PDAYDSRYAR 
WNLADLPIVP EKWQLQPRPS VTKQLNVIKR FLHQAGEIIH AGDPDREGQL LVDEVLDYLQ
LPAEKRQQVR RCLINDLNPQ AVERAIDRLR ANSDFVPLCV SALARARADW LYGINMTRAY
TILGRNAGYQ GVLSVGRVQT PVLGLVVRRD EEIENFVAKD FFEVKAHIVT PADERFTAIW
QPSEACEPYQ DEEGRLLHRP LAEHVVNRIN GQPALVTSYN DKRESESAPL PFSLSTLQIE
AAKRFGLSAQ NVLDICQKLY ETHKLITYPR SDCRYLPEEH FAGRQAVMNA ISVHAPDLLP
QPVVNPDTRN RCWDDKKVDA HHAIIPTARS SSVHLTENEA KVYTLIARQY LMQFCPDAVF
RKCVIELEIA KGKFVAKARF LAEAGWRTLL GSKERDEEND GTPLPVVAKG DELLCEKGEV
VERQTQPPRH FTDATLLSAM TGIARFVQDK DLKKILRATD GLGTEATRAG IIELLFKRSF
LTKKGRYIHS TDAGKALIHS LPEMAARPDM TAHWESVLTQ ISEKQCRYQD FMQPLVGTLY
QLIEQAKRTP VKRFRGIVAP GGGDKKKSAP RKRAGKKSPP AEETGRQTE