Gene SNSL254_A1409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1409 
SymboltopB 
ID6483297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1378627 
End bp1380576 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content56% 
IMG OID642736801 
ProductDNA topoisomerase III 
Protein accessionYP_002040555 
Protein GI194442516 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0199719 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTTGT TTATTGCCGA AAAACCGAGT CTGGGGCGCG CCATTGCGGA TGTGCTGCCA 
AAACCGCACC GTAAAGGCGA TGGTTTTATT GAGTGCGGAA ACGGGCAGGT CGTCACCTGG
TGTATCGGTC ATTTGCTGGA ACAGGCGCAG CCGGATGCGT ATGACAGCCG TTATGCGCGC
TGGAATCTGG CTGACCTGCC TATCGTGCCG GAAAAATGGC AGCTTCAGCC TCGTCCTTCC
GTCACCAAAC AGCTCAATGT GATTAAGCGC TTTTTGCATC AAGCCGGTGA AATTATTCAC
GCTGGCGACC CGGATCGCGA AGGGCAGCTT CTGGTTGATG AAGTGCTGGA TTATCTCCAG
CTTCCTGCCG AAAAGCGCCA GCAGGTGCGG CGTTGTCTGA TAAACGACCT TAACCCGCAG
GCGGTCGAGC GTGCTATTGA CAGGCTGCGG GCGAACAGCG ACTTCGTGCC GCTGTGCGTC
TCCGCGCTGG CGCGAGCGCG AGCGGACTGG CTGTATGGCA TTAATATGAC CCGCGCTTAC
ACGATCCTGG GCCGGAATGC CGGCTATCAG GGCGTGTTAT CCGTGGGACG CGTACAGACT
CCGGTACTGG GGCTGGTGGT GCGGCGAGAC GAAGAGATTG AGAACTTCGT CGCCAAAGAC
TTCTTTGAAG TAAAGGCGCA CATCGTTACG CCTGCCGACG AGCGTTTTAC CGCTATCTGG
CAGCCGAGCG AGGCGTGCGA ACCTTATCAG GATGAAGAGG GGCGCTTGCT TCATCGTCCG
CTGGCGGAGC ATGTAGTGAA CCGAATCAAC GGTCAGCCCG CGCTGGTAAC CAGTTATAAT
GATAAACGGG AATCAGAATC CGCGCCGCTG CCGTTTTCGC TCTCGACGCT ACAGATTGAA
GCCGCCAAAC GCTTTGGCCT GAGCGCGCAA AACGTGCTTG ATATTTGTCA GAAGCTCTAT
GAAACCCACA AACTGATTAC CTATCCGCGT TCCGACTGCC GTTATCTGCC GGAAGAACAC
TTTGCCGGAC GGCAGGCGGT CATGAACGCG ATTAGCGTCC ACGCCCCGGA TTTACTGCCG
CAGCCTGTGG TTAATCCTGA TACGCGCAAT CGCTGCTGGG ATGACAAAAA AGTGGATGCG
CACCACGCGA TTATCCCGAC GGCGCGCAGT TCTTCTGTCC ATCTGACGGA AAACGAAGCG
AAAGTGTACA CCCTGATTGC GCGTCAGTAT CTGATGCAGT TCTGCCCGGA CGCGGTGTTT
CGTAAATGCG TTATTGAACT GGAAATCGCC AAAGGGAAAT TTGTCGCCAA AGCGCGTTTT
CTGGCGGAGG CCGGTTGGCG GACGTTACTG GGCAGTAAAG AGCGCGACGA GGAAAACGAC
GGTACGCCGC TGCCGGTTGT CGCCAAAGGT GATGAGTTGC TGTGTGAAAA GGGGGAAGTG
GTCGAGCGCC AAACCCAGCC GCCGCGTCAT TTTACTGATG CGACATTGCT TTCCGCGATG
ACCGGAATTG CCCGCTTCGT GCAGGATAAA GATCTGAAAA AGATCCTGCG CGCGACCGAT
GGGCTGGGGA CGGAAGCCAC GCGCGCCGGG ATTATCGAGC TGCTGTTCAA ACGTAGCTTT
CTGACCAAAA AAGGGCGCTA CATTCATTCT ACCGATGCTG GCAAAGCGTT AATACATTCG
CTGCCGGAAA TGGCGGCCCG TCCAGATATG ACCGCGCACT GGGAATCTGT TTTGACGCAA
ATCAGCGAAA AGCAGTGCCG TTACCAGGAT TTCATGCAAC CGCTGGTCGG CACGTTATAT
CAGCTGATCG AGCAGGCTAA GCGCACGCCG GTGAAGCGCT TCAGAGGGAT AGTCGCGCCA
GGCGGTGGAG ACAAGAAAAA GAGCGCGCCG CGTAAGCGAG CGGGCAAAAA AAGCCCGCCT
GCTGAGGAGA CAGGCCGTCA GACCGAATAA
 
Protein sequence
MRLFIAEKPS LGRAIADVLP KPHRKGDGFI ECGNGQVVTW CIGHLLEQAQ PDAYDSRYAR 
WNLADLPIVP EKWQLQPRPS VTKQLNVIKR FLHQAGEIIH AGDPDREGQL LVDEVLDYLQ
LPAEKRQQVR RCLINDLNPQ AVERAIDRLR ANSDFVPLCV SALARARADW LYGINMTRAY
TILGRNAGYQ GVLSVGRVQT PVLGLVVRRD EEIENFVAKD FFEVKAHIVT PADERFTAIW
QPSEACEPYQ DEEGRLLHRP LAEHVVNRIN GQPALVTSYN DKRESESAPL PFSLSTLQIE
AAKRFGLSAQ NVLDICQKLY ETHKLITYPR SDCRYLPEEH FAGRQAVMNA ISVHAPDLLP
QPVVNPDTRN RCWDDKKVDA HHAIIPTARS SSVHLTENEA KVYTLIARQY LMQFCPDAVF
RKCVIELEIA KGKFVAKARF LAEAGWRTLL GSKERDEEND GTPLPVVAKG DELLCEKGEV
VERQTQPPRH FTDATLLSAM TGIARFVQDK DLKKILRATD GLGTEATRAG IIELLFKRSF
LTKKGRYIHS TDAGKALIHS LPEMAARPDM TAHWESVLTQ ISEKQCRYQD FMQPLVGTLY
QLIEQAKRTP VKRFRGIVAP GGGDKKKSAP RKRAGKKSPP AEETGRQTE