Gene SbBS512_E2018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2018 
SymboltopB 
ID6271493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1836243 
End bp1838204 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content55% 
IMG OID641726066 
ProductDNA topoisomerase III 
Protein accessionYP_001880560 
Protein GI187731958 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.000256516 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGTTGT TTATTGCCGA AAAACCGAGT CTGGCGCGCG CCATTGCTGA TGTCCTGCCC 
AAACCGCACC GGAAAGGCGA TGGCTTTATC GAGTGCGGTA ATGGTCAGGT GGTGACCTGG
TGTATCGGTC ACCTGCTTGA GCAGGCGCAG CCAGACGCCT ACGACAGCCG CTATGCGCGC
TGGAATCTTG CGGATTTGCC GATTGTCCCG GAAAAGTGGC AATTACAGCC CCGACCCTCC
GTGACCAAAC AACTTAACGT CATCAAACGC TTCCTGCATG AAGCCAGCGA AATCGTTCAC
GCCGGGGACC CGGATCGTGA AGGGCAACTG CTGGTGGATG AAGTGCTGGA CTATCTGCAA
CTGGCACCGG AAAAGCGCCA GCAGGTGCAG CGTTGCTTGA TAAACGACCT GAACCCGCAG
GCGGTTGAGC GGGCGATCGA CCGTCTTCGG TCCAACAGTG AGTTTGTGCC ATTGTGCGTT
TCTGCGCTGG CGCGAGCGCG TGCCGACTGG CTGTACGGCA TCAATATGAC CCGTGCGTAT
ACCATTCTCG GTCGCAATGC CGGTTATCAG GGCGTACTTT CCGTGGGACG CGTGCAGACA
CCCGTGCTCG GGCTGGTGGT GCGCCGCGAT GAAGAGATTG AAAACTTCGT GGCGAAAGAC
TTCTTTGAAG TCAAAGCACA TATCGTGACA CCTGCCGATG AGCGGTTTAC CGCTATCTGG
CAACCGAGCG AAGCGTGTGA ACCGTACCAG GATGAAGAAG GGCGCTTGTT ACATCGTCCA
CTGGCGGAGC ATGTGGTTAA CCGCATTAGT GGTCAACCAG CAATTGTCAC CAGCTATAAC
GATAAACGGG AATCAGAATC CGCGCCGCTG CCTTTTTCGC TTTCGGCGTT GCAGATTGAA
GCGGCAAAAC GCTTTGGTCT GAGCGCGCAG AACGTGCTTG ATATCTGCCA GAAGCTGTAC
GAAACGCACA AGCTAATCAC TTATCCGCGT TCCGATTGTC GCTATTTGCC AGAAGAACAT
TTTGCCGGAC GCCACGCGGT GATGAATGCC ATCAGCGTTC ATGCACCAGA TCTGTTGCCG
CAACCAGTGG TAGATCCAGA TATACGCAAC CGCTGTTGGG ATGACAAAAA GGTCGATGCA
CACCACGCCA TCATCCCGAC CGCACGGAGT TCTGAGATCA ACCTGACGGA GAACGAAGCG
AAGGTCTATA ACCTGATTGC CCGTCAGTAC TTGATGCAGT TCTGCCCGGA TGCGGTGTTC
CGCAAGTGTG TTATCGAACT GGACATTGCC AAAGGCAAAT TTGTCGCTAA AGCGCGTTTT
CTTGCTGAAG CAGGCTGGCG CACGCTGTTA GGCAGCAAAG AGCGCGATGA AGAAAACGAC
GGTACGCCAT TGCCTGTGGT GGCGAAAGGC GATGAGTTGC TGTGTGAAAA AGGTGAAGTG
GTAGAGCGGC AAACCCAGCC GCCGCGCCAT TTTACCGATG CAACACTGCT TTCGGCGATG
ACCGGGATCG CGCGCTTTGT GCAGGACAAA GATCTGAAAA AGATCCTTCG TGCGACCGAT
GGTCTGGGGA CAGAAGCAAC CCGTGCCGGG ATTATTGAAC TGCTGTTCAA GCGTGGTTTT
CTGACCAAAA AAGGGCGCTA TATCCACTCT ACCGACGCCG GAAAAGCGTT ATTCCATTCG
CTGCCAGAAA TGGCGACGCG ACCGGACATG ACCGCGCACT GGGAATCGGT GCTGACGCAA
ATCAGCGAAA AGCAGTGTCG CTATCAGGAC TTTATGCAGC CGCTGGTGGG GACGCTATAT
CAGCTTATTG ATCAAGCCAA ACGTACGCCG GTGCGGCAGT TTCGCGGCAT TGTGGCTCCG
GGCAGTGGTG GCAGTGCTGA TAAGAAAAAG GCTGCACCGC GTAAACGTAG TGCGAAAAAA
AGTCCGCCAG CAGATGAAGT CGGAAGCGGG GCGATAGCGT AA
 
Protein sequence
MRLFIAEKPS LARAIADVLP KPHRKGDGFI ECGNGQVVTW CIGHLLEQAQ PDAYDSRYAR 
WNLADLPIVP EKWQLQPRPS VTKQLNVIKR FLHEASEIVH AGDPDREGQL LVDEVLDYLQ
LAPEKRQQVQ RCLINDLNPQ AVERAIDRLR SNSEFVPLCV SALARARADW LYGINMTRAY
TILGRNAGYQ GVLSVGRVQT PVLGLVVRRD EEIENFVAKD FFEVKAHIVT PADERFTAIW
QPSEACEPYQ DEEGRLLHRP LAEHVVNRIS GQPAIVTSYN DKRESESAPL PFSLSALQIE
AAKRFGLSAQ NVLDICQKLY ETHKLITYPR SDCRYLPEEH FAGRHAVMNA ISVHAPDLLP
QPVVDPDIRN RCWDDKKVDA HHAIIPTARS SEINLTENEA KVYNLIARQY LMQFCPDAVF
RKCVIELDIA KGKFVAKARF LAEAGWRTLL GSKERDEEND GTPLPVVAKG DELLCEKGEV
VERQTQPPRH FTDATLLSAM TGIARFVQDK DLKKILRATD GLGTEATRAG IIELLFKRGF
LTKKGRYIHS TDAGKALFHS LPEMATRPDM TAHWESVLTQ ISEKQCRYQD FMQPLVGTLY
QLIDQAKRTP VRQFRGIVAP GSGGSADKKK AAPRKRSAKK SPPADEVGSG AIA