Gene SbBS512_0003 details

Gene Information

Locus tag: SbBS512_0003
Symbol:
ID: 6268317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010657
Strand:
Start bp: 738
End bp: 2891
Gene Length: 2154 bp
Protein Length: 717 aa
Translation table: 11
GC content: 46%
IMG OID: 641724211
Product: DNA topoisomerase 3
Protein accession: YP_001878771
Protein GI: 187730005
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid
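
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming the CDS ends with one uncounted stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(738, 2891)
print(length)                     # 2154
print(protein_length_aa(length))  # 717
```

Both results match the Gene Length and Protein Length values listed in the record.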


Plasmid Coverage information

Num covering plasmid clones: 77
Plasmid unclonability p-value: 0.0000970275
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGACTTT TTATCGCAGA AAAACCCGCA GTAGCAAATG ATATTGTTAA GGCACTTGGT 
GGCAATTTTA CCCGCCATGA TGGCTGGTTT GAAAGTGATA ACGCCATTGT GACTAACTGT
TTTGGTCATA TTATCGAATC ACAACCGCCG GAAAACTATA ATCCTGAATA CAAAGCCTGG
AAGGTTGAAA CGCTTCCTTT ACGTCTTTAT CCCGTGAAGT ATCAGCCTGT TGAAAGTGCC
GCAAAACAGG TTAAAACGAT TCTCGAACTT ATCAGACGTG GAGACGTGAC TGAAATTGTT
CACGCTGGCG ATCCTGATGA TGAGGGACAG TTACTTGTTG ATGAAGTCCT GGAATATGCA
GGCAACACAA AACCCGTAAA GCGCGTTCTG ATTAACGACA ACACGCTTCC GGCAGTGAAA
AAGGCACTGG CAAATCTTAA AGATAATCGT GATTTCAAAG GGCTTTACCT TAAGGCGCTG
GCGCGTTCAG TTGCTGATGC CGTCTATGGC TTCTCCATGA CGCGTGCTTA CACCATTCCT
GCAAAAGCCA GAGGATATCA GGGCGTTCTG TCTGTCGGGC GCGTCCAGAC TCCCGTTCTT
GGCCTGATTG TGAATCGTAC CCGTGCTAAC CAGAACCATA AATCCAGTTT TTACTACACC
ATGACCGGAG TCTTTCAGCG TGGTGCTGAT GTTATCAGGG CGAACTGGAA GCCAGGGGAA
TTTGCTCCGC TGACAGACCG TAAATTACTT GATAAAGCGT GGGCAGACGG AACGGCAGCC
TCCCTTGCAG GAAAACCGGC TACAGTTGAA GCGGCAGCAA CTGATGATAA AAAAACGGCT
GCACCGTTGC CCTTTAACCT GGTACGACTC CAGCAATACA TGAACAAAAA GTTCAAAATG
ACGGCACAAA AAACGCTGGA TATTACGCAA CAATTACGTG AAAAATATAA AGCGATTACT
TATAACCGCT CTGATTGCTC ATATTTATCT GATGAACAAT TCAGCGAAGC GCCGCAGGTT
ATCGATGCCC TGAAATCAGT CTTTCCTCAG TCGCTGGATA TTGATTCTTC ACGTAAAAGC
AAAGCGTTTA ACAGTGCAAA GGTGACTGCG CATACTGCTA TAATCCCGAC CGCCAGTGTG
CCTGATGTTA ACGCACTCAG CACCGACGAG CGCAATGTTT ACCTGGCGAT CGCACAACAC
TATCTTGTTC AGTTCATGCC TGAAAAAGCA TACCAGGAAG TATCGGTTGC CATTCAGTGT
GGTGATGAGT CGTTCTATGC CCGTGCCAGA AAAACAACTG ACAGCGGATT TGAGGCGTTT
CTTGGCGCGG AAACCACAGA CGAAGGTGAA TCAGAAGATA ATGATGATTC CGCTTTTGAA
CTGCTCTGTA AAATTCGCAC AGGAGAAACA CTGACGACAA AAGAAGTTAT TGTTAATGAG
AAGAAAACAA CACCGCCGCC GTTATTCACC GAAGCCTCCT TGCTTGCTGC GCTTGTTCGT
GTCGCGGATT TTGTCACTGA CCCAACGATT AAAAAATTGT TGAAGGATAA GGATAAAGAC
AAAAAAGATG AACATGGTGG CATTGGTACG CCAGCTACCC GCGCAGCCAT TCTGGAAACG
CTGAAGAAAA GAAACTATAT CACGCTGGAA AAAGGGAAAC TTATTCCTAC TGATACCGGA
TATGCGCTTA TTGATGCCCT GCCTGGTATA GCGGTTAATC CTGATATGAC AGCATTATGG
TCTGAAAAGC AGGCTGCCAT AGAAAATGGC GATCTGACGG TTGAACAGTT TATTAATGAT
CTGTACGGTG AACTGACAGG CATGATTTCT GATGTTGACC TGGGCGAGAT GAAGATTGAA
CCCGCTGCGC CAGCAGGGCA GTTTCAACGC CTGGACTCTC CCTGCCCTTC CTGTGGTAAA
CATATTGTTA TCAGGCCGAA AGGTTATTTC TGTACCGGAT GTGAATTTAA AATCTGGAGT
GAGTTTTATG GTAAGAAAAT CACCCAAGCA CAGGCCGAAA AACTGGTTAA ATCAGGGAAA
ACCGATTTGA TTAAGGGATT TAAAAAGAAA AGTGGTGGAA CGTATGATAC AGTTCTTGTC
CTTGAGGATA AGAAAACAGG GAAACTGGGT TTTCCGGCAA GGGCTAAGAA GTGA
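
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any length or composition check. A minimal sketch of one way to do that and to compute GC content is shown here; the helper names are hypothetical, and only the first row of the sequence is used as sample input, so the printed GC value is for that row alone and is not the 46% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGAGACTTT TTATCGCAGA AAAACCCGCA GTAGCAAATG ATATTGTTAA GGCACTTGGT"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence text to `clean_sequence` would allow the joined length to be compared against the Gene Length field.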
 
Protein sequence
MRLFIAEKPA VANDIVKALG GNFTRHDGWF ESDNAIVTNC FGHIIESQPP ENYNPEYKAW 
KVETLPLRLY PVKYQPVESA AKQVKTILEL IRRGDVTEIV HAGDPDDEGQ LLVDEVLEYA
GNTKPVKRVL INDNTLPAVK KALANLKDNR DFKGLYLKAL ARSVADAVYG FSMTRAYTIP
AKARGYQGVL SVGRVQTPVL GLIVNRTRAN QNHKSSFYYT MTGVFQRGAD VIRANWKPGE
FAPLTDRKLL DKAWADGTAA SLAGKPATVE AAATDDKKTA APLPFNLVRL QQYMNKKFKM
TAQKTLDITQ QLREKYKAIT YNRSDCSYLS DEQFSEAPQV IDALKSVFPQ SLDIDSSRKS
KAFNSAKVTA HTAIIPTASV PDVNALSTDE RNVYLAIAQH YLVQFMPEKA YQEVSVAIQC
GDESFYARAR KTTDSGFEAF LGAETTDEGE SEDNDDSAFE LLCKIRTGET LTTKEVIVNE
KKTTPPPLFT EASLLAALVR VADFVTDPTI KKLLKDKDKD KKDEHGGIGT PATRAAILET
LKKRNYITLE KGKLIPTDTG YALIDALPGI AVNPDMTALW SEKQAAIENG DLTVEQFIND
LYGELTGMIS DVDLGEMKIE PAAPAGQFQR LDSPCPSCGK HIVIRPKGYF CTGCEFKIWS
EFYGKKITQA QAEKLVKSGK TDLIKGFKKK SGGTYDTVLV LEDKKTGKLG FPARAKK