Gene Information |
Locus tag | SbBS512_0003 |
Symbol | |
ID | 6268317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010657 |
Strand | + |
Start bp | 738 |
End bp | 2891 |
Gene Length | 2154 bp |
Protein Length | 717 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641724211 |
Product | DNA topoisomerase 3 |
Protein accession | YP_001878771 |
Protein GI | 187730005 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 77 |
Plasmid unclonability p-value | 0.0000970275 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACTTT TTATCGCAGA AAAACCCGCA GTAGCAAATG ATATTGTTAA GGCACTTGGT GGCAATTTTA CCCGCCATGA TGGCTGGTTT GAAAGTGATA ACGCCATTGT GACTAACTGT TTTGGTCATA TTATCGAATC ACAACCGCCG GAAAACTATA ATCCTGAATA CAAAGCCTGG AAGGTTGAAA CGCTTCCTTT ACGTCTTTAT CCCGTGAAGT ATCAGCCTGT TGAAAGTGCC GCAAAACAGG TTAAAACGAT TCTCGAACTT ATCAGACGTG GAGACGTGAC TGAAATTGTT CACGCTGGCG ATCCTGATGA TGAGGGACAG TTACTTGTTG ATGAAGTCCT GGAATATGCA GGCAACACAA AACCCGTAAA GCGCGTTCTG ATTAACGACA ACACGCTTCC GGCAGTGAAA AAGGCACTGG CAAATCTTAA AGATAATCGT GATTTCAAAG GGCTTTACCT TAAGGCGCTG GCGCGTTCAG TTGCTGATGC CGTCTATGGC TTCTCCATGA CGCGTGCTTA CACCATTCCT GCAAAAGCCA GAGGATATCA GGGCGTTCTG TCTGTCGGGC GCGTCCAGAC TCCCGTTCTT GGCCTGATTG TGAATCGTAC CCGTGCTAAC CAGAACCATA AATCCAGTTT TTACTACACC ATGACCGGAG TCTTTCAGCG TGGTGCTGAT GTTATCAGGG CGAACTGGAA GCCAGGGGAA TTTGCTCCGC TGACAGACCG TAAATTACTT GATAAAGCGT GGGCAGACGG AACGGCAGCC TCCCTTGCAG GAAAACCGGC TACAGTTGAA GCGGCAGCAA CTGATGATAA AAAAACGGCT GCACCGTTGC CCTTTAACCT GGTACGACTC CAGCAATACA TGAACAAAAA GTTCAAAATG ACGGCACAAA AAACGCTGGA TATTACGCAA CAATTACGTG AAAAATATAA AGCGATTACT TATAACCGCT CTGATTGCTC ATATTTATCT GATGAACAAT TCAGCGAAGC GCCGCAGGTT ATCGATGCCC TGAAATCAGT CTTTCCTCAG TCGCTGGATA TTGATTCTTC ACGTAAAAGC AAAGCGTTTA ACAGTGCAAA GGTGACTGCG CATACTGCTA TAATCCCGAC CGCCAGTGTG CCTGATGTTA ACGCACTCAG CACCGACGAG CGCAATGTTT ACCTGGCGAT CGCACAACAC TATCTTGTTC AGTTCATGCC TGAAAAAGCA TACCAGGAAG TATCGGTTGC CATTCAGTGT GGTGATGAGT CGTTCTATGC CCGTGCCAGA AAAACAACTG ACAGCGGATT TGAGGCGTTT CTTGGCGCGG AAACCACAGA CGAAGGTGAA TCAGAAGATA ATGATGATTC CGCTTTTGAA CTGCTCTGTA AAATTCGCAC AGGAGAAACA CTGACGACAA AAGAAGTTAT TGTTAATGAG AAGAAAACAA CACCGCCGCC GTTATTCACC GAAGCCTCCT TGCTTGCTGC GCTTGTTCGT GTCGCGGATT TTGTCACTGA CCCAACGATT AAAAAATTGT TGAAGGATAA GGATAAAGAC AAAAAAGATG AACATGGTGG CATTGGTACG CCAGCTACCC GCGCAGCCAT TCTGGAAACG CTGAAGAAAA GAAACTATAT CACGCTGGAA AAAGGGAAAC TTATTCCTAC TGATACCGGA TATGCGCTTA TTGATGCCCT GCCTGGTATA GCGGTTAATC CTGATATGAC AGCATTATGG TCTGAAAAGC AGGCTGCCAT AGAAAATGGC GATCTGACGG TTGAACAGTT TATTAATGAT 
CTGTACGGTG AACTGACAGG CATGATTTCT GATGTTGACC TGGGCGAGAT GAAGATTGAA CCCGCTGCGC CAGCAGGGCA GTTTCAACGC CTGGACTCTC CCTGCCCTTC CTGTGGTAAA CATATTGTTA TCAGGCCGAA AGGTTATTTC TGTACCGGAT GTGAATTTAA AATCTGGAGT GAGTTTTATG GTAAGAAAAT CACCCAAGCA CAGGCCGAAA AACTGGTTAA ATCAGGGAAA ACCGATTTGA TTAAGGGATT TAAAAAGAAA AGTGGTGGAA CGTATGATAC AGTTCTTGTC CTTGAGGATA AGAAAACAGG GAAACTGGGT TTTCCGGCAA GGGCTAAGAA GTGA
|
Protein sequence | MRLFIAEKPA VANDIVKALG GNFTRHDGWF ESDNAIVTNC FGHIIESQPP ENYNPEYKAW KVETLPLRLY PVKYQPVESA AKQVKTILEL IRRGDVTEIV HAGDPDDEGQ LLVDEVLEYA GNTKPVKRVL INDNTLPAVK KALANLKDNR DFKGLYLKAL ARSVADAVYG FSMTRAYTIP AKARGYQGVL SVGRVQTPVL GLIVNRTRAN QNHKSSFYYT MTGVFQRGAD VIRANWKPGE FAPLTDRKLL DKAWADGTAA SLAGKPATVE AAATDDKKTA APLPFNLVRL QQYMNKKFKM TAQKTLDITQ QLREKYKAIT YNRSDCSYLS DEQFSEAPQV IDALKSVFPQ SLDIDSSRKS KAFNSAKVTA HTAIIPTASV PDVNALSTDE RNVYLAIAQH YLVQFMPEKA YQEVSVAIQC GDESFYARAR KTTDSGFEAF LGAETTDEGE SEDNDDSAFE LLCKIRTGET LTTKEVIVNE KKTTPPPLFT EASLLAALVR VADFVTDPTI KKLLKDKDKD KKDEHGGIGT PATRAAILET LKKRNYITLE KGKLIPTDTG YALIDALPGI AVNPDMTALW SEKQAAIENG DLTVEQFIND LYGELTGMIS DVDLGEMKIE PAAPAGQFQR LDSPCPSCGK HIVIRPKGYF CTGCEFKIWS EFYGKKITQA QAEKLVKSGK TDLIKGFKKK SGGTYDTVLV LEDKKTGKLG FPARAKK
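The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 738
end_bp = 2891
gene_length_bp = 2154
protein_length_aa = 717

# Assumption: coordinates are 1-based and inclusive.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
computed_protein_length = computed_length // 3 - 1

print("gene length matches:", computed_length == gene_length_bp)
print("length is a whole number of codons:", computed_length % 3 == 0)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions all three checks should agree with the listed values, which is consistent with the gene sequence ending in the stop codon TGA.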