Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4923 |
Symbol | |
ID | 6272441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 4588906 |
End bp | 4589769 |
Gene Length | 864 bp |
Protein Length | 287 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641728651 |
Product | radical SAM domain protein |
Protein accession | YP_001883042 |
Protein GI | 187731719 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1180] Pyruvate-formate lyase-activating enzyme |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGCA GATGCGCTTT AGTCAGTAAG ATTATCCCCT TCTCCTGCGT TGACGGGCCA GGCAGTCGTC TGGCTCTGTT TCTGCAGGGC TGCAATCTGC GCTGCAAAAA CTGTCACAAT CCGTGGACGA TGGGACGCTG CAATGATTGC GGAGAGTGCG TGCCACAGTG TCCGCATCAG GCGTTGCAGA TTGTTGACGG CAAAGTGCTG TGGAACGCTG TGGTCTGCGA GCAGTGTGAT ACCTGCCTGA AGATGTGTCC GCAACATGCC ACGCCAATGG CGCAATCCAT GAGCGTGGAC GAAGTGCTTA GCCATGTCCG CAAAGCGGTG CTGTTTATTG AAGGGATAAC GGTAAGCGGC GGCGAAGCCA CGACCCAACT GCCGTTTGTA GTGGCGCTGT TTACTGCTAT CAAAAACGAT CCGCAACTGC GCCACCTCAC CTGCCTGGTG GACAGTAACG GCATGTTGAG CGAAACCGGC TGGGAAAAAT TACTCCCGGT GTGTGACGGC GCAATGCTCG ATCTCAAAGC GTGGGGGAGC GAATGTCATC AACACTTAAC CGGACGCGAT AATCAGCAGA TTAAGCGCAG CATCTGTTTG CTTGCAGAGC GCGGCAAGCT GGCGGAACTG CGTTTGCTGG TAATTCCAGA CCAGGTGGAT TATTTGCATC ACATCGATGA GCTGGCGACG TTTATCAAGA GACTTGGCGA TGTTCCGGTT CGCCTGAATG CGTTTCATAC CCACGGCGTG TATGGCGAGG CGCAAAGCTG GGCGAGCGCC ACGCCGGAAG ACGTTGAGCC GTTGGCTGAT GCGTTAAAGG TGCGCGGGGT GAGCCGGTTG ATATTTCCGG CGCTCTATTT GTGA
|
Protein sequence | MNSRCALVSK IIPFSCVDGP GSRLALFLQG CNLRCKNCHN PWTMGRCNDC GECVPQCPHQ ALQIVDGKVL WNAVVCEQCD TCLKMCPQHA TPMAQSMSVD EVLSHVRKAV LFIEGITVSG GEATTQLPFV VALFTAIKND PQLRHLTCLV DSNGMLSETG WEKLLPVCDG AMLDLKAWGS ECHQHLTGRD NQQIKRSICL LAERGKLAEL RLLVIPDQVD YLHHIDELAT FIKRLGDVPV RLNAFHTHGV YGEAQSWASA TPEDVEPLAD ALKVRGVSRL IFPALYL
|
| |