Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4808 |
Symbol | |
ID | 6270587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4476714 |
End bp | 4480127 |
Gene Length | 3414 bp |
Protein Length | 1137 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641728550 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_001882945 |
Protein GI | 187730438 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.564367 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCAGT CTCTCAATTT TGAAATGTTG CGCAGCCAGT GGCCGGAACT GGCAGAACTC GCATGTATGG CAGAGCGCTA TGTTCACTCC GATCCGGAAA GCTGCCTGGT TAAGCTGCGC AACTACACCG AATTGATGGT GCGCTGGTTG TATCGTCAGG AGCGGTTGCC GGAAGGTATT AAGGCTAATC TTTACGATTT AATGAACGCT GATGTCTTTA CCAGCATGAT GCCGGAAGCC ATCATCATGA AAATGGATGC TCTGCGTATC CATGGCAACC GTGCCGCGCA CGGCGGACGT ATCAAAGCTA AAGATACTTA CTGGCTGCTC AAAGAAGCGT ATTTGTTGGG AATTTGGCTG TATGTTCGCT ACGCCCACGG TAATGTTGAT GACTGCCCCA AATTTACACT CCCTCCATTA ACACAATCTT CAGGTCGTGC AGATGAAAAA CGTCTGGAAG ATGCAATCAA GGCTCAGGAT GAAAGCCGTG AGCGCGAACT GGCGCTACAA CGCGCACTAC AGCAAGAACA GGAAAAGGCC GAACACCTCA CTCAACGCCT GAATGAAGCC AGAGCGCGTA ACAAGCATGT TGCCGATATT CTCTCTATTG ATGAAGCGGA AACCCGCCGT CGCCTGATTG ACTCTCGCTT ACTTGCAGCC GACTGGAATG TAGGGGAAGA ACTTAAAAAC ACGGATCAAG TCACACAAGA ACATCCAGTC AAAGAGCAAC CCACCACCTC TGGTGACGGT TATGCAGACT ATGTCTTGTG GGATGAGGCA CACAAACCGT TGGCGGTGGT GGAAGCAAAA AAAACCAGCG TCAATGCCGA GCAAGGACGA ATTCAGGCCC GGCTGTATGC AGACTGGCTG GAAAAAGAAT ACGGTCAGCG TCCGGTCATC TTCTACACCA ACGGCTATGA TATCTGGCTG TGGGATGACC ATAAAACTCA TGGTTATCCC CCGCGTCGGG TGTTTGGCTT CTACAGCAAG GAAAGCCTGC AATATCTTAT TCAGCAGCGT GAAACCCGTC TGCCGCTGAA CAGAGTGCCA CACGTAAAAG ATAACGAAGG TAAGGCCGTT GCCGGACGTT TGTACCAGCT CGAAACCATT GCCCGCGTCA GTGAACGGTT TACCAACAAG TACCGACAAT CGTTAATTGT TCAGGCTACG GGTACAGGCA AAACCCGCGT GGCAATTGCA CTAAGCAAAC TGATGATTGA TGCCCGCTGG GTAAAACGCG TCCTGTTTCT TTGCGACCGT AAAGAACTGC GTAAACAGGC GGCAAATGCC TTTAATCAAT TCACCAATGA GCCGCTGTAT GTGGTCGGGA AATCGAAAAA AGCAGACAGG CAGAATGCCA GAATCTACAT CGCTACCTAT CCCGGTATGA TGAAAATCAT GGACCATTTC GATGTCGGTT ATTTCGATTT GATCATCGCC GATGAATCCC ACCGCTCTAT CTACAACGTC TACGGCGATC TGTTTAAGTA TTTTGATGCC CTCCAGATTG GCCTGACGGC CACGCCAATT GACATGGTGA GTAAAACCAC TTTCGGCCTG TTTGGTTGTG AAGGACGTAT TCCTACCGCC AACTACAGCC TGGAAGATGC CATCGCCGAT AACAATCTGG TTCCCTATGA GGTTGTCACA CACACAACAG AATTCCTGCG CGAAGGTATC AAACGGGAGA AATTAAGCGA CGCACAAATC CGCGAGCTGG AAGAACAAGG CATAGATCCC AATACGCTGG AGTTTGATGG CAAGGCGCTG GATGAGGCGA TCTACAACAA AGACACCAAC CGCTATATCC TGCGTAACCT GATGGAAAAT GGCCTGAAAG ATCGGGACGG CCAGTTGCCT GGTAAAACCA TCATTTTCGC CCGTAACCAC AAACACGCGC TGCTGTTGAA TGAATTGTTC GACGATATGT ACCCACAGTT TGCCGGGCGC TTCTGCCAGG TCATCGACAA CTACGATCCC CGCGCCGAGC AACTGATTGA TGATTTTAAA GGGCTGGATG AAAGCACCAA CAAAGAGCTG ACCATCGCTA TCTCCGTTGA CATGCTCGAT ACCGGTATTG ATGTCCCGGA AATTGTGAAT CTGGTCTTTG CCAAACCGGT CAAATCAAAA GTGAAGTTCT GGCAGATGAT TGGCCGTGGT ACGCGCCTCT GTCCGGGGTT GTACGGTTAC GACGATAACG GCAAACCGCT GGATAAACAG AAATTCCGCA TTTTCGATCA CTGGGGCAAT TTTGAGTATC ACGAGCTGCA TACCGAAGAG GCGGAAGTCA CAGCGACAAA ATCGCTTGCG CAAAAACGCT TCGAAGCGTG GGTTATGCTG GGGGCCGCTG CGCAGCGTAA ATTCGACAAA CAGGCGGTGG ATTTAGTTGC TCACCAGCTT CGCGAGCAAA TCAACGCACT GGATGAAAAG TCGATTGCCG TACAGGAAAA ATGGCAGCAG AAAGCGCAGT ACAGTGATGA AAAGGTGCTG CGCCAGCTTT CTCCGAAAAC ACAGCAGGAT CTGCTATCTG TTCTGGCCCC GTTGATGCAG TGGCTGGACG TGCGCGGGCA AAGCGATGCC ATACGCTTTG ATATGGATAT TCTGGCGGCG CAAACTGCCC GTTATACCAA CCCGGAAGAG CTGGATGTTC TCTGGCCGAT CATCGTCGAA AAAGTGGAGC GCCTGCCGCC GCATTTGGCG CAGGTGCAGC AACAGGGACA ACGAATCAAT CAGCTTCGTG ATTTAAGCTG GTGGAAGCAG GCCAGCCTGG AAGAGCTGGA GGATATTCGC ATTCATTTGC GCGGTATCAT GCACCTGATG GAAAAAGACG CGACGCCTAA ATTTGGTTCG ATACAGGTGG ATATTACCGA AGATGCGAAC CTGATCCAGA CGGAGACCCG CAAAACTAAT ATCCGCTCGA TTGATTTCAA ACTCTATCGC CAGCAGGTTC AGGGGGCGCT GGAGCCGCTG TTCCAGCAAA ACCCGGTTCT GAAGAAAATC CGCAACGGCG AGCCAGTCAC GCAAAACGAG CTGGATGAGC TGGCGAAGCT GGTGCTGATC CAGAACCCCA ACGTTGATAT TCGTGCGTTG AAAGAGTTTT ATCCGCAGGC AACCGCCAGC CTGGATAAAC TTTTGCGTAC CATCATCGGG ATGGACAGCG ACGCGGTGGA AGTGCGTTTT GCCCAGTTCG CCGCTGATAA CAGCCTGACC AGCCAACAAC TGCGCTTCCT GTCATTGCTG AAAAACCACA TTCGCGATTA CGGCACCATT GAAATGCGGC AGCTCTTTGA ACAGCCTTTT ACGCATATCC ACAACGAAGG CGTTACCGGT GTGTTCCCTG ATATAGCGCA AATCGCCCGC CTGCAAAAGA TAGTCGAAGA GCTGGGTGTT GTGACCGACG CAGCGACGGT ATAA
|
Protein sequence | MEQSLNFEML RSQWPELAEL ACMAERYVHS DPESCLVKLR NYTELMVRWL YRQERLPEGI KANLYDLMNA DVFTSMMPEA IIMKMDALRI HGNRAAHGGR IKAKDTYWLL KEAYLLGIWL YVRYAHGNVD DCPKFTLPPL TQSSGRADEK RLEDAIKAQD ESRERELALQ RALQQEQEKA EHLTQRLNEA RARNKHVADI LSIDEAETRR RLIDSRLLAA DWNVGEELKN TDQVTQEHPV KEQPTTSGDG YADYVLWDEA HKPLAVVEAK KTSVNAEQGR IQARLYADWL EKEYGQRPVI FYTNGYDIWL WDDHKTHGYP PRRVFGFYSK ESLQYLIQQR ETRLPLNRVP HVKDNEGKAV AGRLYQLETI ARVSERFTNK YRQSLIVQAT GTGKTRVAIA LSKLMIDARW VKRVLFLCDR KELRKQAANA FNQFTNEPLY VVGKSKKADR QNARIYIATY PGMMKIMDHF DVGYFDLIIA DESHRSIYNV YGDLFKYFDA LQIGLTATPI DMVSKTTFGL FGCEGRIPTA NYSLEDAIAD NNLVPYEVVT HTTEFLREGI KREKLSDAQI RELEEQGIDP NTLEFDGKAL DEAIYNKDTN RYILRNLMEN GLKDRDGQLP GKTIIFARNH KHALLLNELF DDMYPQFAGR FCQVIDNYDP RAEQLIDDFK GLDESTNKEL TIAISVDMLD TGIDVPEIVN LVFAKPVKSK VKFWQMIGRG TRLCPGLYGY DDNGKPLDKQ KFRIFDHWGN FEYHELHTEE AEVTATKSLA QKRFEAWVML GAAAQRKFDK QAVDLVAHQL REQINALDEK SIAVQEKWQQ KAQYSDEKVL RQLSPKTQQD LLSVLAPLMQ WLDVRGQSDA IRFDMDILAA QTARYTNPEE LDVLWPIIVE KVERLPPHLA QVQQQGQRIN QLRDLSWWKQ ASLEELEDIR IHLRGIMHLM EKDATPKFGS IQVDITEDAN LIQTETRKTN IRSIDFKLYR QQVQGALEPL FQQNPVLKKI RNGEPVTQNE LDELAKLVLI QNPNVDIRAL KEFYPQATAS LDKLLRTIIG MDSDAVEVRF AQFAADNSLT SQQLRFLSLL KNHIRDYGTI EMRQLFEQPF THIHNEGVTG VFPDIAQIAR LQKIVEELGV VTDAATV
|
| |