Gene SbBS512_E4808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4808 
Symbol 
ID6270587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4476714 
End bp4480127 
Gene Length3414 bp 
Protein Length1137 aa 
Translation table11 
GC content51% 
IMG OID641728550 
Producttype III restriction enzyme, res subunit 
Protein accessionYP_001882945 
Protein GI187730438 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.564367 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCAGT CTCTCAATTT TGAAATGTTG CGCAGCCAGT GGCCGGAACT GGCAGAACTC 
GCATGTATGG CAGAGCGCTA TGTTCACTCC GATCCGGAAA GCTGCCTGGT TAAGCTGCGC
AACTACACCG AATTGATGGT GCGCTGGTTG TATCGTCAGG AGCGGTTGCC GGAAGGTATT
AAGGCTAATC TTTACGATTT AATGAACGCT GATGTCTTTA CCAGCATGAT GCCGGAAGCC
ATCATCATGA AAATGGATGC TCTGCGTATC CATGGCAACC GTGCCGCGCA CGGCGGACGT
ATCAAAGCTA AAGATACTTA CTGGCTGCTC AAAGAAGCGT ATTTGTTGGG AATTTGGCTG
TATGTTCGCT ACGCCCACGG TAATGTTGAT GACTGCCCCA AATTTACACT CCCTCCATTA
ACACAATCTT CAGGTCGTGC AGATGAAAAA CGTCTGGAAG ATGCAATCAA GGCTCAGGAT
GAAAGCCGTG AGCGCGAACT GGCGCTACAA CGCGCACTAC AGCAAGAACA GGAAAAGGCC
GAACACCTCA CTCAACGCCT GAATGAAGCC AGAGCGCGTA ACAAGCATGT TGCCGATATT
CTCTCTATTG ATGAAGCGGA AACCCGCCGT CGCCTGATTG ACTCTCGCTT ACTTGCAGCC
GACTGGAATG TAGGGGAAGA ACTTAAAAAC ACGGATCAAG TCACACAAGA ACATCCAGTC
AAAGAGCAAC CCACCACCTC TGGTGACGGT TATGCAGACT ATGTCTTGTG GGATGAGGCA
CACAAACCGT TGGCGGTGGT GGAAGCAAAA AAAACCAGCG TCAATGCCGA GCAAGGACGA
ATTCAGGCCC GGCTGTATGC AGACTGGCTG GAAAAAGAAT ACGGTCAGCG TCCGGTCATC
TTCTACACCA ACGGCTATGA TATCTGGCTG TGGGATGACC ATAAAACTCA TGGTTATCCC
CCGCGTCGGG TGTTTGGCTT CTACAGCAAG GAAAGCCTGC AATATCTTAT TCAGCAGCGT
GAAACCCGTC TGCCGCTGAA CAGAGTGCCA CACGTAAAAG ATAACGAAGG TAAGGCCGTT
GCCGGACGTT TGTACCAGCT CGAAACCATT GCCCGCGTCA GTGAACGGTT TACCAACAAG
TACCGACAAT CGTTAATTGT TCAGGCTACG GGTACAGGCA AAACCCGCGT GGCAATTGCA
CTAAGCAAAC TGATGATTGA TGCCCGCTGG GTAAAACGCG TCCTGTTTCT TTGCGACCGT
AAAGAACTGC GTAAACAGGC GGCAAATGCC TTTAATCAAT TCACCAATGA GCCGCTGTAT
GTGGTCGGGA AATCGAAAAA AGCAGACAGG CAGAATGCCA GAATCTACAT CGCTACCTAT
CCCGGTATGA TGAAAATCAT GGACCATTTC GATGTCGGTT ATTTCGATTT GATCATCGCC
GATGAATCCC ACCGCTCTAT CTACAACGTC TACGGCGATC TGTTTAAGTA TTTTGATGCC
CTCCAGATTG GCCTGACGGC CACGCCAATT GACATGGTGA GTAAAACCAC TTTCGGCCTG
TTTGGTTGTG AAGGACGTAT TCCTACCGCC AACTACAGCC TGGAAGATGC CATCGCCGAT
AACAATCTGG TTCCCTATGA GGTTGTCACA CACACAACAG AATTCCTGCG CGAAGGTATC
AAACGGGAGA AATTAAGCGA CGCACAAATC CGCGAGCTGG AAGAACAAGG CATAGATCCC
AATACGCTGG AGTTTGATGG CAAGGCGCTG GATGAGGCGA TCTACAACAA AGACACCAAC
CGCTATATCC TGCGTAACCT GATGGAAAAT GGCCTGAAAG ATCGGGACGG CCAGTTGCCT
GGTAAAACCA TCATTTTCGC CCGTAACCAC AAACACGCGC TGCTGTTGAA TGAATTGTTC
GACGATATGT ACCCACAGTT TGCCGGGCGC TTCTGCCAGG TCATCGACAA CTACGATCCC
CGCGCCGAGC AACTGATTGA TGATTTTAAA GGGCTGGATG AAAGCACCAA CAAAGAGCTG
ACCATCGCTA TCTCCGTTGA CATGCTCGAT ACCGGTATTG ATGTCCCGGA AATTGTGAAT
CTGGTCTTTG CCAAACCGGT CAAATCAAAA GTGAAGTTCT GGCAGATGAT TGGCCGTGGT
ACGCGCCTCT GTCCGGGGTT GTACGGTTAC GACGATAACG GCAAACCGCT GGATAAACAG
AAATTCCGCA TTTTCGATCA CTGGGGCAAT TTTGAGTATC ACGAGCTGCA TACCGAAGAG
GCGGAAGTCA CAGCGACAAA ATCGCTTGCG CAAAAACGCT TCGAAGCGTG GGTTATGCTG
GGGGCCGCTG CGCAGCGTAA ATTCGACAAA CAGGCGGTGG ATTTAGTTGC TCACCAGCTT
CGCGAGCAAA TCAACGCACT GGATGAAAAG TCGATTGCCG TACAGGAAAA ATGGCAGCAG
AAAGCGCAGT ACAGTGATGA AAAGGTGCTG CGCCAGCTTT CTCCGAAAAC ACAGCAGGAT
CTGCTATCTG TTCTGGCCCC GTTGATGCAG TGGCTGGACG TGCGCGGGCA AAGCGATGCC
ATACGCTTTG ATATGGATAT TCTGGCGGCG CAAACTGCCC GTTATACCAA CCCGGAAGAG
CTGGATGTTC TCTGGCCGAT CATCGTCGAA AAAGTGGAGC GCCTGCCGCC GCATTTGGCG
CAGGTGCAGC AACAGGGACA ACGAATCAAT CAGCTTCGTG ATTTAAGCTG GTGGAAGCAG
GCCAGCCTGG AAGAGCTGGA GGATATTCGC ATTCATTTGC GCGGTATCAT GCACCTGATG
GAAAAAGACG CGACGCCTAA ATTTGGTTCG ATACAGGTGG ATATTACCGA AGATGCGAAC
CTGATCCAGA CGGAGACCCG CAAAACTAAT ATCCGCTCGA TTGATTTCAA ACTCTATCGC
CAGCAGGTTC AGGGGGCGCT GGAGCCGCTG TTCCAGCAAA ACCCGGTTCT GAAGAAAATC
CGCAACGGCG AGCCAGTCAC GCAAAACGAG CTGGATGAGC TGGCGAAGCT GGTGCTGATC
CAGAACCCCA ACGTTGATAT TCGTGCGTTG AAAGAGTTTT ATCCGCAGGC AACCGCCAGC
CTGGATAAAC TTTTGCGTAC CATCATCGGG ATGGACAGCG ACGCGGTGGA AGTGCGTTTT
GCCCAGTTCG CCGCTGATAA CAGCCTGACC AGCCAACAAC TGCGCTTCCT GTCATTGCTG
AAAAACCACA TTCGCGATTA CGGCACCATT GAAATGCGGC AGCTCTTTGA ACAGCCTTTT
ACGCATATCC ACAACGAAGG CGTTACCGGT GTGTTCCCTG ATATAGCGCA AATCGCCCGC
CTGCAAAAGA TAGTCGAAGA GCTGGGTGTT GTGACCGACG CAGCGACGGT ATAA
 
Protein sequence
MEQSLNFEML RSQWPELAEL ACMAERYVHS DPESCLVKLR NYTELMVRWL YRQERLPEGI 
KANLYDLMNA DVFTSMMPEA IIMKMDALRI HGNRAAHGGR IKAKDTYWLL KEAYLLGIWL
YVRYAHGNVD DCPKFTLPPL TQSSGRADEK RLEDAIKAQD ESRERELALQ RALQQEQEKA
EHLTQRLNEA RARNKHVADI LSIDEAETRR RLIDSRLLAA DWNVGEELKN TDQVTQEHPV
KEQPTTSGDG YADYVLWDEA HKPLAVVEAK KTSVNAEQGR IQARLYADWL EKEYGQRPVI
FYTNGYDIWL WDDHKTHGYP PRRVFGFYSK ESLQYLIQQR ETRLPLNRVP HVKDNEGKAV
AGRLYQLETI ARVSERFTNK YRQSLIVQAT GTGKTRVAIA LSKLMIDARW VKRVLFLCDR
KELRKQAANA FNQFTNEPLY VVGKSKKADR QNARIYIATY PGMMKIMDHF DVGYFDLIIA
DESHRSIYNV YGDLFKYFDA LQIGLTATPI DMVSKTTFGL FGCEGRIPTA NYSLEDAIAD
NNLVPYEVVT HTTEFLREGI KREKLSDAQI RELEEQGIDP NTLEFDGKAL DEAIYNKDTN
RYILRNLMEN GLKDRDGQLP GKTIIFARNH KHALLLNELF DDMYPQFAGR FCQVIDNYDP
RAEQLIDDFK GLDESTNKEL TIAISVDMLD TGIDVPEIVN LVFAKPVKSK VKFWQMIGRG
TRLCPGLYGY DDNGKPLDKQ KFRIFDHWGN FEYHELHTEE AEVTATKSLA QKRFEAWVML
GAAAQRKFDK QAVDLVAHQL REQINALDEK SIAVQEKWQQ KAQYSDEKVL RQLSPKTQQD
LLSVLAPLMQ WLDVRGQSDA IRFDMDILAA QTARYTNPEE LDVLWPIIVE KVERLPPHLA
QVQQQGQRIN QLRDLSWWKQ ASLEELEDIR IHLRGIMHLM EKDATPKFGS IQVDITEDAN
LIQTETRKTN IRSIDFKLYR QQVQGALEPL FQQNPVLKKI RNGEPVTQNE LDELAKLVLI
QNPNVDIRAL KEFYPQATAS LDKLLRTIIG MDSDAVEVRF AQFAADNSLT SQQLRFLSLL
KNHIRDYGTI EMRQLFEQPF THIHNEGVTG VFPDIAQIAR LQKIVEELGV VTDAATV