Gene SbBS512_E4809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4809 
Symbol 
ID6270323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4480191 
End bp4481825 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content51% 
IMG OID641728551 
ProductN-6 DNA methylase 
Protein accessionYP_001882946 
Protein GI187732102 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.136314 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTACTG GCGACTTAAA AAGTAAAATC GACGGACTAT GGGAAGATTT CTGGGTGGGT 
GGTATCACCA ACCCGCTGAC CGTAATCGAA CAGATCACCT ATCTGATGTA TTCCCGGATG
CTGGATACCC AGGAACAACG CGACGAAAAG CGTAAACAAA TCGCGGGTAT TGATTTTAAA
CCGCGTTTTG CGCCAGAACA GCAGGAGTTC CGTTTCAGTC ACTATAGCAA CCTTGGCTCG
GATGAGATGA TGGAAGTGGT GCGCGATGGC GTATTCCAGC ATTTCCGTCA GCTCGGCCAG
GCCGATGCTT CGAAGGTGAC GCTGCTGGGC AACTTTATGA AAGATGCTCG TCTGGAGATT
GTTAAGCCGT CATTGCTGAC CAAAGCGGTT GAGGTAATCA AAAACCTGCC ATTGGATCGC
GGCGACACCA AAGGCGACCT TTACGAATAC CTGTTAAGCA AGCTGACAAC TGCCGGAATC
AACGGACAGT TCCGCACACC GCGCCACATT ATCCGCACGA TGGTTGAAAT GATGGAGCCG
AACCCGGCCC GCGGCGAGAC GATTTGCGAT CCCGCCTGTG GCACCGGTGG TTTTCTGGCA
ACCAGCTATG AATATCTGCT GGAGAAGTAC AGCTCGCTGG AATCCATTCA TACTGAGATT
GGCACCAACG AACGTGGCGA GCTGGAAGAG CAAAAAATCT TTACCGGCGA TCTGCTGACA
CCGTGGCGTA ACCATGTGGA TAACAACATG TTCCACGGTT ACGACTTTGA CACCACAATG
CTACGTATCG CCGCCATGAA CCTGATTATG CACGGCGTGG ATGCGCCTGA TATCCACTAT
CAGGACACAA TGAGCCAGAG TTTCAGCACA AACTTCCCGC AGGCCAGTAA AAACGCCTTC
AACCTGATTC TGGCGAACCC GCCGTTTACC GGTTCACTGG ACGAGGAAGA TATCGACTCC
ACGCTGTCGG CAATGGTGAA AACCAAAAAA ACCGAACTAC TGTTCCTGGC GCGTATTCTG
CAAATGCTGA AAGTGGGCGG GCGCAGTGCC ACTATCGTGC CGCAGGGCGT GCTGTTTGGC
TCTAGCAAGG CGCACCAGTC ACTTCGCAAA ACGCTGGTGG AAGATAACCA ACTGGAAGCG
GTGATCAATC TGCCTTCTGG TGTATTTAAA CCTTACGCTG GCGTGGCGAC GGCGATCTTG
ATCTTTACCA AAGGCGGTCA AACGGATGAG GTCTGGTTCT ACGATCTACA AAATGACGGC
TACAGCCTGG ATGATAAGCG CAACCCGATA AAAGACAACG ATCTGCCGCA TCTGCTGGCA
AGCTGGAAGC ATTACCGTAC TTTACGCGGG CTACCGGTTG ATAACTTTAT GGGTAAGAAG
TTAGCCTCGT TGCTTAAACA GCAGTACCCG GAAGGGATTA ATGCTGGCGT TGATTTTAAA
GATCGCACGC AGGCGGCGTT TGTTGTACCG AAAGCGGATA TTGCTGCGCA GAAATACGAT
CTATCCATCA ACCGTTATAA AGAAGTCGTG TATCAGGCGG AGGAATATGA AGATCCGAAG
GTGATATTGA AGCGGTTAAA GGATCTGGAA AAAGAGATTC TGGCGGATTT GGATGAGCTG
GAGGGGATGC TGTGA
 
Protein sequence
MITGDLKSKI DGLWEDFWVG GITNPLTVIE QITYLMYSRM LDTQEQRDEK RKQIAGIDFK 
PRFAPEQQEF RFSHYSNLGS DEMMEVVRDG VFQHFRQLGQ ADASKVTLLG NFMKDARLEI
VKPSLLTKAV EVIKNLPLDR GDTKGDLYEY LLSKLTTAGI NGQFRTPRHI IRTMVEMMEP
NPARGETICD PACGTGGFLA TSYEYLLEKY SSLESIHTEI GTNERGELEE QKIFTGDLLT
PWRNHVDNNM FHGYDFDTTM LRIAAMNLIM HGVDAPDIHY QDTMSQSFST NFPQASKNAF
NLILANPPFT GSLDEEDIDS TLSAMVKTKK TELLFLARIL QMLKVGGRSA TIVPQGVLFG
SSKAHQSLRK TLVEDNQLEA VINLPSGVFK PYAGVATAIL IFTKGGQTDE VWFYDLQNDG
YSLDDKRNPI KDNDLPHLLA SWKHYRTLRG LPVDNFMGKK LASLLKQQYP EGINAGVDFK
DRTQAAFVVP KADIAAQKYD LSINRYKEVV YQAEEYEDPK VILKRLKDLE KEILADLDEL
EGML