Gene SNSL254_A4881 details


Gene Information

Locus tag: SNSL254_A4881
Symbol:
ID: 6482369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 4751315
End bp: 4752784
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 57%
IMG OID: 642740092
Product: type I restriction-modification system M subunit
Protein accession: YP_002043769
Protein GI: 194443867
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID:
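
The Start bp, End bp, Gene Length and Protein Length fields above agree with one another. The sketch below checks that arithmetic. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; neither is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 4751315
END_BP = 4752784


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming one terminal stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1470
print(protein_length(length_bp))  # 489
```

Both results match the Gene Length (1470 bp) and Protein Length (489 aa) fields.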


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value: 0.900731
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAATCA GCTCGGCTAT CAAATCATTA CAAGACATCA TGCGCAAAGA TGCAGGCGTC 
GATGGCGACG CACAGCGTCT GGGTCAGCTC TCCTGGTTAC TGTTCCTGAA GATCTTCGAC
GCCCAGGAAC AGGCGCTGGA AATCGAGCAG GAAAAATACA GGCTGCCGAT GCCGGAGCGC
TACCTGTGGC GCAACTGGGC GGCCGATAAC GAAGGGATCA CCGGCGATAA ACTGCTGGCG
TTCGTCAACG ACGACCTGTT TCCGACGCTA AAAGATCTGC CCGCGCAGAT CGATATCAAC
CCGCGCGGCT ATGTGGTTAA GCAGGCATTC AGCGATGCCT ATAACTACAT GAAAAACGGT
ACCCTGCTGC GTCAGGTGAT CAACAAGCTG AACGAGATCG ATTTTACCCG CGCCAGCGAA
CGTCATCTGT TCGGCGATAT CTACGAACAG ATCCTCCGGG ATTTACAGGC CGCAGGCAAC
GCGGGTGAAT TCTACACCCC ACGCGCCGTG ACCCGTTTTA TGGTCGAGCG CGTCGATCCG
AAACTCGGCG AATCGATTAT GGATCCGGCC TGCGGCACCG GCGGCTTCCT CGCCTGCGCG
TTTGATCATG TTAAAAATCA TTATGCGCAC ACGGTCACCG ACCACCAGAT CCTGCAAAAA
CAGATCCACG GCGTGGAGAA GAAACAGCTA CCGCATCTGC TTTGCACCAC CAATATGCTG
CTGCACGGTA TCGAAGTGCC AGTACAGATC CGCCACGATA ACACGCTGAA CAAACCGCTC
TCCTCCTGGG ACGAGCAGAT GGATGTGATT ATCACCAACC CGCCGTTCGG CGGCACCGAA
GAAGACGGCA TCGAGAAGAA CTTCCCTTCC GATATGCAGA CCCGCGAAAC GGCGGATCTG
TTCCTGCAAC TGATCATTGA AGTGCTGGCG AAAAACGGCC GCGCGGCGGT GGTGCTGCCG
GACGGCACGC TGTTTGGCGA AGGCGTGAAA ACCAAGATCA AAAAGCTGCT CACCGAGGAG
TGCAACCTGC ACACCATCGT GCGTCTGCCG AACGGCGTGT TTAACCCGTA CACCGGGATC
AAAACTAACC TGCTGTTCTT TACCAAAGGT CAGCCGACCA AAGAGATCTG GTTCTACGAG
CACCCGCACC CGGCGGGCGT GAAGAACTAC AGCAAAACCA AGCCGATGAA GTTTGAAGAG
TTCCAGGCCG AGATCGACTG GTGGGGCAAC GAAGCCGACG GCTTCGCCAG CCGCGTCGAG
AACGAACAGG CGTGGAAGGT CAGCATCGAC GAGGTGATTG CCCGCAACTT TAACCTCGAT
ATCAAAAACC CGCACCAGGC GGAGACCGTC AGCCACGATC CGGACGAACT GCTGGCGCAG
TACGCGAAGC AGCAGGAAGA AATCCAGACC CTGCGCCACC AGCTGCGCGA TATTCTCGGC
GCGGCGCTCT CCGGCAAGGA GGCCAACTGA
 
Protein sequence
MSISSAIKSL QDIMRKDAGV DGDAQRLGQL SWLLFLKIFD AQEQALEIEQ EKYRLPMPER 
YLWRNWAADN EGITGDKLLA FVNDDLFPTL KDLPAQIDIN PRGYVVKQAF SDAYNYMKNG
TLLRQVINKL NEIDFTRASE RHLFGDIYEQ ILRDLQAAGN AGEFYTPRAV TRFMVERVDP
KLGESIMDPA CGTGGFLACA FDHVKNHYAH TVTDHQILQK QIHGVEKKQL PHLLCTTNML
LHGIEVPVQI RHDNTLNKPL SSWDEQMDVI ITNPPFGGTE EDGIEKNFPS DMQTRETADL
FLQLIIEVLA KNGRAAVVLP DGTLFGEGVK TKIKKLLTEE CNLHTIVRLP NGVFNPYTGI
KTNLLFFTKG QPTKEIWFYE HPHPAGVKNY SKTKPMKFEE FQAEIDWWGN EADGFASRVE
NEQAWKVSID EVIARNFNLD IKNPHQAETV SHDPDELLAQ YAKQQEEIQT LRHQLRDILG
AALSGKEAN
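
The sequences above are printed in space-separated groups of ten characters, so they need cleaning before any computation. The following is a minimal sketch of that cleanup, with a GC percentage helper and a start/stop codon check on the first and last lines of the gene sequence. The helper names are hypothetical, and the stop codon set is the one used by translation table 11.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11

# First and last lines of the gene sequence as printed in the record.
first_line = "ATGTCAATCA GCTCGGCTAT CAAATCATTA CAAGACATCA TGCGCAAAGA TGCAGGCGTC"
last_line = "GCGGCGCTCT CCGGCAAGGA GGCCAACTGA"

print(clean_sequence(first_line)[:3])                 # ATG
print(clean_sequence(last_line)[-3:] in STOP_CODONS)  # True
print(gc_percent("ATGC"))                             # 50.0
```

Passing the complete, joined gene sequence to `gc_percent` should give a value comparable to the GC content field, although the record does not say how that figure was rounded.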