Gene SNSL254_A4880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4880 
Symbol 
ID6482757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4749600 
End bp4751315 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content45% 
IMG OID642740091 
Productputative type I restriction-modification system S subunit 
Protein accessionYP_002043768 
Protein GI194442247 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value0.960431 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTGG AAAAACTGAT TGTTGACCAT ATCGACACCT GGACCACGGC GCTGCAAACC 
CGTTCCACGG CGGGGCGCGG TAGCTCGGGT AAGATTGACC TGTACGGCAT TAAGAAGCTG
CGCGAGCTGA TTCTGGAGCT GGCGGTGCGC GGCAAGCTGG TGCCGCAGGA TCCGAACGAT
GAACCGGCGT CGGTGCTGCT GAAACGCATT GCGGCGGAAA AAGCCGAGCT GGTGAAGCAG
GGGAAAATTA AAAAGCAAAA GCCGCTGCCG GAGATTAGCG AGGAGGAGAA ACCGTTTGAG
CTGCCGATGG GGTGGGAATG GACGAGGTTA GGATCTATTT CAAACTATGG TTTTTGTGAT
AAAGCAGAAC CTGAAGACGT AACACCTGAA ACATGGATCC TTGAATTAGA GGATATAGAG
AAAGTCACAT CAAAGCTTAT CAATAAGGTA ACTTTTGCAG AAAGACCGTT TAAAAGCTCT
AAGAATCGAT TCTCACAAGG TGATGTACTT TATGGAAAAT TACGTCCGTA CCTGGATAAA
GTGATCGTTG CTAATGAACC GGGTGTATGT ACTACTGAAA TTATCCCAAT AACAAGCTAT
GGTAATATTT ACCCAGAGTT CTTACGTCTA TTGCTGAAAG CACCAAATTT CATTATTTAT
GCAAATAGCT CTACACATGG AATGAACTTG CCAAGGCTTG GTACAGAAAA AGCTCAGCAG
GCTGTCATCG AATTAGCTCC TATCCAGGAG CAACTGCGAA TTGTTTCACG TGTTGATAAA
CTCATGTCCC TCTGCGATCA ACTGGAACAG CACTCCCTGA CCAGTCTGGA TGCCCATCAA
CAGCTGGTAG AAACCCTGCT AACCACGCTG ACCGACAGCC AGAACGCCGA TGAACTGGCC
GAAAACTGGG CGCGTATCAG CGAGCATTTC GACACGCTGT TTACCACCGA AGCCAGTATT
GCCGCCTTAA AACAGACCAT TCTGCAACTG GCGGTGATGG GCAAACTAGT GCCGCAGGAT
CCGAACGATG AACCGGCCTC TGAACTGCTC AAACGTATTG CGCAGGAAAA AGCGCAGTTG
GTAAAAGACG GGAAAATGAA AAAACAAAAA CCGTTGCCAC CGATTAGCGA TGAGGAAAAA
CCGTTTGAAT TGCCAATAGG TTGGGAATGG TGTCGTTTAG GTGAATGCAT CAACCTAATT
TCTGGACAGC ACCTGAAACC AGATGAATAT GAAGAAGAGT GCCATGGTGA AATGCTTCCT
TATATTACTG GACCGGCCGA ATTTGGACTA ATCAGCCCAA CTTATTCCAA ATATACAAAT
GAAAAAAGGG CTATTGCTGC TAAGGGCGAC ATTCTAATTA CATGTAAAGG CGCAGGGCTT
GGAAAGCTTA ACGTCGCTGA TACCAATATA GCCATTAGTC GTCAACTAAT GGCTATTAAT
GTCATTAGGA TGAATTCAGA ATATCTTAAA ATTATACTTG ATAGCATGTA TGGTTATTTT
CAATCTAAAG GGGTTGGTAT AGCTATACCT GGAATATCAC GAGAAGATGT GATGGAGCCA
TTAATTATGC TTCCTCCATT CGAAGAACAA AAAAGGATAA TGGAAAACTT ATATAAATTA
AATTTTTTTA TCGAAGATAT AAAATTCAGG ATTAAATCCG CCCAACAAAC CCAGCTCCAC
CTGGCCGACG CCCTTACCGA CGCCGCCATC AATTAA
 
Protein sequence
MAVEKLIVDH IDTWTTALQT RSTAGRGSSG KIDLYGIKKL RELILELAVR GKLVPQDPND 
EPASVLLKRI AAEKAELVKQ GKIKKQKPLP EISEEEKPFE LPMGWEWTRL GSISNYGFCD
KAEPEDVTPE TWILELEDIE KVTSKLINKV TFAERPFKSS KNRFSQGDVL YGKLRPYLDK
VIVANEPGVC TTEIIPITSY GNIYPEFLRL LLKAPNFIIY ANSSTHGMNL PRLGTEKAQQ
AVIELAPIQE QLRIVSRVDK LMSLCDQLEQ HSLTSLDAHQ QLVETLLTTL TDSQNADELA
ENWARISEHF DTLFTTEASI AALKQTILQL AVMGKLVPQD PNDEPASELL KRIAQEKAQL
VKDGKMKKQK PLPPISDEEK PFELPIGWEW CRLGECINLI SGQHLKPDEY EEECHGEMLP
YITGPAEFGL ISPTYSKYTN EKRAIAAKGD ILITCKGAGL GKLNVADTNI AISRQLMAIN
VIRMNSEYLK IILDSMYGYF QSKGVGIAIP GISREDVMEP LIMLPPFEEQ KRIMENLYKL
NFFIEDIKFR IKSAQQTQLH LADALTDAAI N