Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4880 |
Symbol | |
ID | 6482757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4749600 |
End bp | 4751315 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 642740091 |
Product | putative type I restriction-modification system S subunit |
Protein accession | YP_002043768 |
Protein GI | 194442247 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 0.960431 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGTGG AAAAACTGAT TGTTGACCAT ATCGACACCT GGACCACGGC GCTGCAAACC CGTTCCACGG CGGGGCGCGG TAGCTCGGGT AAGATTGACC TGTACGGCAT TAAGAAGCTG CGCGAGCTGA TTCTGGAGCT GGCGGTGCGC GGCAAGCTGG TGCCGCAGGA TCCGAACGAT GAACCGGCGT CGGTGCTGCT GAAACGCATT GCGGCGGAAA AAGCCGAGCT GGTGAAGCAG GGGAAAATTA AAAAGCAAAA GCCGCTGCCG GAGATTAGCG AGGAGGAGAA ACCGTTTGAG CTGCCGATGG GGTGGGAATG GACGAGGTTA GGATCTATTT CAAACTATGG TTTTTGTGAT AAAGCAGAAC CTGAAGACGT AACACCTGAA ACATGGATCC TTGAATTAGA GGATATAGAG AAAGTCACAT CAAAGCTTAT CAATAAGGTA ACTTTTGCAG AAAGACCGTT TAAAAGCTCT AAGAATCGAT TCTCACAAGG TGATGTACTT TATGGAAAAT TACGTCCGTA CCTGGATAAA GTGATCGTTG CTAATGAACC GGGTGTATGT ACTACTGAAA TTATCCCAAT AACAAGCTAT GGTAATATTT ACCCAGAGTT CTTACGTCTA TTGCTGAAAG CACCAAATTT CATTATTTAT GCAAATAGCT CTACACATGG AATGAACTTG CCAAGGCTTG GTACAGAAAA AGCTCAGCAG GCTGTCATCG AATTAGCTCC TATCCAGGAG CAACTGCGAA TTGTTTCACG TGTTGATAAA CTCATGTCCC TCTGCGATCA ACTGGAACAG CACTCCCTGA CCAGTCTGGA TGCCCATCAA CAGCTGGTAG AAACCCTGCT AACCACGCTG ACCGACAGCC AGAACGCCGA TGAACTGGCC GAAAACTGGG CGCGTATCAG CGAGCATTTC GACACGCTGT TTACCACCGA AGCCAGTATT GCCGCCTTAA AACAGACCAT TCTGCAACTG GCGGTGATGG GCAAACTAGT GCCGCAGGAT CCGAACGATG AACCGGCCTC TGAACTGCTC AAACGTATTG CGCAGGAAAA AGCGCAGTTG GTAAAAGACG GGAAAATGAA AAAACAAAAA CCGTTGCCAC CGATTAGCGA TGAGGAAAAA CCGTTTGAAT TGCCAATAGG TTGGGAATGG TGTCGTTTAG GTGAATGCAT CAACCTAATT TCTGGACAGC ACCTGAAACC AGATGAATAT GAAGAAGAGT GCCATGGTGA AATGCTTCCT TATATTACTG GACCGGCCGA ATTTGGACTA ATCAGCCCAA CTTATTCCAA ATATACAAAT GAAAAAAGGG CTATTGCTGC TAAGGGCGAC ATTCTAATTA CATGTAAAGG CGCAGGGCTT GGAAAGCTTA ACGTCGCTGA TACCAATATA GCCATTAGTC GTCAACTAAT GGCTATTAAT GTCATTAGGA TGAATTCAGA ATATCTTAAA ATTATACTTG ATAGCATGTA TGGTTATTTT CAATCTAAAG GGGTTGGTAT AGCTATACCT GGAATATCAC GAGAAGATGT GATGGAGCCA TTAATTATGC TTCCTCCATT CGAAGAACAA AAAAGGATAA TGGAAAACTT ATATAAATTA AATTTTTTTA TCGAAGATAT AAAATTCAGG ATTAAATCCG CCCAACAAAC CCAGCTCCAC CTGGCCGACG CCCTTACCGA CGCCGCCATC AATTAA
|
Protein sequence | MAVEKLIVDH IDTWTTALQT RSTAGRGSSG KIDLYGIKKL RELILELAVR GKLVPQDPND EPASVLLKRI AAEKAELVKQ GKIKKQKPLP EISEEEKPFE LPMGWEWTRL GSISNYGFCD KAEPEDVTPE TWILELEDIE KVTSKLINKV TFAERPFKSS KNRFSQGDVL YGKLRPYLDK VIVANEPGVC TTEIIPITSY GNIYPEFLRL LLKAPNFIIY ANSSTHGMNL PRLGTEKAQQ AVIELAPIQE QLRIVSRVDK LMSLCDQLEQ HSLTSLDAHQ QLVETLLTTL TDSQNADELA ENWARISEHF DTLFTTEASI AALKQTILQL AVMGKLVPQD PNDEPASELL KRIAQEKAQL VKDGKMKKQK PLPPISDEEK PFELPIGWEW CRLGECINLI SGQHLKPDEY EEECHGEMLP YITGPAEFGL ISPTYSKYTN EKRAIAAKGD ILITCKGAGL GKLNVADTNI AISRQLMAIN VIRMNSEYLK IILDSMYGYF QSKGVGIAIP GISREDVMEP LIMLPPFEEQ KRIMENLYKL NFFIEDIKFR IKSAQQTQLH LADALTDAAI N
|
| |