Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B4821 |
Symbol | |
ID | 6793305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 4701711 |
End bp | 4702859 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642778886 |
Product | type I restriction-modification system, endonuclease S subunit |
Protein accession | YP_002149447 |
Protein GI | 197249026 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAAC AGCAATTACC GGAAGGCTGG CAAATGGTGA AGTTTGGCGA TATTGCGAAA CATATCTCTA AACGTGTAGA ACCGAGCGAG ACAGACCTTA AAATCTATGT TGGCTTGGAA CACTTAGATC CTGATAGTCT GAAGATTAAG CGACATGGCG TACCGGCGGA TGTTGAAGGG CAAAAGCTGT TGGTAAAGAA AGGGCAGATC ATTTTTGGCA AGCGTCGTGC GTATCAACGT AAGGTAGCTG TGGCTGATTG GGATTGTATC TGCTCAGCGC ACGCAATGGT TCTTGAAGAA AATTCTAAAA TGGTTATTCC CGGGTTTCTT CCTTTCTTTA TGCAATCCGA TATATTTATG AATCGAGCTG TAGCTATATC GGAAGGGTCA TTATCACCAA CGATAAAGTG GAAAGTTTTA GCCGAACAAG TTTTCCTATT CCCGTCAAAA AATCGACAGC TGAAAATGTT GCCAATATTA TCATCATGTA ATTTGGCAAG TTTAAAAAAC GATGCAGCCT TAGAATCATT ACTCTTTTTT AGGAAGGTTA TTTTTAGGGA ACATATATCT AAGCTAATAA TCCGTCATAA TGTTTCTAGA GAAAAACTAG GTGATGTTTG TCGAATCAGT ACAGGGAAGA CGCCTCCTCC TAATGAACGT GAATACTGGG AGGGGGATAT TCCGTTTATA ACTCCGGGTG ATATTAGCTC TGATTCATTA TATATTAATA GCGGCGAGCG TAATATTACT CATAAAGGGC TAGAAAAAAC TCCATCTGTG CCTAAAGGAT CGGTTTTACT GACTTGTATA GGTTCAACAA TAGGAAAAGC AGCAATTGCA TCGTGTGATT TATCTACTAA CCAGCAGATA AATTCATTAA TTTGTAGTGA AAAAATATTG CCAGAATACC TTATTGTTTG GATTCAGAAT AATTTGGAAG TTATTAAAAA ATACACAGGC ATTCAGGCCG TCCCTATTAT TAATAAATCC ACGTTAGCAA ATATTGATGT CGATGTTCCT TTTCTCGAAG AACAGTTAAA GCTTGTGATG GTAGTTAGAG AAATGGATAG CTTGAGACAT AAATTGAAAA AAAAAGGTGT AATTCTTACA AATTTAACAA AATCTCTCTT TGTTCAGGAT AGTGATTAA
|
Protein sequence | MTEQQLPEGW QMVKFGDIAK HISKRVEPSE TDLKIYVGLE HLDPDSLKIK RHGVPADVEG QKLLVKKGQI IFGKRRAYQR KVAVADWDCI CSAHAMVLEE NSKMVIPGFL PFFMQSDIFM NRAVAISEGS LSPTIKWKVL AEQVFLFPSK NRQLKMLPIL SSCNLASLKN DAALESLLFF RKVIFREHIS KLIIRHNVSR EKLGDVCRIS TGKTPPPNER EYWEGDIPFI TPGDISSDSL YINSGERNIT HKGLEKTPSV PKGSVLLTCI GSTIGKAAIA SCDLSTNQQI NSLICSEKIL PEYLIVWIQN NLEVIKKYTG IQAVPIINKS TLANIDVDVP FLEEQLKLVM VVREMDSLRH KLKKKGVILT NLTKSLFVQD SD
|
| |