Gene SeAg_B4820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B4820 
Symbol 
ID6797221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp4700220 
End bp4701710 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content46% 
IMG OID642778885 
ProductN-6 DNA methylase 
Protein accessionYP_002149446 
Protein GI197249396 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATA AAAAGCTGGA AGAGCTGCTC TGGGGGGCCG CCGAATTTCT TCGTGGCCAA 
ATTGACGCAT CAGACTACAA GCAGTATATC TTCCCGTTGC TGTTTTACAA ACGCCTGTCA
GATGTCTATC TGGAAGAATA TAATGAGGCG ATGGAGCTCC ATGAAGGCGA TGCCGAATAT
GCCGCCATGC CGATGTTTCA CCGTTTTAAC ATTCCCTCTG AGGCAGCTTG GGAAAAGGTC
CGCAATACCA GTAAAAACAT TGGCGAAGCG ATCCAGAATG CGCTTCGACT AATTGAAGTC
AATAACCCGC GTTTACATGG CGTCTTCGGT GATGCGCAGT GGACCAATAA AGAGCGCCTG
CCCGATCATC TGCTGGCTGA TCTGATTGAA CATTTCAGTA AAATTCCGCT CGGTATTAAA
TCTGTCGCCC AGGATGATCT TGGTGAAGCC TACGAATACC TGATTAAAAA GTTCGCTGAT
GATTCCGGTC ACACGGCTGC AGAGTTCTAC ACCAACCGAA CAGTCGTGCA TTTAATGACG
CGCATTATGG GATTAAAACC GGGTGAAACC GCCTATGATC CGACGTGCGG CACTGGCGGG
ATGTTGCTAA ATGCAGTGAT GGATCTCCGT GCAAGAGGTG AAGAGTGGCG CTCAGTGCAT
CTTTATGGTC AGGAGGTGAA CCTGTTGACC TCCGCTATCG CCCGTATGAA TATGTTCCTG
CACGATATCG AAGAATTTGA TGTGCTGCGC GGTGATACTT TGGCTGAGCC AAAGTTTATT
GAAAACGATC GGCTCAAGCA GTTTGATGTG ATTTTTGCCA ACCCGCCATA CTCCATAAAA
AAATGGAATC GTGACAAGTT TGCTGCCGAT CCATATGGTC GTAATCTTTA TGGTGTACCA
CCGCAGGGCT GCGCTGATTA TGCTTTTTAT ACCCATATAA TCAAAAGTTT AAAACCAGAT
ACTGGTCGCG CCGCCATGCT CTGGCCACAT GGCGTGCTGT TCCGTGATTC AGAGCAAACC
ATTCGTAAAC AGGTGGTTGA ATCGGACATC ATTGAAGCGG TGATTGGGTT AGGCCCGAAT
TTGTTCTACA ACTCTCCGAT GGAGTCTTGC GTGGTAGTGC TTAACTGCAA TAAACCTGCT
GAGCGTAAAA ACAAGGTGTT ATTTATTAAT GGGGTGGAAC ACGTTACTCG TGAGCGCGCC
CATAGTCGCT TATCCAAAGA TGATTTGGCT GTGTTATGCG AGGCTTATTT TAGCCCTGAA
AACCAAAATA ATATCACTGC ACTGGTGGAT ATCGACGCTA TTAAAGGGAA TCTCTACAAC
CTGTCGATCC CGCTGTATGT GCAAGCGCAA CAAAACGGTA AAGTACATAA TATTGAACAT
GCGATTGAAG CGTGGAAAGT AAGCCGTATA CAGTTGAAAA AACAAACTAA TAAATTATTC
CAAAGCCTTG CGGAGCTTGG GTATAATGTT CAAAGCAAGG TGGGGCAGTA A
 
Protein sequence
MSNKKLEELL WGAAEFLRGQ IDASDYKQYI FPLLFYKRLS DVYLEEYNEA MELHEGDAEY 
AAMPMFHRFN IPSEAAWEKV RNTSKNIGEA IQNALRLIEV NNPRLHGVFG DAQWTNKERL
PDHLLADLIE HFSKIPLGIK SVAQDDLGEA YEYLIKKFAD DSGHTAAEFY TNRTVVHLMT
RIMGLKPGET AYDPTCGTGG MLLNAVMDLR ARGEEWRSVH LYGQEVNLLT SAIARMNMFL
HDIEEFDVLR GDTLAEPKFI ENDRLKQFDV IFANPPYSIK KWNRDKFAAD PYGRNLYGVP
PQGCADYAFY THIIKSLKPD TGRAAMLWPH GVLFRDSEQT IRKQVVESDI IEAVIGLGPN
LFYNSPMESC VVVLNCNKPA ERKNKVLFIN GVEHVTRERA HSRLSKDDLA VLCEAYFSPE
NQNNITALVD IDAIKGNLYN LSIPLYVQAQ QNGKVHNIEH AIEAWKVSRI QLKKQTNKLF
QSLAELGYNV QSKVGQ