Gene SeAg_B4822 details


Gene Information

Locus tag: SeAg_B4822
Symbol:
ID: 6797320
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 4702862
End bp: 4705924
Gene Length: 3063 bp
Protein Length: 1020 aa
Translation table: 11
GC content: 46%
IMG OID: 642778887
Product: type I site-specific deoxyribonuclease, HsdR family
Protein accession: YP_002149448
Protein GI: 197250570
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
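
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4702862, 4705924)
print(length, protein_length(length))  # prints: 3063 1020
```

Both values agree with the Gene Length and Protein Length fields in the record.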


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.5346
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTTTAACG AACAAACCGT CACAGAAAAT GGAATTATCG ATCGCTTAAA AGGGTTGAGT 
GGGGTCAAGT GGTCTTACTG CCACGGTGAA AAACTGCCTA AAAAAGCGCA GGATATCTTC
GTTGATGAGT GGCTAAAAGA CGCGCTTTGT TCGTTAAACC CTGATATCGG CAGACAGCCT
GACTATGCCG ATGAGGTCAT TTACAAACTT CGGGGAGTGG TTTTAGAAGC CAGACATACC
GGTTTAGTAA AGGCCAATGA AAACTTCCAG GAATGGTTGA TGGCAGACAA AACCTTGCCC
TTTGGCGAAA ATGGCGACCA TATCACCATC AATCTGATTG ATTTTGAGAA CATAGAGAAC
AACCATTTCG TGGTGGCACA GCAGGTTCAC TATATCGCGG CCACTGAAGT CTATTTTGAT
ATTGTCCTTT ATGTGAACGG TATACCACTA GTTGTAGGCG AAGTTAAAAC CGCAACACGT
CCTAGCGTAA CCTGGCAAGA CGGTGCCGCC GACTTTATGG GCGGCAAAAA GTACTACTGG
AAAAATGTTG AATCTTTCTT TGTACCTAAC TTGCTGTGTT TTGCCAGCGA AGGTAAAACC
TTTGCCTATG GTGCCATTAA TGCCCGAGTT AAAGATTGGG GACCATGGCA TAATACTGAG
CTTCGTGATG AGATTCTGCC GGGCCTGGCA TCGGTACTCA ACAGCTGCGA GGGTTTATTA
AATCCGCAGA CCTTATTGCA ACTACTGGAA TCTTTTGCAT TATTTTCAAC CGTCAAAACA
GGCAAAAATA CTCCACCTAA ACGCGTTAAA ATCCTGCCGC GTTATCCGCA ATTTGAAGCA
GCAAAACAAA TCGTTGAGCG TGTTCGCAGA GGTTATCCAA AAAAAGGTCT GATTTGGCAT
TTCCAGGGAT CGGGTAAATC CTTGTTGATG CTTTACGCCG CTAAAATGCT GCGTGCCGAT
AATGCGTTAA AAAACCCGAC GGTACTTATC GTTGTGGACC GGCGAGATCT GGACAGCCAA
ATTAATGAAA CTTTTGGTGG GGCCGACGTT AAGAATCTGA TTAAAGTGCA AAGCTGCAAA
AAGTTGGGTG AATACATTGA GCAAGACAGC CGTGGCATTT TAATCACCAC CATCTTTAAA
TTTAAAGATA TCGAAATTGA CGATAGCAAC CCTAATGGCC TGAACAACCG TGACAACATC
ATTGTGTTGG TTGATGAAGC CCACCGTACC CAGGAAGGTG GGTTAGGTGA GAAAATGCGT
TGGGCGTTGC CTAATGCGCA CTTCTATGGC CTGACTGGCA CACCGATTTC TGGCATTGAT
CGTAATACAT TCAAGTTGTT CGGTGCTGAA GAAGATCCTG GTCGCTATAT GAGCCGCTAT
AGCTATAAGC AGTCGATCCG TGACGGAGCG ACTAACCCAG TGAAGTTCGA ACCTCGGTTG
GCTGAACTCC GAGTGGATCG TGATGCTATC AACGAAGAGT TCGAGCAACT GGCCACCGAA
AACAACCTGG ATGAAGAAGA AAAAGCAGCA TTGTCCAGAC GCGCTGGCAA GCTGGCTATT
ATGCTGAAAT CCCCCAGACG CATGGCTGCG GTGAGTAATG ATATTGTTGA GCATTTTACC
AGCCATGTTA TGCCGAAAAA GATGAAAGGC ATGGTTGTGG TATACGATCG TGAAGCCTGT
GTGCAGATGT ATTATTTGCT CGGTGAAAAG CTCGGTTTTG ATGCGGTTGA AGTGGTTATG
AACGTTGACC AGGCTCCGGT TAAAGTTGAA GAGGGTGGCA AAAAAGATAA GCTTAATAAG
GACTGGCTCA AATGGCATGA CGAATTGGAG CTACCTATTA AACAAGCCGA TTTCGAACGC
TGGCAGCACA TTGATGCGGA AGAGCAAGTG CAAAAGGATC TGATTGAATG CTTTAAAGAT
CCTGTGCATC CGTTACAGCT TATCATCGTT ACCGCCAAGC TGCTTACGGG CTTTGATGCG
CCAATTTGCT ATTGCATGTA CCTCGATAAG CCTCTACGCG ATCATACTCT TCTTCAGGCC
ATGTGCCGAA CCAACCGGTT GTACGAAACC GATGATGTGC GCAAGGACAT GGGATTAATT
GTCGACTACC TCGGCGTTTT CGAAAATCTG CGTACTGCTC TGGCCTATAA CCCTGAAGAA
ATTGAAGGGG TTGTAGAGGG GATCGAGGCA TTTAAAGAGC TATTGCCGTT ACAACTGAAT
AAATGTCTGG CCTTCTTCCC TGGTATAGAT CGTTCTATAG AAGGTTTTGA AGGCATTATG
GCTGCGCAGG AATGCCTGCC AACCAACGAG AAACGCGATG AATTCGCTGC CAGCTTTGGT
GTGTTATCCA AGTTATGGTC AGCTATCAAC CCAGATCCTT TTTTGACTCC TTATCGTCAG
GACTATAAAT GGCTAGCGCA AATTTATGAA TCGGTGCGTC CGGTGGGGCA GACAGGTGCG
CTTGTTTGGG CAGCCCTTGG CCCCGAAACA ATTAAGATGA TCCATGAGCA TACTGATATC
AATCGTATTC GCGATGATAT TGACGAGTTG ATCATGGATG AGCACGCAAT TTTTACCCTG
ACGGTTAAAG AACAGGAACA ACGCGCAAAA CGCCTGGAGA TTGATCTTAT GGGGCGTTTA
CGCGGTAGCC ATGATCCTAA ATTTGTGGCA TTGGGTGAGC GTCTGGAAAA ACTGCGTCAG
GACTATGAAG CTGGTGTTAT CAAGGCCATC GACTGGTTGA AAGGCCTGTT GGATGCTGCG
AAAGATACCG TACAAGCCGA GCGCGAAACG GGCGAGCGCC CAGTAACTGA AGCTGATAAT
AAACAGGCTT TGACCAAACT TTTCCTCGAA ACACGCCCAG AAACTACCCC TAAGTTAATC
GGCGACGTTG TTGAGCAAAT CGATAAAATC GTCAAAGCAA CTCGCTTTGA TGGCTGGCAA
AGTTCCCACA GCGGTCCACG CGAAATTCAA AAGGCGCTTT TGCTCACTCT GGCTCAATTT
GGATTAGGTA AAGATAAAGA GTTATTCCAG AAGGCTTACG GGTATATCGA GGAGCATTAC
TGA
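
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. A minimal sketch of that clean-up, with a GC-fraction helper whose result on the full sequence could be compared against the GC content field in the record; the helper names are hypothetical, and only the first printed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence, as an example input.
first_line = "ATGTTTAACG AACAAACCGT CACAGAAAAT GGAATTATCG ATCGCTTAAA AGGGTTGAGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 3))
```

Pasting the complete gene sequence in place of `first_line` gives the whole-gene figure.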
 
Protein sequence
MFNEQTVTEN GIIDRLKGLS GVKWSYCHGE KLPKKAQDIF VDEWLKDALC SLNPDIGRQP 
DYADEVIYKL RGVVLEARHT GLVKANENFQ EWLMADKTLP FGENGDHITI NLIDFENIEN
NHFVVAQQVH YIAATEVYFD IVLYVNGIPL VVGEVKTATR PSVTWQDGAA DFMGGKKYYW
KNVESFFVPN LLCFASEGKT FAYGAINARV KDWGPWHNTE LRDEILPGLA SVLNSCEGLL
NPQTLLQLLE SFALFSTVKT GKNTPPKRVK ILPRYPQFEA AKQIVERVRR GYPKKGLIWH
FQGSGKSLLM LYAAKMLRAD NALKNPTVLI VVDRRDLDSQ INETFGGADV KNLIKVQSCK
KLGEYIEQDS RGILITTIFK FKDIEIDDSN PNGLNNRDNI IVLVDEAHRT QEGGLGEKMR
WALPNAHFYG LTGTPISGID RNTFKLFGAE EDPGRYMSRY SYKQSIRDGA TNPVKFEPRL
AELRVDRDAI NEEFEQLATE NNLDEEEKAA LSRRAGKLAI MLKSPRRMAA VSNDIVEHFT
SHVMPKKMKG MVVVYDREAC VQMYYLLGEK LGFDAVEVVM NVDQAPVKVE EGGKKDKLNK
DWLKWHDELE LPIKQADFER WQHIDAEEQV QKDLIECFKD PVHPLQLIIV TAKLLTGFDA
PICYCMYLDK PLRDHTLLQA MCRTNRLYET DDVRKDMGLI VDYLGVFENL RTALAYNPEE
IEGVVEGIEA FKELLPLQLN KCLAFFPGID RSIEGFEGIM AAQECLPTNE KRDEFAASFG
VLSKLWSAIN PDPFLTPYRQ DYKWLAQIYE SVRPVGQTGA LVWAALGPET IKMIHEHTDI
NRIRDDIDEL IMDEHAIFTL TVKEQEQRAK RLEIDLMGRL RGSHDPKFVA LGERLEKLRQ
DYEAGVIKAI DWLKGLLDAA KDTVQAERET GERPVTEADN KQALTKLFLE TRPETTPKLI
GDVVEQIDKI VKATRFDGWQ SSHSGPREIQ KALLLTLAQF GLGKDKELFQ KAYGYIEEHY
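
The protein sequence corresponds to the gene sequence translated with translation table 11. As a sketch, the translation can be reproduced with a hand-built codon table; the amino-acid string below is the NCBI ordering for bases T, C, A, G, which for table 11 assigns the same amino acids as the standard code (the tables differ in permitted start codons, which this sketch ignores). The function name is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI table 11 assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTTTAACG AACAAACCGT CACAGAAAAT"))  # prints: MFNEQTVTEN
```

The output matches the first ten residues of the protein sequence listed above.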