Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B4822 |
Symbol | |
ID | 6797320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 4702862 |
End bp | 4705924 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 642778887 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_002149448 |
Protein GI | 197250570 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.5346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAACG AACAAACCGT CACAGAAAAT GGAATTATCG ATCGCTTAAA AGGGTTGAGT GGGGTCAAGT GGTCTTACTG CCACGGTGAA AAACTGCCTA AAAAAGCGCA GGATATCTTC GTTGATGAGT GGCTAAAAGA CGCGCTTTGT TCGTTAAACC CTGATATCGG CAGACAGCCT GACTATGCCG ATGAGGTCAT TTACAAACTT CGGGGAGTGG TTTTAGAAGC CAGACATACC GGTTTAGTAA AGGCCAATGA AAACTTCCAG GAATGGTTGA TGGCAGACAA AACCTTGCCC TTTGGCGAAA ATGGCGACCA TATCACCATC AATCTGATTG ATTTTGAGAA CATAGAGAAC AACCATTTCG TGGTGGCACA GCAGGTTCAC TATATCGCGG CCACTGAAGT CTATTTTGAT ATTGTCCTTT ATGTGAACGG TATACCACTA GTTGTAGGCG AAGTTAAAAC CGCAACACGT CCTAGCGTAA CCTGGCAAGA CGGTGCCGCC GACTTTATGG GCGGCAAAAA GTACTACTGG AAAAATGTTG AATCTTTCTT TGTACCTAAC TTGCTGTGTT TTGCCAGCGA AGGTAAAACC TTTGCCTATG GTGCCATTAA TGCCCGAGTT AAAGATTGGG GACCATGGCA TAATACTGAG CTTCGTGATG AGATTCTGCC GGGCCTGGCA TCGGTACTCA ACAGCTGCGA GGGTTTATTA AATCCGCAGA CCTTATTGCA ACTACTGGAA TCTTTTGCAT TATTTTCAAC CGTCAAAACA GGCAAAAATA CTCCACCTAA ACGCGTTAAA ATCCTGCCGC GTTATCCGCA ATTTGAAGCA GCAAAACAAA TCGTTGAGCG TGTTCGCAGA GGTTATCCAA AAAAAGGTCT GATTTGGCAT TTCCAGGGAT CGGGTAAATC CTTGTTGATG CTTTACGCCG CTAAAATGCT GCGTGCCGAT AATGCGTTAA AAAACCCGAC GGTACTTATC GTTGTGGACC GGCGAGATCT GGACAGCCAA ATTAATGAAA CTTTTGGTGG GGCCGACGTT AAGAATCTGA TTAAAGTGCA AAGCTGCAAA AAGTTGGGTG AATACATTGA GCAAGACAGC CGTGGCATTT TAATCACCAC CATCTTTAAA TTTAAAGATA TCGAAATTGA CGATAGCAAC CCTAATGGCC TGAACAACCG TGACAACATC ATTGTGTTGG TTGATGAAGC CCACCGTACC CAGGAAGGTG GGTTAGGTGA GAAAATGCGT TGGGCGTTGC CTAATGCGCA CTTCTATGGC CTGACTGGCA CACCGATTTC TGGCATTGAT CGTAATACAT TCAAGTTGTT CGGTGCTGAA GAAGATCCTG GTCGCTATAT GAGCCGCTAT AGCTATAAGC AGTCGATCCG TGACGGAGCG ACTAACCCAG TGAAGTTCGA ACCTCGGTTG GCTGAACTCC GAGTGGATCG TGATGCTATC AACGAAGAGT TCGAGCAACT GGCCACCGAA AACAACCTGG ATGAAGAAGA AAAAGCAGCA TTGTCCAGAC GCGCTGGCAA GCTGGCTATT ATGCTGAAAT CCCCCAGACG CATGGCTGCG GTGAGTAATG ATATTGTTGA GCATTTTACC AGCCATGTTA TGCCGAAAAA GATGAAAGGC ATGGTTGTGG TATACGATCG TGAAGCCTGT GTGCAGATGT ATTATTTGCT CGGTGAAAAG CTCGGTTTTG ATGCGGTTGA AGTGGTTATG AACGTTGACC AGGCTCCGGT TAAAGTTGAA GAGGGTGGCA AAAAAGATAA GCTTAATAAG GACTGGCTCA AATGGCATGA CGAATTGGAG CTACCTATTA AACAAGCCGA TTTCGAACGC TGGCAGCACA TTGATGCGGA AGAGCAAGTG CAAAAGGATC TGATTGAATG CTTTAAAGAT CCTGTGCATC CGTTACAGCT TATCATCGTT ACCGCCAAGC TGCTTACGGG CTTTGATGCG CCAATTTGCT ATTGCATGTA CCTCGATAAG CCTCTACGCG ATCATACTCT TCTTCAGGCC ATGTGCCGAA CCAACCGGTT GTACGAAACC GATGATGTGC GCAAGGACAT GGGATTAATT GTCGACTACC TCGGCGTTTT CGAAAATCTG CGTACTGCTC TGGCCTATAA CCCTGAAGAA ATTGAAGGGG TTGTAGAGGG GATCGAGGCA TTTAAAGAGC TATTGCCGTT ACAACTGAAT AAATGTCTGG CCTTCTTCCC TGGTATAGAT CGTTCTATAG AAGGTTTTGA AGGCATTATG GCTGCGCAGG AATGCCTGCC AACCAACGAG AAACGCGATG AATTCGCTGC CAGCTTTGGT GTGTTATCCA AGTTATGGTC AGCTATCAAC CCAGATCCTT TTTTGACTCC TTATCGTCAG GACTATAAAT GGCTAGCGCA AATTTATGAA TCGGTGCGTC CGGTGGGGCA GACAGGTGCG CTTGTTTGGG CAGCCCTTGG CCCCGAAACA ATTAAGATGA TCCATGAGCA TACTGATATC AATCGTATTC GCGATGATAT TGACGAGTTG ATCATGGATG AGCACGCAAT TTTTACCCTG ACGGTTAAAG AACAGGAACA ACGCGCAAAA CGCCTGGAGA TTGATCTTAT GGGGCGTTTA CGCGGTAGCC ATGATCCTAA ATTTGTGGCA TTGGGTGAGC GTCTGGAAAA ACTGCGTCAG GACTATGAAG CTGGTGTTAT CAAGGCCATC GACTGGTTGA AAGGCCTGTT GGATGCTGCG AAAGATACCG TACAAGCCGA GCGCGAAACG GGCGAGCGCC CAGTAACTGA AGCTGATAAT AAACAGGCTT TGACCAAACT TTTCCTCGAA ACACGCCCAG AAACTACCCC TAAGTTAATC GGCGACGTTG TTGAGCAAAT CGATAAAATC GTCAAAGCAA CTCGCTTTGA TGGCTGGCAA AGTTCCCACA GCGGTCCACG CGAAATTCAA AAGGCGCTTT TGCTCACTCT GGCTCAATTT GGATTAGGTA AAGATAAAGA GTTATTCCAG AAGGCTTACG GGTATATCGA GGAGCATTAC TGA
|
Protein sequence | MFNEQTVTEN GIIDRLKGLS GVKWSYCHGE KLPKKAQDIF VDEWLKDALC SLNPDIGRQP DYADEVIYKL RGVVLEARHT GLVKANENFQ EWLMADKTLP FGENGDHITI NLIDFENIEN NHFVVAQQVH YIAATEVYFD IVLYVNGIPL VVGEVKTATR PSVTWQDGAA DFMGGKKYYW KNVESFFVPN LLCFASEGKT FAYGAINARV KDWGPWHNTE LRDEILPGLA SVLNSCEGLL NPQTLLQLLE SFALFSTVKT GKNTPPKRVK ILPRYPQFEA AKQIVERVRR GYPKKGLIWH FQGSGKSLLM LYAAKMLRAD NALKNPTVLI VVDRRDLDSQ INETFGGADV KNLIKVQSCK KLGEYIEQDS RGILITTIFK FKDIEIDDSN PNGLNNRDNI IVLVDEAHRT QEGGLGEKMR WALPNAHFYG LTGTPISGID RNTFKLFGAE EDPGRYMSRY SYKQSIRDGA TNPVKFEPRL AELRVDRDAI NEEFEQLATE NNLDEEEKAA LSRRAGKLAI MLKSPRRMAA VSNDIVEHFT SHVMPKKMKG MVVVYDREAC VQMYYLLGEK LGFDAVEVVM NVDQAPVKVE EGGKKDKLNK DWLKWHDELE LPIKQADFER WQHIDAEEQV QKDLIECFKD PVHPLQLIIV TAKLLTGFDA PICYCMYLDK PLRDHTLLQA MCRTNRLYET DDVRKDMGLI VDYLGVFENL RTALAYNPEE IEGVVEGIEA FKELLPLQLN KCLAFFPGID RSIEGFEGIM AAQECLPTNE KRDEFAASFG VLSKLWSAIN PDPFLTPYRQ DYKWLAQIYE SVRPVGQTGA LVWAALGPET IKMIHEHTDI NRIRDDIDEL IMDEHAIFTL TVKEQEQRAK RLEIDLMGRL RGSHDPKFVA LGERLEKLRQ DYEAGVIKAI DWLKGLLDAA KDTVQAERET GERPVTEADN KQALTKLFLE TRPETTPKLI GDVVEQIDKI VKATRFDGWQ SSHSGPREIQ KALLLTLAQF GLGKDKELFQ KAYGYIEEHY
|
| |