Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4931 |
Symbol | |
ID | 6875474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 4767337 |
End bp | 4768956 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642787804 |
Product | type I restriction-modification system, M subunit |
Protein accession | YP_002218397 |
Protein GI | 198242593 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | [TIGR00497] type I restriction system adenine methylase (hsdM) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.748503 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 0.593383 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAAG CACCAACGAA AAAAGCCAAA GCAAAGAAAG GTTTTGAAGA CACATTATGG GATACCGCGA ACCAGCTTCG CGGCAGCGTT GAATCCTCCG AGTACAAGCA CGTGGTGCTG AGCCTCGTGT TCCTCAAATT TATTAGCGAT AAGTTTGAAG CCCGTCGCAA ACAGATGGAA GACGAAGGCC AGGGCGATTT CCTCGAAATG GAAGTCTTCT ATCAGCAGGA CAACATCTTC TACCTGCCGG AGGAAGCGCG CTGGTCGTTC ATCAAGCAAA ACGCTAAGCA GGACGATATC GCCGTACGCA TTGATACCGC CCTCTCCACC ATTGAGAAGC GCAACCCGTC GCTGAAAGGC GCGCTGCCGG ACAACTACTT CAGCCGTCAG AATCTGGAAA CCAAGAAGCT GGCGTCGCTT ATCGACACCA TCGATAACAT CGAAACGCTG GCCCACGAAA CTGACGTGGA AGCGCTGTCG AAAGAAGACC TTGTCGGTCG CGTGTACGAA TACTTCCTCG GCAAGTTTGC CGCAACCGAA GGCAAAGGCG GGGGCGAGTT CTATACGCCG AAATGCGTGG TGACGCTGCT GACCGAAATG CTCGAACCGT TCCAGGGCAA AATTTATGAC CCCTGCTGCG GCTCGGCGGG GATGTTTGTC CAGTCGGTGA AGTTTGTTGA AAGCCATCAG GGGAAAAGCC GCGATATCGC CCTGTACGGC CAGGAGCTGA CGGCCACCAC CTACAAGCTG GCGAAGATGA ACCTTGCGAT TCGCGGACTT TCCGCCAACC TTGGTGAGCG CCCGGCGGAC ACCTTCTTCA GCGACCAGCA CCCGGACCTG AAAGCCGACT ATATTCTGGC CAACCCGCCG TTTAACCTGA AAGACTGGCG TAACGACGCC GAGCTGACCA AAGACCCGCG CTTTGCTGGC TACCGCACGC CGCCGACCGG TAACGCCAAC TACGGCTGGA TTTTGCATAT GCTTTCCAAG TTGTCGGCCA ACGGCACCGC CGGTTTTGTG CTGGCGAACG GCTCGATGAG CTCCAACACC AGCGGCGAAG CCGAGATCCG CGCGCAGATG ATCGAAAACG ATCTGATCGA CTGCATGATC GCCCTGCCCG GACAGCTGTT CTTCACCACC CAAATCCCGG TGTGCCTGTG GTTTATGACC AAATCGAAGG CAGCCGATCC GGCCAAAGGC TATCGTAACC GTCAGGGTGA GACGCTGTTT ATTGATGCGC GTAACCTTGG TACCATGATC AACCGCACCA CCAAAGAGCT GACGGCAGAC GATATTGCCA CCATCGCCGA TACCTACCAT GCCTGGCGCA GCACGCCGGA AGAGCTGGTC GAGCGCGTTA AGCGCGGCGA CAGCCAGTTG GCGCAATATG AAGACCAGGC CGGATTCTGC AAGGTTGCGA CCATTGCAGA GATCAAGGCG AACGACTCTG TGCTGACGCC GGGCCGTTAT GTCGGTGCCG CCGAGCAGGA AGACGACGGC GTAGCGTTTG AAACCAAAAT GCGCGAGTTG TCGCAGACGT TGTTTGCACA GATGAAGCAG GCGGAAGAAC TGGATAACGC GATTCGTCAG AATCTGGAGG TGCTGGGTTA TGGCATTTGA
|
Protein sequence | MAKAPTKKAK AKKGFEDTLW DTANQLRGSV ESSEYKHVVL SLVFLKFISD KFEARRKQME DEGQGDFLEM EVFYQQDNIF YLPEEARWSF IKQNAKQDDI AVRIDTALST IEKRNPSLKG ALPDNYFSRQ NLETKKLASL IDTIDNIETL AHETDVEALS KEDLVGRVYE YFLGKFAATE GKGGGEFYTP KCVVTLLTEM LEPFQGKIYD PCCGSAGMFV QSVKFVESHQ GKSRDIALYG QELTATTYKL AKMNLAIRGL SANLGERPAD TFFSDQHPDL KADYILANPP FNLKDWRNDA ELTKDPRFAG YRTPPTGNAN YGWILHMLSK LSANGTAGFV LANGSMSSNT SGEAEIRAQM IENDLIDCMI ALPGQLFFTT QIPVCLWFMT KSKAADPAKG YRNRQGETLF IDARNLGTMI NRTTKELTAD DIATIADTYH AWRSTPEELV ERVKRGDSQL AQYEDQAGFC KVATIAEIKA NDSVLTPGRY VGAAEQEDDG VAFETKMREL SQTLFAQMKQ AEELDNAIRQ NLEVLGYGI
|
| |