Gene Information |
Locus tag | SbBS512_E0923 |
Symbol | dcm |
ID | 6272301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 861729 |
End bp | 863147 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641725084 |
Product | DNA cytosine methylase |
Protein accession | YP_001879611 |
Protein GI | 187732725 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
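
The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes 1-based inclusive coordinates (which the Start bp, End bp, and Gene Length values imply) and an unspliced CDS ending in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (SbBS512_E0923 on NC_010658).
length = gene_length(861729, 863147)
print(length, protein_length(length))  # prints: 1419 472
```

The results match the Gene Length (1419 bp) and Protein Length (472 aa) fields.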
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.579059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGAAA ATATATCAGT AACCGATTCA TACAGCACCG GGAATGCCGC ACAGGCAATG CTGGAGAAAC TGCTGCAAAT TTATGATGTT AAAACGCTGG TGGCGCAGCT TAATGGTGTG GGTGAGAATC ACTGGAGCGC GGCAATTTTA AAACGTGCGC TGGCGAATGA CTCGGCATGG CACCGTTTAA GTGAGAAAGA GTTCGCCCAT CTGCAAACGT TGTTACCCAA ACCACCGGCA CATCATCCGC ATTATGCGTT TCGCTTTATC GATCTATTTG CCGGAATTGG CGGCATCCGT CGCGGTTTTG AATCGATTGG CGGACAATGC GTGTTTACCA GCGAATGGAA CAAACATGCG GTACGCACTT ATAAAGCCAA CCATTATTGC GATCCGGCGA CGCATCATTT TAATGAAGAT ATCCGCGATA TCACCCTCAG CCATAAAGAA GGCGTGAGTG ATGAGGCGGC GGCGGAACAT ATTCGTCAAC ACATTCCTGA ACACGATGTT TTACTGGCCG GTTTCCCTTG TCAGCCATTT TCGCTGGCTG GCGTATCGAA AAAGAACTCG CTCGGGCGGG CGCACGGTTT TGCCTGCGAT ACCCAGGGCA CGCTGTTTTT TGATGTGGTA CGCATTATCG ACGCGCGTCG TCCGGCGATG TTTGTGCTCG AAAACGTCAA AAACCTGAAA AGTCACGACC AGGGTAAAAC GTTCCGCATC ATCATGCAGA CGCTGGACGA ACTGGGCTAT GACGTGGCTG ATGCAGAAGA TAACGGGCCG GACGATCCGA AAATCATCGA TGGTAAACAT TTTCTGCCGC AGCACCGTGA ACGCATCGTG CTGGTGGGTT TTCGTCGCGA TCTTAATCTG AAAGCCGATT TTACTCTGCG TGATATCAGC GAATGTTTCC CTGCACAGCG AGTGACGCTG GCGCAGCTGT TGGACCCGAT GGTCGAGGCG AAATATATCC TGACGCCGGT GCTGTGGAAG TACCTCTATC GATATGCGAA AAAACATCAG GCGCGCGGTA ACGGCTTCGG TTATGGAATG GTTTATCCGA ACAATCCGCA AAGCGTCACG CGTACGCTGT CTGCGCGTTA TTACAAAGAT GGCGCGGAAA TTTTAATCGA TCGCGGCTGG GATATGGCCA AAGGTGAGAA AGACTTTGAC GATCCGCTGA ATCAGCAACA TCGTCCACGT CGGTTAACGC CTCGGGAATG CGCGCGCTTA ATGGGTTTTG AAGCGCCGGG AGAAGCGAAA TTCCGCATTC CGGTTTCGGA CACTCAGGCC TATCGCCAGT TCGGTAACTC GGTGGTCGTG CCGGTCTTTG CCGCGGTGGC AAAACTGCTT GAGCCAAAAA TCAAACAGGC GGTGGCGTTG CGTCAGCAAG AGGCACAACA TGGCCGACGT TCACGATAA
Protein sequence | MQENISVTDS YSTGNAAQAM LEKLLQIYDV KTLVAQLNGV GENHWSAAIL KRALANDSAW HRLSEKEFAH LQTLLPKPPA HHPHYAFRFI DLFAGIGGIR RGFESIGGQC VFTSEWNKHA VRTYKANHYC DPATHHFNED IRDITLSHKE GVSDEAAAEH IRQHIPEHDV LLAGFPCQPF SLAGVSKKNS LGRAHGFACD TQGTLFFDVV RIIDARRPAM FVLENVKNLK SHDQGKTFRI IMQTLDELGY DVADAEDNGP DDPKIIDGKH FLPQHRERIV LVGFRRDLNL KADFTLRDIS ECFPAQRVTL AQLLDPMVEA KYILTPVLWK YLYRYAKKHQ ARGNGFGYGM VYPNNPQSVT RTLSARYYKD GAEILIDRGW DMAKGEKDFD DPLNQQHRPR RLTPRECARL MGFEAPGEAK FRIPVSDTQA YRQFGNSVVV PVFAAVAKLL EPKIKQAVAL RQQEAQHGRR SR
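
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used to generate this record. It assumes the sequence is pasted in the space-separated 10-base blocks shown above, and it uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. Table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI genetic-code listing: bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGCAGGAAA ATATATCAGT AACCGATTCA"))  # prints: MQENISVTDS
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field (52%).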