Gene Information |
Locus tag | SeD_A1249 |
Symbol | |
ID | 6871483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 1243339 |
End bp | 1244769 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642784421 |
Product | DNA cytosine methylase |
Protein accession | YP_002215094 |
Protein GI | 198242597 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
|
|
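The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 1243339
END_BP = 1244769


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1431, matching "Gene Length"
print(protein_length(length_bp))  # 476, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (1431 bp) and Protein Length (476 aa) fields.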
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.608462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGAAA ATATTTCAGT AACACACGCC CGGAACCTCA TCGCCGACGA CGCCGGAAGC GAGATCCAGG CGATGCTGAG TCAATTGCTG GAAATCTACG ATGTTAAAAC GCTGGTGGCG CACCTTAACG GCCTGGGCGA ACAGCACTGG AGCCCGGCCA TCTTAAAGCG CGTAATGATG AACGCGGCAT GGCATCGTTT GAGCGACAAT GAACTCACCT GTCTTAAAAC AGAGTTGCCG ACGCCGCCAG CGCATCATCC ACATTACGCC TTTCGTTTTA TCGATCTCTT CGCGGGCATC GGCGGTATTC GCCGCGGATT TGAAGCGATA GGCGGACAGT GCGTGTTTAC CAGCGAATGG AATAAGCACG CGGTACGGAC ATATAAAGCG AACTATTTTT GCGATCCGCT GCAACATCGC TTTAATGAAG ATATCCGCGA TATCACGTTG AGCCACCGGG AAGGGGTCAG CGATGATGAG GCGGCGGAAC ACATTCGCCA GCATATTCCG CAACATGATG TCCTGTTGGC GGGCTTTCCC TGTCAGCCAT TTTCTCTGGC GGGCGTTTCC AAGAAAAATG CGTTGGGCCG CGCCCACGGC TTTGCCTGCG AGACTCAGGG GACGTTATTT TTTGATGTCG TAAGAATTAT CGATGCTCGC CGCCCCGCGC TGTTTGTGCT GGAAAACGTG AAAAACCTTA AAAGTCACGA CCAGGGCAAC ACTTTCCGCA TTATTATGCA AACGCTCGAT GAACTGGGAT ATGACGTGGC GGATGCCGCT GACAATGGCC CGGACGATCC GAAAATTATC GACGGGCAGC ACTTTCTTCC TCAGCATCGG GAACGTATTG TGTTGGTGGG ATTCCGTCGC GATTTAAACC TGAAAACCGA TTTTACGTTA CGCAATATCT CCCGCTGTTA TCCACCGCGC CGTCCGACGC TGGCAGAACT GCTGGAGCCC GTCGTCGAAG CCAAATATAT CCTGACGCCG GTGCTGTGGA AATATTTATA TCGCTACGCG AAAAAGCACC AGGCGCGGGG AAACGGTTTT GGCTATGGCA TGGTTTATCC TGACAATCCG GAAAGTGTGG CGCGCACGTT ATCTGCTCGC TACTACAAAG ATGGTGCCGA AATTCTGATC GATCGTGGTT GGGATATGGC GAAAGGCGAA GTGAATTTCG ACGATGCTGG CAACCAACAA CATCGTCCCC GCCGACTCAC GCCGAGAGAG TGCGCGCGTT TAATGGGATT TGAGGCGCCG CAAACGTACC AGTTCAGGAT ACCTGTCTCG GATACGCAGG CCTATCGCCA GTTTGGCAAC TCCGTGGTGG TGCCGGTATT TGCCGCGGTA GCAAAGCTGC TGGAACCCAA AATTCACCAG GCGGTGACGC TGCGTCAGAG AGAGACGGTA GATGGCGGAC GTTCACGATA A
|
Protein sequence | MQENISVTHA RNLIADDAGS EIQAMLSQLL EIYDVKTLVA HLNGLGEQHW SPAILKRVMM NAAWHRLSDN ELTCLKTELP TPPAHHPHYA FRFIDLFAGI GGIRRGFEAI GGQCVFTSEW NKHAVRTYKA NYFCDPLQHR FNEDIRDITL SHREGVSDDE AAEHIRQHIP QHDVLLAGFP CQPFSLAGVS KKNALGRAHG FACETQGTLF FDVVRIIDAR RPALFVLENV KNLKSHDQGN TFRIIMQTLD ELGYDVADAA DNGPDDPKII DGQHFLPQHR ERIVLVGFRR DLNLKTDFTL RNISRCYPPR RPTLAELLEP VVEAKYILTP VLWKYLYRYA KKHQARGNGF GYGMVYPDNP ESVARTLSAR YYKDGAEILI DRGWDMAKGE VNFDDAGNQQ HRPRRLTPRE CARLMGFEAP QTYQFRIPVS DTQAYRQFGN SVVVPVFAAV AKLLEPKIHQ AVTLRQRETV DGGRSR
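The gene and protein sequences above can be checked against each other. The sketch below is illustrative only: it translates the first three and the last three displayed blocks of the gene sequence, using the codon-to-amino-acid assignments that translation table 11 shares with the standard code (table 11 differs from the standard code in its permitted start codons, which this sketch does not model). The helper names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; the amino acid assignments are the same
# for the standard code and for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spacing between 10-base display blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1; '*' marks a stop, a trailing partial codon is dropped."""
    dna = clean(seq)
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First three and last three blocks of the gene sequence shown above.
# The last three blocks begin at base 1411, which is the start of a codon.
head = "ATGCAGGAAA ATATTTCAGT AACACACGCC"
tail = "GATGGCGGAC GTTCACGATA A"

print(translate(head))  # MQENISVTHA
print(translate(tail))  # DGGRSR*
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence, followed by the TAA stop codon. Pasting the full gene sequence into `translate` in place of `head` should, under the same assumptions, yield the full protein plus a terminal `*`.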