Gene MCA1888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1888 
SymbolhsdM 
ID3103261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2029427 
End bp2031007 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content64% 
IMG OID637171045 
Producttype I restriction-modification system, M subunit 
Protein accessionYP_114323 
Protein GI53803793 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCGAAA AGCTGTCTCA ACAGGAAGTC AACGCCACCG CCTGGGCGGC GTGCGACACC 
TTCCGGGGCG TGGTCGATCC CGCGCAGTAC AAGGACTACA TCCTGGTGAT GCTGTTCCTG
AAGTACATCA GCGACCTTTG GAACGACCAC TACGCCGAAT ACAAGGCGCA GTACGGGGAT
GACGACGAGC GCATCCGCCG CAAGCTCGAG CGCGAGCGCT TCATCCTGCC CTATGTCGAG
CTGAAGGAAG ACGATCAAGA GACCGGCAAG AGCCAGGTCA TCGACCGCTT CCTGGGCGAC
TTCAATGCGC TGTACGAGCG CCGCAACGAG CCCAACATCG GCGAGCTGGT CAACATCGTG
CTCGACCACA TCGAGGACGC CAACAAGGCC AAGCTCGAAG GGGTGTTTCG CAACATCGAC
TTCAACAGCG AGGCCAACCT CGGCAAGGCC AAAGACCGCA ACCGCCGCCT CAAAACCCTG
CTGGAGGACT TCGCCAAGCT CGACCTGCGC CCTTCGCGCG TGTCCGAAGA CGTCATCGGC
AATACCTACA TCTACCTCAT CGAACGCTTC GGCTCGGATG CCGGCAAGAA GGCCGGCGAG
TTCTACACCC CCAAGATGGT CTCGCGGCTG TTGGCGGCGC TGGCCAACCC CAGGCCGGGC
GACCGCATCT GCGACCCTTC CTGCGGCTCG GGCAGCCTGC TGATCGAAGC CGCGCAGTGG
GTCGAGGCGC AGGGCAGCCA CAACTACGCC CTGTTCGGCG AAGAAGTGAA CGGCGCCACC
TGGGCGCTGG CGCGGATGAA CATGTTCATC CACAGCAAGG ACGCAGCGCG CATCGAGTGG
TGCGACACCC TGAACAGCCC AGCGCTGATC GAGGGCGACC GGCTAATGAA GTTCAATGTG
GTGGTCGCCA ACCCGCCGTT TTCGCTCGAC AAGTGGGGCG CAGAGCACGC CGACCACGAC
CGTTTCAACC GCTTCTGGCG CGGCGTGCCG CCCAAGTCCA AGGGCGACTG GGCCTTCATC
ACCAACATGA TCGAGCGCGC CCTGCCAAGG GAAGGCCGGG TGGCCGTGGT CGTGCCGCAC
GGCGTGCTCT TTCGCGGCGG CGCCGAAGGC CGCATCCGCC GCGCCATGAT CGAGGAAAAC
CTGCTCGATG CCGTCGTGGG CTTGCCGGGC AACCTGTTCC CCACCACCTC GATCCCGGTG
GCCATCCTGC TCTTTGACCG CGCCCGCGAA AAAGGCGGCC CGCGTGAGGA TGTGCGCGAC
GTGCTGTTCG TGGACGCGAG CCGCGAGTTC ATTCCCGGCA AGAACCAGAA CCAGCTCTCC
GAAGCGCACT TTCAGAAGAT CGTCTCGACG GTGGCCGCGC GGCGCAACGT CGACAAATAC
GCCTACGTGG CCTCACTCGA CGAGATAGCC GAAAACGACT TCAACCTCAA CATCCCGCGT
TACGTCGACA CCTTCGAGGA GGAGGAAGAA ATCGACGTCG CCGCCGTGCA GCGCGAAATC
GAACAGCTCG AACGGGAGCT TGCCGACGTC CGCGCCCGCA TGCGCGAGCA CCTCAAGGCG
CTGGGGGTGG AGGGCGTATG A
 
Protein sequence
MTEKLSQQEV NATAWAACDT FRGVVDPAQY KDYILVMLFL KYISDLWNDH YAEYKAQYGD 
DDERIRRKLE RERFILPYVE LKEDDQETGK SQVIDRFLGD FNALYERRNE PNIGELVNIV
LDHIEDANKA KLEGVFRNID FNSEANLGKA KDRNRRLKTL LEDFAKLDLR PSRVSEDVIG
NTYIYLIERF GSDAGKKAGE FYTPKMVSRL LAALANPRPG DRICDPSCGS GSLLIEAAQW
VEAQGSHNYA LFGEEVNGAT WALARMNMFI HSKDAARIEW CDTLNSPALI EGDRLMKFNV
VVANPPFSLD KWGAEHADHD RFNRFWRGVP PKSKGDWAFI TNMIERALPR EGRVAVVVPH
GVLFRGGAEG RIRRAMIEEN LLDAVVGLPG NLFPTTSIPV AILLFDRARE KGGPREDVRD
VLFVDASREF IPGKNQNQLS EAHFQKIVST VAARRNVDKY AYVASLDEIA ENDFNLNIPR
YVDTFEEEEE IDVAAVQREI EQLERELADV RARMREHLKA LGVEGV