Gene CPF_2599 details


Gene Information

Locus tag: CPF_2599
Symbol: hsdM
ID: 4201458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2866431
End bp: 2867948
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 31%
IMG OID: 638083466
Product: type I restriction-modification system, M subunit
Protein accession: YP_696989
Protein GI: 110799934
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID: [TIGR00497] type I restriction system adenine methylase (hsdM)
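
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a gene length that includes one stop codon; the function names are illustrative and do not come from any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length = gene_length_bp(2866431, 2867948)
print(length, protein_length_aa(length))  # prints: 1518 505
```

Both values agree with the Gene Length and Protein Length fields of this record.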


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0274218
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAGC TTACACTACA AGAATTAGAA TCAACACTTT GGCAATGTGC TGATATTCTT 
AGAGGAGAAT TATCAGCTGC TGAGTACAAA GATTATATAT TTGGAATGTT ATTTTTAAAA
AGATTAAATG ATGAGTTTGA TGAGGAAAGA GAAGAGAGAA GAAAAGAGTT TTTAGAAGAT
GGATTAGATA AAGAGGAGGT AATAGAACTT TTAGAAGACC CTAGTATTTA TGAGACTTTC
TTTGTTCCAG AGCAAGCAAG ATGGGAGAAA CTTAGAAATT TAACTTTAAA TATAGGACCA
GAATTAGATA AAGCATTTAA AGCTTTAGAA GACGAACCTA AAAACTCAGA ATTAGTAGGA
GTTTTAAGCA CAACTAACTA CAACGATAAA GAAAAAGTAC CAGATAAAAA ATTAGCACAA
CTTTTAGTTT TATTTAGTAC AGTAAACTTA GCAAACTCAA ACTTGGCATC AGAAGATATG
CTAGGGGATG CTTATCAGTA TTTAATAAAA CAATTTGCCG ACCAAGGTGG TAAGAAAGGG
GGAGAATTCT ATACTCCTAC TGAAGTTGTT AAGGTTATTA CAAATATTTT AAAACCACAA
GAAGGGGATA GAATATATGA CCCAACTTGT GGTTCAGGTG GTATGCTTAT TCAAAGTATA
GAGTATGTTA AAAAGCATGG TGGAAATCCT AAGAATTTAA GTTTATTTGG TCAAGAAATA
AACCTTTCAA CTTGGGCAAT TTGTAAAATG AATATGCTAT TTCATGGGGC AAAGGGAGCA
GATATTCAAA AAGGTGATAC TATAAGAGAA CCAAAACATA CAGAAGGTGG AGCATTAAAG
GTATTTGATA AGGTATTAGC AAACCCACCA TTCTCACTTA AAAACTGGGG AGCAGAAGAA
GCTTCATATG ATGCTTTTCA TAGATTTACT TATGGAATTC CACCAAAGTC ATATGGAGAT
TTAGCTTTTG TTGAACACAT GTTAGGAAGT TTAAATATGA AAGGTAAAAT GGCATCAGTA
GTTCCTCATG GAGTTTTATT TAGAGGGTCA GCTGAAGGAA AGATAAGAAA AGGATTCATA
GAAGATGACT TAATAGAGGC GGTAATAGGA CTTCCACAGA ACTTATTCTA TGGAACTGGA
ATACCAGCAG CTATATTAGT ACTTAATAAG GCGAAATCAG AAGAAAGAAA AAATAAAATA
TTATTTATAG ACGGAAGTAA TGATTTTGTA AAACAAGGAA ACAAAAATAA ATTAAGAGAG
GAAGATATAG AAAAAATAAT AACTGCATTT GATAAGTTTG AAGATGTAGA AAAATATGCA
AATGTAATAG ATTTAGAAAC AATAAAAGAA AATGACTATA ACTTAAATAT AAGTAGATAT
GTAGATACAA CTGAAGAAGA AGAACCAGTA GATATACAAA AAGTTATAGA TGAAATAAAA
GAATTAGAAA AAGAAGAAGA GAAAACAAAA GAGAAACTTA ATGGATACCT AAAAGAATTA
GGATTTGATG TTCTGTAA
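
The gene sequence is printed in blocks of 10 bases, so the spacing has to be stripped before any computation such as the GC content field. A minimal sketch follows, with hypothetical helper names; it is shown on the first line of the sequence only, and the same calls could be applied to the full text.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, as printed in this record.
first_line = "ATGGCAAAGC TTACACTACA AGAATTAGAA TCAACACTTT GGCAATGTGC TGATATTCTT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 2))
```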
 
Protein sequence
MAKLTLQELE STLWQCADIL RGELSAAEYK DYIFGMLFLK RLNDEFDEER EERRKEFLED 
GLDKEEVIEL LEDPSIYETF FVPEQARWEK LRNLTLNIGP ELDKAFKALE DEPKNSELVG
VLSTTNYNDK EKVPDKKLAQ LLVLFSTVNL ANSNLASEDM LGDAYQYLIK QFADQGGKKG
GEFYTPTEVV KVITNILKPQ EGDRIYDPTC GSGGMLIQSI EYVKKHGGNP KNLSLFGQEI
NLSTWAICKM NMLFHGAKGA DIQKGDTIRE PKHTEGGALK VFDKVLANPP FSLKNWGAEE
ASYDAFHRFT YGIPPKSYGD LAFVEHMLGS LNMKGKMASV VPHGVLFRGS AEGKIRKGFI
EDDLIEAVIG LPQNLFYGTG IPAAILVLNK AKSEERKNKI LFIDGSNDFV KQGNKNKLRE
EDIEKIITAF DKFEDVEKYA NVIDLETIKE NDYNLNISRY VDTTEEEEPV DIQKVIDEIK
ELEKEEEKTK EKLNGYLKEL GFDVL
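
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the database's own method. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons), and the `translate` helper is a hypothetical name.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are those shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGCAAAGC TTACACTACA AGAATTAGAA TCAACACTTT GGCAATGTGC TGATATTCTT"
print(translate(first_line))  # prints: MAKLTLQELESTLWQCADIL
```

The output matches the first 20 residues of the protein sequence listed above.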