Gene CPF_1017 details

Gene Information

Locus tag: CPF_1017
Symbol: dcm
ID: 4202877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1168761
End bp: 1169984
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 34%
IMG OID: 638081898
Product: DNA-cytosine methyltransferase
Protein accession: YP_695463
Protein GI: 110800293
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID: [TIGR00675] DNA-methyltransferase (dcm)
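
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (a convention this record does not state explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 1168761
end_bp = 1169984

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in codons of three bases; the last codon is
# assumed to be a stop codon, which does not add an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1224
print(protein_length)   # 407
```

Both results agree with the Gene Length (1224 bp) and Protein Length (407 aa) fields of the record.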


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000173246
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGATAG AAAAAACTGT ATGTGAACTG TTTGCAGGTG TTGGTGGATT CCACCTTGGA 
TTAAGTAAAG CATCAGCAGA CTGGGAGGTT CTTTGGGCGA ATCAATGGGA ACCTTCAAGA
AAGGTACAGC ATGCATTTGA GTGTTATGCA AAGCATTTTC CAAAGACAAA TGCTGTTAAT
GAAGATATAG CTTTAGTAAA TGAAAATCCA GAGGCTTTTG GACTTCCAAA GTATAATTTA
CTTGTAGGTG GTTTTCCATG CCAAGATTAT TCAGTAGCTG CTACGAAGTC AAAAGGAATA
GAAGGGAAAA AAGGAGTTTT ATGGTGGGAG ATAAGAAAGT TTCTAGAAAG AGACATGCCA
CCATTTGTAC TTTTAGAGAA TGTTGATAGA TTACTAAAAT CTCCTGCAAA GCAAAGGGGA
AGAGATTTTG GAGTAATGTT AACTTGTTTT AGAGATTTAG GATACAATGT TGAGTGGAGA
GTTATAAATG CGGCAGATTA TGGATTCTCT CAAAGAAGAA GAAGAGTATT TATATTTGCT
TATAAACATG ATACTAATTA TTGTGAAAGA GTAACTCAAG GATTAGAAAA TGAGGGATTC
ATGGAAGCTT ATATAAAAGA AAATGGATTC TTTGCAGAAC CATTTCCTAT AGAAACTATA
TATGGTGAAG ATGAAAAAGT ATTAAATCAA GATATACTTG AGGTATCAGA TCATTTTTCA
TTTGGTTTCC AAGAATCAGG AGCATTAATA AATGATAGAA TTTATACAAC TAGATATACA
CCACAAAGTC AAGAACCAGT TACACTTGGT GAAATACTTC AAAAGGATGT AAGTGAAGAG
TTTTATTTAG GTGAAGAATT AGATAGATGG ACTTATATGA AGGGTGCTAA GGCAGAGCCT
AGAACAACTA AAGAAGGCTT TGAGTATATC TATAGAGAGG GAGCTGTTGG ATTTCCAGAT
AGTTTAGAGT TACCTGCAAG AACAATGCTT ACAAGTGAAG CTAGTGTAAA TAGAAGTACT
CATGTAGTAA AGGATCCACA AACTAATAGA TTAAGAGTAT TAACACCAGT AGAGTGTGAA
AGATTAAATG GCTTTGATGA TGAATGGACT AATACAGGAA TGACTCAGAA GTTTAGATAT
TTTTGTATGG GAAATGCATT GGTTGTTGGA TTAATTGAAA GAATGGGTAG AAAATTAGAT
AGAATATTTG AATTAGAAGA GTAA
 
Protein sequence
MAIEKTVCEL FAGVGGFHLG LSKASADWEV LWANQWEPSR KVQHAFECYA KHFPKTNAVN 
EDIALVNENP EAFGLPKYNL LVGGFPCQDY SVAATKSKGI EGKKGVLWWE IRKFLERDMP
PFVLLENVDR LLKSPAKQRG RDFGVMLTCF RDLGYNVEWR VINAADYGFS QRRRRVFIFA
YKHDTNYCER VTQGLENEGF MEAYIKENGF FAEPFPIETI YGEDEKVLNQ DILEVSDHFS
FGFQESGALI NDRIYTTRYT PQSQEPVTLG EILQKDVSEE FYLGEELDRW TYMKGAKAEP
RTTKEGFEYI YREGAVGFPD SLELPARTML TSEASVNRST HVVKDPQTNR LRVLTPVECE
RLNGFDDEWT NTGMTQKFRY FCMGNALVVG LIERMGRKLD RIFELEE
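
The relationship between the gene sequence and the protein sequence, and the way a GC content figure is derived, can be illustrated with a short Python sketch. It is not the database's own method. It assumes that translation table 11 assigns amino acids to codons exactly as the standard genetic code does (the two differ in permitted start codons), and the helper names `clean`, `gc_content` and `translate` are hypothetical. Only the first 60 bases of the gene sequence are used to keep the example short.

```python
# Codon assignments in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        return 0.0
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as displayed in the record.
first_block = "ATGGCGATAG AAAAAACTGT ATGTGAACTG TTTGCAGGTG TTGGTGGATT CCACCTTGGA"

print(translate(first_block))   # MAIEKTVCELFAGVGGFHLG
print(gc_content(first_block))  # GC percentage of these 60 bases only
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the complete gene sequence to `gc_content` would give the figure to compare with the GC content field of the record.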