Gene CPF_1018 details


Gene Information

Locus tag: CPF_1018
Symbol: dcm
ID: 4202576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1170044
End bp: 1171384
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 30%
IMG OID: 638081899
Product: DNA-cytosine methyltransferase
Protein accession: YP_695464
Protein GI: 110799652
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID: [TIGR00675] DNA-methyltransferase (dcm)
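
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1170044
end_bp = 1171384
gene_length_bp = 1341
protein_length_aa = 446

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions the three fields agree with one another: 1341 bp is 447 codons, one of which is the stop codon.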


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000487753
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACACAT ATATAGATTT ATTTGCTGGA CCAGGAGGAT TATGTACTGG ATTTAAAAAT 
GCTGGATTTA AGCCTTTAAT TGCAGTTGAG ATGAGTGATA ATACTGTAAA AACTTATGCA
AGAAACCATG AAGCGGAAGT TTATTCTTTA CAAGAACTTT TAGAAAACAA GGGGAGACTA
GAAGAAATAT TAAATATTAA TACTGATAAT ACTTGCTTAA TACATGGAGA TATACGTTTA
GTAGATAATG ATATTATAGT TGAAATACTT CAAAAGAAAT TTAAAACTAA TAGTGTAGAT
GTTGTTGCGG GAGGACCTCC TTGTGAATCT TTCTCACTTG CTGGGAAGAG AATTGATGGT
GATGAAAGAG ATGATCTATT TAAGAATATG CTAAGAATAG CTAGTATTTC AAATAGTAAA
TTTATATTTT TTGAAAATGT ACCAGGATTA TTAACTAAGA AAAGTAATAA TATAAAAGTA
TTTGATGTTA TTGTAGAAGA ATTTGATAAT TATGGATATA ATCTTGCTAG TACTGATAAG
AATATAATTA AATGTTTAGC TGCTGATTAT GGTGTTCCTC AAAATAGAGA GAGAGTTTTC
CTTATAGGAA TAAATAAAAT GTATGGAGAG AATCCATATA TTTATCCAGA AAAAACTCAC
GGAGAAGGAA GAAAATTTGA ATATATTAGT GTGAGTGATG CTCTAAGATA TTTACCTGAG
TTGAATAGTG GAGAAGGTGC TGATATTCAA CAAATAACAT ATAACTTTGA GGAAGATTTT
AGAAAAGGAA AGATTTCAGA AGCGGTATAT AATTATTTAA AATTTATTGC AGGAAAGGAA
GGGTATATAC CACCACATAT AAAGGAATCT ATTGATGATG GCTTAATAGA GATACATAAA
GCTGTTAAGC ATAGAGAAAA AATGATTAAT AGAATGAGTT ATATTAAACA GGGCGAAGGT
ATGAAAAAGG CTGCTGAAAG ATTAATAAAT GAGGGAAAAG AAGATATTGT TAGAGCTTAT
TTTCCCAATA AATTATATGC TGCTAGAAAT AGAAGATTAA AAGCAAATGA ACCATCATTT
ACAGTAACAA GTCATTGTCT TGATGAAATG GTTCATCCTT ATAATAATAG AGGTTTGACT
CCAAGAGAGG CAGCAAGGTT GCAATCATTT CCAGATTGGT ATGTGTTTGA AGGTGAGTAT
GTAAAATTTC ATTCGGATCC ACAACAAGAT AAGTATGAGC AATGTGGTGA TGCGATTCCA
GTATTATTAG TAAAAGCTTT AGCAGAACAA TTAAAAATTG CATTAAATAT TGTTAGTGAA
CGAACTAGTA TTAATAAGTA G
 
Protein sequence
MYTYIDLFAG PGGLCTGFKN AGFKPLIAVE MSDNTVKTYA RNHEAEVYSL QELLENKGRL 
EEILNINTDN TCLIHGDIRL VDNDIIVEIL QKKFKTNSVD VVAGGPPCES FSLAGKRIDG
DERDDLFKNM LRIASISNSK FIFFENVPGL LTKKSNNIKV FDVIVEEFDN YGYNLASTDK
NIIKCLAADY GVPQNRERVF LIGINKMYGE NPYIYPEKTH GEGRKFEYIS VSDALRYLPE
LNSGEGADIQ QITYNFEEDF RKGKISEAVY NYLKFIAGKE GYIPPHIKES IDDGLIEIHK
AVKHREKMIN RMSYIKQGEG MKKAAERLIN EGKEDIVRAY FPNKLYAARN RRLKANEPSF
TVTSHCLDEM VHPYNNRGLT PREAARLQSF PDWYVFEGEY VKFHSDPQQD KYEQCGDAIP
VLLVKALAEQ LKIALNIVSE RTSINK
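
The GC content field in this record is a percentage of G and C bases in the gene sequence. The following is a minimal sketch of how such a value could be computed from a sequence laid out in space-separated blocks of ten bases, as in the gene sequence of this record. The function name `gc_percent` is hypothetical, and the method the database used to derive its reported value (including any rounding) is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# The first two ten-base blocks of the gene sequence in this record.
example = "ATGTACACAT ATATAGATTT"
print(gc_percent(example))
```

Passing the full gene sequence instead of the short example would give a value that can be compared with the GC content field.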