Gene CPF_0139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0139 
Symbol 
ID4202270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp165074 
End bp166159 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content30% 
IMG OID638081020 
ProductDNA-cytosine methyltransferase 
Protein accessionYP_694603 
Protein GI110799020 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00921515 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA TAGCAATTTC ATTTTTTGCA GGCGCTGGTG GATTGGATAT AGGAATACAT 
GAAGCTGGGT TTGATGTAAA ATTGAGTGTA GAATTAGAAG AAAAATACTG TGTGACATTA
AAACAAAATA ATCCTACATT TAATGTAGTA AATGGAGATA TTATGGATTA TTCAAAAGAA
AAAATATATA GTGATGCAGG ATTAAATTAT AATGATGAGA TTGATTTAAT ATTTGGTGGT
AGCCCATGTC AGAGTTTTAG TACAGCTGGT AAACGACAAG CTTTTTCGGA TGAAAGAGGA
AAGGCTATGT TAAAATTCAT TGAATTAATT GAAGAGGTAA AACCAAAAGC ATTTTTATTA
GAAAATGTAA AGGGGTTATT ATCAGCAACA TTAAAACATC GTCCTTTAAA TCAAAGGGGA
AAAGATTTTC CGCCATTAGA TGAAGATGAG GAAAATGGAA GTGCATTAAG GTATTTATTA
AATCAAGTCA AAGATTATAA CGTTGTATAT AAAGTGCTTA ATTCAGCTGA ATATGGAGTT
GCTCAAAAAA GAGAGAGAGT AATTTTTGTT GGAATAAGAA AAGATTTAAA CAAAGTATAT
GAATTTCCAA ATCCTACTCA TGGAGTAGGA AGAAAATATC CATTTGTTAC AGTTAATGAT
GTAATACAAG AGTTAGGAGA TATAAAACAT AATTATGTTA AGTATTCAGA GGAAAGATTA
AAATATATGA AGTTAATACC TAAAGGTGGA GGTAATTGGA GAGATTTAAA TGAGGATATA
GTTGAAAAGG CTATGGGGGG AGCATATAAA TCAGGTGGAG GTAAAACAGG ATATTTTAGA
AGAATAAAAG CTAATGAACC AAGTCCTACA TTACTGACAT CTCCAATACA AAAAAGTACA
AATATAGGGC ATCCGTATGA AGATAGACCT TTAAGTATAG AGGAATATAT CGCTATCCAA
GGATTTCCTA AAGGGTATAA AATAAATGGA ACAATTAATA ATAAATATAC TCAGATAGGA
AATGCAGTTC CAGTAAAATT AGCAAAAGTA TTAGGTGAAA AATTAATAGA TATCTTATAT
GAGTAG
 
Protein sequence
MSKIAISFFA GAGGLDIGIH EAGFDVKLSV ELEEKYCVTL KQNNPTFNVV NGDIMDYSKE 
KIYSDAGLNY NDEIDLIFGG SPCQSFSTAG KRQAFSDERG KAMLKFIELI EEVKPKAFLL
ENVKGLLSAT LKHRPLNQRG KDFPPLDEDE ENGSALRYLL NQVKDYNVVY KVLNSAEYGV
AQKRERVIFV GIRKDLNKVY EFPNPTHGVG RKYPFVTVND VIQELGDIKH NYVKYSEERL
KYMKLIPKGG GNWRDLNEDI VEKAMGGAYK SGGGKTGYFR RIKANEPSPT LLTSPIQKST
NIGHPYEDRP LSIEEYIAIQ GFPKGYKING TINNKYTQIG NAVPVKLAKV LGEKLIDILY
E