Gene CPF_2872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2872 
Symbol 
ID4203625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp3140496 
End bp3141671 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content32% 
IMG OID638083739 
Productputative methyltransferase 
Protein accessionYP_697236 
Protein GI110800285 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAGTA AGTTTTATTT ACACAAAGGT AAAAACAAAA AAGCTGAACA AGGCAGACCT 
TGGATATATA TTGATGAAAT AAACGAATAT GATGGAGATT ATGAAAACGG AGATATAGTT
GAAGTTTACA ATCATAAAGG TTATTTCTTA GGAAAAGGCT ATATAAATGA CAGAAGTAAA
ATAACTATAA GAATAATGAC TAAAGATATA GATGAAGAAA TAGATGAGGA TTTCTTCAAG
AGAAGATTTA AAACTGCATG GGAATATAGA AAGAAAGTTA TAGATACATC TTCATGTAGA
TTCATCTTTG GAGAGGCTGA TTTCCTTCCT GGTTTAACAG TTGATAAATT TGAAGATTAT
TATGTAATTC AAATATCAAC TCTTGGAATG GATAAATATA GAGACCTAAT AGTTAAAATT
CTAGTTGAGG AATACGGTGC TAAAGGTGTC TATGAAAGAA GTGATATAAA AACTAGAGAA
ATAGAAGGTT TAGAGCAAAG AAAAGGCTTC TTAACTGAAC CATTTGATAC AGATATACAA
ATAGTTGAAA ATGGAGTTAA ATACATAGTT GACTTAGAAA ATGGTCAAAA AACTGGTTTC
TTCTTAGATC AAAAAGAAAA CAGAGCTGCA ATGCATAGAA TATGTAAAGG TATGGATGTT
TTAGATTGCT TCACTCATAC TGGCTCTTTT GCTTTAAATG CCGGTATAGC AGGTGCTAAA
TCAGTTTTAG GAATAGATGT ATCTCAACAC GCTGTAGACT GTGCTACTAG AAACGCTGAA
CTTAATAACC TTCAAGATAG GGTTAAATTT GAAAAGCATA ATGCCTTTGA TGTATTAGGA
GATTGGTCAA GAGAAGGAAA ACAATTTGGT GTTGTTATTT TAGATCCACC AGCTTTCACA
AAATCAAGAA ATACTGTTAA GCAAGCAATA AGAGGATATA AAGAAATAAA TCTTAGAGGA
ATAAAAATGG TTAAAGAAGG TGGTTACTTC GCTACATGCT CTTGTTCACA TTATATGGAT
GAAGAACAAT TAAAGAAAAC TGTAGCTGAG GCTGCTCATG ATGCAAGAAG AACTTTAAGA
CAAATAGAAG TTAGAACTCA AAGTGCAGAC CACCCTATAC TTTGGAACTC TGACGAATCA
TATTATTTAA AATTCTTCAT ATTCCAAGTA TTCTAA
 
Protein sequence
MASKFYLHKG KNKKAEQGRP WIYIDEINEY DGDYENGDIV EVYNHKGYFL GKGYINDRSK 
ITIRIMTKDI DEEIDEDFFK RRFKTAWEYR KKVIDTSSCR FIFGEADFLP GLTVDKFEDY
YVIQISTLGM DKYRDLIVKI LVEEYGAKGV YERSDIKTRE IEGLEQRKGF LTEPFDTDIQ
IVENGVKYIV DLENGQKTGF FLDQKENRAA MHRICKGMDV LDCFTHTGSF ALNAGIAGAK
SVLGIDVSQH AVDCATRNAE LNNLQDRVKF EKHNAFDVLG DWSREGKQFG VVILDPPAFT
KSRNTVKQAI RGYKEINLRG IKMVKEGGYF ATCSCSHYMD EEQLKKTVAE AAHDARRTLR
QIEVRTQSAD HPILWNSDES YYLKFFIFQV F