Gene CPR_2177 details


Gene Information

Locus tag: CPR_2177
Symbol:
ID: 4205002
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2402792
End bp: 2404555
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 29%
IMG OID: 642566727
Product: HemK family modification methylase
Protein accession: YP_699477
Protein GI: 110803251
COG category: [J] Translation, ribosomal structure and biogenesis
              [R] General function prediction only
COG ID: [COG2890] Methylase of polypeptide chain release factors
        [COG3872] Predicted metal-dependent enzyme
TIGRFAM ID: [TIGR00536] HemK family putative methylases
            [TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific
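
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the last codon of the CDS is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2402792
end_bp = 2404555

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (1764 bp) and Protein Length (587 aa).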


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000299232
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAAT GCCAAGTAGG CGGTCAAGCA GTATTAGAAG GCGTTATGAT GAGGGGATCA 
AAAGGAACAG CTACAGCAGT TAGAACTCCA GAGGGAGATA TAGAGGTTTC TTTTGAAAAA
ACTATACCAT ATACAAAGAA AAATAAAATT TTAGGACTAC CTTTTATAAG AGGATTTGTA
ACTCTTATAG AGTCTTTAAT TGTAGGATTA AAATCATTAA ATTATTCAGC AAGTTTTTTT
GATGATACAG AACCATCTAA ATTTGAAGAT TGGTTAAATA ATAAATTTGG TGAAAAAGCT
AATAATGTAA TAATGACACT TACAATTATG CTTTCCTTTG TATTTGCCAT AATTTTATTT
GTAGCAATAC CAACTGGAAT TACTTTTTTA CTTAAAAAAC TTAATATTCC AGATTGGAGT
TTAAGTGCTA TTGAAGGAAT CATAAGTATT GGTATGCTTT TAGGATACAT GTACTTAATG
GGAAAAGTAG ATGATATAGA AAGAGTATTT CAATATCATG GAGCAGAACA TAAGACTATA
TTCTGTTATG AGAATGAAGA TGAACTTACA GTTGAAAATG TAAGAAAATA TTCTAGATTT
CATCCTAGAT GTGGAACAAA CTTTCTATTT TTAGTTGCTA TTGTAAGTAT ATTTATATTT
TCCTTTACTA AATGGGATTC AGTTGCTCAG AGAACGGCTA TAAGAGTAGC GATGTTACCG
GTAATATCAG GAATAACTTA TGAACTTATA AAATGGCTTG GCAAATCTCA AGGAAATTTT
GCAAAGATAA TAGCAGCGCC GGGATTACAA TTGCAAAAAT TGACTACAAG GGAGCCTGAT
GATTTACAAA TTGAAGTAGC AATAGCTTCT TTAAGGAGGG CTGAAGGGTT GAAAGAACCA
AATAAAAAAG TTGGAGAATT ATTAAATTTA GGAAATGAAA CTTTAAAAGA AGTAGGTATA
GATACATATA TATTAGATAC TCAATTATTA TTAGGAAAAA TTTTAGAAAA AGATAAAATA
TGGCTTATAA CGAATAAAAA TGAAGAAGTT AAAAAGTCAG ATGAAATACA TTTCTTAAAT
TTATTAGAAA AAAGAAAATC AAAAATGCCT ATGCAATATA TTTTAGGAAC TTGCGAATTT
ATGGGATTAG ATTTTTATGT AGAAGAGGGA GTTTTAATTC CAAGAGGAGA TACTGAAATA
ATTGTAGAGG AAGTATTAAA CAATATAGAT GAAGATGCAG AAATTAATGT ATGTGATTTA
TGTTGTGGAA GTGGAGCTAT AGGCTTATCT TTAGCTAATT ATAGAAAAAA TATTATTGTA
GATTTAGTAG ATATAGATGA TATACCAGAA AAAGTTACAA GAAAAAATAT AAGAGAATTA
GAATTATCAA AAAGATGTGG CTTTATTAAG AGTGATCTTT TAAGTGAAGT CATTAAAAAA
GGAAATAAGT ATGATATTCT AGTTTCTAAT CCACCATATA TAAGAACGGA AGTCATAAAT
ACTTTAATGA AAGATGTTAA AGATTATGAG CCGCACTTAG CTTTAGATGG GGGAGAAGAT
GGTTTAATAT TCTATAGAAG AATTATTGAT GAATCTTTAG AAGTATTAAA AGAAAATGGT
ATATTAGCTT TTGAAATAGG ACATGATCAA GGTGAGGATG TTAAAAATCT TATGATTGAA
AAAGGATATT ACGATGTTAA GGTCATAAAA GATTTAGCTG GTTTAGATAG ATGTGTTATA
GGAAGAGTAA GCCTTGAAAG ATAG
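
The GC content field reported for this gene is the percentage of G and C bases in a sequence like the one above. The following is a minimal sketch of that calculation; `gc_percent` is a hypothetical helper, not the database's own method, and it assumes the sequence uses only the letters A, C, G and T with whitespace as block separators.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to format the sequence in blocks.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Demonstration on the first 20 bases of the gene sequence only.
print(gc_percent("ATGAGAAAAT GCCAAGTAGG"))
```

Passing the full gene sequence as a single string gives the whole-gene figure. The record reports 29%, presumably rounded.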
 
Protein sequence
MRKCQVGGQA VLEGVMMRGS KGTATAVRTP EGDIEVSFEK TIPYTKKNKI LGLPFIRGFV 
TLIESLIVGL KSLNYSASFF DDTEPSKFED WLNNKFGEKA NNVIMTLTIM LSFVFAIILF
VAIPTGITFL LKKLNIPDWS LSAIEGIISI GMLLGYMYLM GKVDDIERVF QYHGAEHKTI
FCYENEDELT VENVRKYSRF HPRCGTNFLF LVAIVSIFIF SFTKWDSVAQ RTAIRVAMLP
VISGITYELI KWLGKSQGNF AKIIAAPGLQ LQKLTTREPD DLQIEVAIAS LRRAEGLKEP
NKKVGELLNL GNETLKEVGI DTYILDTQLL LGKILEKDKI WLITNKNEEV KKSDEIHFLN
LLEKRKSKMP MQYILGTCEF MGLDFYVEEG VLIPRGDTEI IVEEVLNNID EDAEINVCDL
CCGSGAIGLS LANYRKNIIV DLVDIDDIPE KVTRKNIREL ELSKRCGFIK SDLLSEVIKK
GNKYDILVSN PPYIRTEVIN TLMKDVKDYE PHLALDGGED GLIFYRRIID ESLEVLKENG
ILAFEIGHDQ GEDVKNLMIE KGYYDVKVIK DLAGLDRCVI GRVSLER
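
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the record's translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in which codons may act as start codons). `CODON_TABLE` and `translate` are illustrative names, and the sketch does not handle alternative start codons, which here is not needed because the gene begins with ATG.

```python
from itertools import product

# Codons in the conventional T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to format the sequence in blocks.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGAAAAT GCCAAGTAGG CGGTCAAGCA"))
```

The first 30 bases translate to MRKCQVGGQA, matching the start of the protein sequence, and the final line of the gene sequence (GGAAGAGTAA GCCTTGAAAG ATAG) translates to GRVSLER followed by a stop codon, matching its end.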