Gene CPR_2342 details

Gene Information

Locus tag: CPR_2342
Symbol:
ID: 4204772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2570377
End bp: 2571447
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 32%
IMG OID: 642566892
Product: butyrate kinase
Protein accession: YP_699607
Protein GI: 110802483
COG category: [C] Energy production and conversion
COG ID: [COG3426] Butyrate kinase
TIGRFAM ID: [TIGR02707] butyrate kinase
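
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
# Values copied from the record above.
START_BP = 2570377
END_BP = 2571447


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Both results agree with the Gene Length and Protein Length fields.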


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATATA AATTATTAAT AATAAATCCA GGGTCTACAT CAACTAAAAT TGGTGTTTAT 
GAAGGTGAAA AAGAAATCTT AGAAGAAACT TTAAGACATT CAGCAGAAGA AATACTAAAA
TATGATACAA TATTTGATCA ACTTGATTTT AGAAAAGAAG TTATTTTAAA GGTATTAAAA
GAAAAAGGTA TTGACATAAA TGAGTTAGAT GCTGTTGTTG GAAGAGGTGG AATGCTTAAG
CCAATAGAAG GTGGAACTTA TGAAGTCAAT GAAGCTATGG TTGAGGACTT AAAAATTGGG
GTTCAAGGAC CACATGCTTC AAATTTAGGT GGAATATTAT CTAATGAAAT AGCAAAAGAA
ATTGGTAAGA GAGCATTTAT AGTAGATCCA GTTGTTGTTG ATGAAATGGA AGATGTAGCA
AGATTATCAG GAGTTCCAGA ATTACCTAGA AAAAGTAAAT TCCATGCATT AAATCAAAAG
GCAGTTGCTA AGAGATATGC AAAAGAACAT AATACTTCAT ATGAAGATGT TAATTTAATA
GTCGTTCATA TGGGGGGCGG AGTTTCAGTA GGAGCACATA GAAAAGGTAG AGTTATAGAT
GTAAATAATG CATTAGACGG TGATGGACCA TTTTCACCAG AAAGAGCAGG TGGAGTTCCT
TCAGGTGAAT TATTAGAAAT GTGTTTCTCA GGAAAGTATA GCAAAGAAGA AGTTTATAAA
AAGTTAGTTG GAAAAGGCGG ATTTGTTGCT TATGCTAACA CAAATGATGC AAGAGATTTA
ATAAAGCTAT CACAAGAGGG TGATGAAAAA GGCTCATTAA TATTTAATGC TTTCATATAT
CAAATAGCAA AAGAAATAGG ATCAATGGCT GTAGTTTTAG ATGGAGAAGT TAATGCTATA
GTATTAACTG GTGGAATTGC ATATAGTGAT TATGTAACTA ATGCTATAAA TAAAAAAGTA
AAATGGATTG CACCTATGGT TGTATACGGT GGAGAAGATG AACTTTTAGC TTTAGCACAA
GGAGCTATAA GAGTTTTAGA TGGAGTTGAA GAAGCAAAGA TATATAAATA G
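
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of how a GC percentage could be computed from such a listing. The helper names are hypothetical, and to keep the example short only the first row of the listing is used, so the result is not expected to match the 32% reported for the full gene.

```python
def clean(seq_text: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in a (possibly blocked) nucleotide string."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First row of the gene sequence listing above.
first_row = "ATGGCATATA AATTATTAAT AATAAATCCA GGGTCTACAT CAACTAAAAT TGGTGTTTAT"

print(len(clean(first_row)))  # 60
print(gc_percent(first_row))  # 25.0
```

Passing the full listing to `gc_percent` instead of `first_row` would give the value to compare with the GC content field.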
 
Protein sequence
MAYKLLIINP GSTSTKIGVY EGEKEILEET LRHSAEEILK YDTIFDQLDF RKEVILKVLK 
EKGIDINELD AVVGRGGMLK PIEGGTYEVN EAMVEDLKIG VQGPHASNLG GILSNEIAKE
IGKRAFIVDP VVVDEMEDVA RLSGVPELPR KSKFHALNQK AVAKRYAKEH NTSYEDVNLI
VVHMGGGVSV GAHRKGRVID VNNALDGDGP FSPERAGGVP SGELLEMCFS GKYSKEEVYK
KLVGKGGFVA YANTNDARDL IKLSQEGDEK GSLIFNAFIY QIAKEIGSMA VVLDGEVNAI
VLTGGIAYSD YVTNAINKKV KWIAPMVVYG GEDELLALAQ GAIRVLDGVE EAKIYK
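
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the standard NCBI ordering of bases (T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard table and differs in which codons may act as start codons, so this mapping is assumed to be sufficient here. The function name `translate` is illustrative, and only the first and last rows of the gene listing are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq_text: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    seq = "".join(seq_text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last rows of the gene sequence listing; both start in frame.
first_row = "ATGGCATATA AATTATTAAT AATAAATCCA GGGTCTACAT CAACTAAAAT TGGTGTTTAT"
last_row = "GGAGCTATAA GAGTTTTAGA TGGAGTTGAA GAAGCAAAGA TATATAAATA G"

print(translate(first_row))  # MAYKLLIINPGSTSTKIGVY
print(translate(last_row))   # GAIRVLDGVEEAKIYK*
```

The two outputs match the beginning and the end of the protein sequence listed above, with the trailing `*` standing for the TAG stop codon.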