Gene CPF_2656 details

Gene Information

Locus tag: CPF_2656
Symbol: buk
ID: 4202880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2927971
End bp: 2929041
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 32%
IMG OID: 638083522
Product: butyrate kinase
Protein accession: YP_697036
Protein GI: 110800296
COG category: [C] Energy production and conversion
COG ID: [COG3426] Butyrate kinase
TIGRFAM ID: [TIGR02707] butyrate kinase
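
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2927971
END_BP = 2929041


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1071
print(expected_protein_length(length_bp))  # 356
```

Under those assumptions both results agree with the Gene Length and Protein Length fields listed in the record.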


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATATA AATTATTAAT AATAAATCCA GGGTCTACAT CAACTAAAAT TGGTGTTTAT 
GAAGGTGAAA AAGAAATCTT AGAAGAAACT TTAAGACATT CAGCAGAAGA AATACTAAAA
TATGATACAA TATTTGATCA ACTTGATTTT AGAAAAGAAG TTATTTTAAA GGTATTAAAA
GAAAAAGGCA TTGACATAAA TGAGTTAGAT GCTGTTGTTG GAAGAGGTGG AATGCTTAAG
CCAATAGAAG GTGGAACTTA TGAAGTCAAT GAAGCTATGG TTGAGGACTT AAAAATTGGG
GTTCAAGGAC CACATGCTTC AAATTTAGGT GGAATATTAT CTAATGAAAT AGCAAAAGAA
ATTGGTAAGA GAGCATTTAT AGTAGATCCA GTTGTTGTTG ATGAAATGGA AGATGTAGCA
AGATTATCAG GAGTTCCAGA ATTACCAAGA AAAAGTAAAT TCCATGCATT AAATCAAAAG
GCTGTTGCTA AGAGATATGC AAAAGAACAT AATACTTCAT ATGAAGATGT TAATTTAATA
GTCGTTCATA TGGGGGGCGG AGTTTCAGTA GGAGCACATA GAAAAGGTAG AGTTATAGAT
GTAAATAATG CATTAGATGG TGATGGACCA TTTTCACCAG AAAGAGCAGG TGGAGTTCCT
TCAGGTGAAT TATTAGAAAT GTGTTTCTCA GGAAAGTATA GCAAAGAAGA AGTTTATAAA
AAGTTAGTTG GAAAAGGCGG ATTTGTTGCG TATGCTAACA CAAATGATGC GAGAGATTTA
ATAAAGCTAT CACAAGAAGG TGATGAAAAA GGCTCATTAA TATTTAATGC TTTCATATAT
CAAATAGCAA AAGAAATAGG ATCAATGGCT GTAGTTTTAG ATGGAGAAGT TGATGCTATA
GTATTAACTG GTGGAATTGC ATATAGTGAT TATGTAACTA ATGCTATAAA TAAAAAAGTA
AAATGGATTG CACCTATGGT TGTATACGGT GGAGAAGATG AACTTTTAGC TTTAGCACAA
GGAGCTATAA GAGTTTTAGA TGGCGTTGAA GAAGCAAAGA TATATAAATA G
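
The gene sequence is printed in space-separated groups of 10 bases, so it needs to be joined before any calculation. The following is a minimal sketch of how a GC percentage like the one in the GC content field could be computed. The helper names are hypothetical, and the demo input is only the first line of the sequence, so its result is not expected to match the value for the whole gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a sequence printed in 10-base groups."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a short demo input.
first_line = "ATGGCATATA AATTATTAAT AATAAATCCA GGGTCTACAT CAACTAAAAT TGGTGTTTAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` gives the figure to compare against the record.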
 
Protein sequence
MAYKLLIINP GSTSTKIGVY EGEKEILEET LRHSAEEILK YDTIFDQLDF RKEVILKVLK 
EKGIDINELD AVVGRGGMLK PIEGGTYEVN EAMVEDLKIG VQGPHASNLG GILSNEIAKE
IGKRAFIVDP VVVDEMEDVA RLSGVPELPR KSKFHALNQK AVAKRYAKEH NTSYEDVNLI
VVHMGGGVSV GAHRKGRVID VNNALDGDGP FSPERAGGVP SGELLEMCFS GKYSKEEVYK
KLVGKGGFVA YANTNDARDL IKLSQEGDEK GSLIFNAFIY QIAKEIGSMA VVLDGEVDAI
VLTGGIAYSD YVTNAINKKV KWIAPMVVYG GEDELLALAQ GAIRVLDGVE EAKIYK
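
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino-acid string NCBI uses for the standard code. Translation table 11 assigns the same amino acids to codons as the standard table and differs in which codons may act as starts, which does not matter here because this gene begins with ATG. The `translate` helper is illustrative and assumes the input is in frame and on the coding strand.

```python
from itertools import product

# Codon order follows the NCBI genetic-code listing: bases in T, C, A, G order,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGGCATATA AATTATTAAT AATAAATCCA"))  # MAYKLLIINP
print(translate("GAAGCAAAGA TATATAAATA G"))           # EAKIYK
```

The two fragments reproduce the first ten and last six residues of the protein sequence above. Translating the full gene sequence the same way allows a residue-by-residue comparison.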