Gene CPR_0594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0594 
Symbol 
ID4204911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp711069 
End bp712937 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content27% 
IMG OID642565154 
Productcholine/ethanolamine kinase family protein 
Protein accessionYP_697921 
Protein GI110801960 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0510] Predicted choline kinase involved in LPS biosynthesis
[COG4750] CTP:phosphocholine cytidylyltransferase involved in choline phosphorylation for cell surface LPS epitopes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0311348 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTATCAT ATGATATTGA AGTTTTAAAG ATTATAAATG ATGAAGATAA TATAAGTCAA 
AGAAGATTAG CAGAAATGTT AAATCTTTCT TTAGGTAAAA TAAATTCTTT ACTTAAGGAA
CTATTGACTA AAGAATACAT AGTTAAAATT GATATAGATA AGAGAAATGT TAAATATGAA
TTAACTGAGG CAGGAATTGC ATTACTAGAA AATTCATTAG ATAAAGTTAA AAGCACAAAA
TTATCAATAC ATAAGCATAA ATTTCATAAA GTTAAACAAG CTATTATTTT AGGAGCTGGT
GGTAAAGGTG ATTTTGGAAA GCCAGCTGGT TTTTTAGAGC TTGAAGAATT CAGAATAATT
GATAGAATAA TTGATATTTT GAAATCAAAT GGTATAGAGA AGATTGTTAT AGTAAATGGA
TATAAGAAAG AATACTATGA AGAATTAGCT AAAAAAGATT CAACAATTTT TTGTGTAACT
AATTCAAATT ATAAATGGAC AGGAACTATG AGTTCTTTAG CTTTAGCTAA GGAATATATA
GATGATGATT TTATATTGGT AGAAAATGAT CTTGTTTTTG AAAGTAGAGC TGTAGAACAA
ATTATTAAAA ATGATAATAG AGATTGTGTT TTAATTACTA ATGAAAGTGG ATCAGGGGAT
GAGGCCTTCG TTGAAATAAG AGATGGCTAT CTTTTTAAAA TGTCAAAAGA TGTACACCAA
TTTAATAAGA TTGATGGAGA AATGATTGGT ATAAGCAAAA TATCTTATAA ATTATTTAAA
ATGATGTTAG AAGAATTTAA ACATAATATA AATCCATATA TGAATTATGA GTACACTATG
TTAGATGTTG CTAGAAATTA TAAGGTTGGT TATGAAAAGA TAAATGACCT TGTTTGGGGA
GAAATTGACA ATAAAGATCA ATATGAAAAA GTAAAGAAAC ACATTATACC TATGATAAGA
AGAAAAGAAA TGCAATATAA AATAGATCAA GTAAAAAGTG CTATAGTTAA TGGATTAAAG
GTTTCAGAAG AGGAAGTTAA AGAAATAGTA CCTGTAGGAG GTATGACTAA TAAAAACTAT
AAGGCTTTTG TAAATGACAA AGCTTATATA GTAAGAATAC CAGGATTAGG TACAAGCTCT
ATGATTAATA GAAGAGATGA AATGATAAAC TCTAAACTAG CAGCAGATGA AGGAATAGAT
GCTAAAATAC TTTTCTTTGA TGAAGAGTCT GGAGTAAAGA TTGCAGAACT TATAGAGGGA
GCAGAAACTC TTAATCCTGC AACAGCTAAA AAGAAAGAAA ATATGGAATT AGTGGTAGGA
GCATTAAGAA CTTTACATAA TTCAGATATA AAAATGGAAA ATAGATTTAA TGTTTTTGAG
AAAATTGAGG AGTATGAGAG TCTTGTTAAA AAGGTTAACG GTACTCTTTT TGAAGATTAT
TATGAAATAA AAACAAGAGT ATTAAAGTTA GAAAAAGTTT TAGAAGATAA TGGAATGGAA
ATAAAACCAT GTCATAATGA TACCGTTCCA GAGAACTTTG TTAAAGATAT TAATGAAAGA
ATGTATCTAA TAGATTGGGA GTACAGTGGA TTAAATGATC CTATGTGGGA TTTAGCTGCT
CACTCAATTG AATGTGATTT TTCAGAGGAT GATGAAGAAT TATTCTTAAA TCTATATTTT
AACAATTTAA TTGAAGATAA GCATAAGATA AGAATTCTTG TCTATAAAAT ATGTCAGGAC
TTTTTATGGA GCATATGGAC TATTTTAAAA GAAGCTCAAG GTGATGATTT TGGAACTTAT
GGTATAGATA GATATAATAG AGGAAAGAAA AATTTAGAAT TATTAGATAA AATTTTAATG
GGGCAATAA
 
Protein sequence
MLSYDIEVLK IINDEDNISQ RRLAEMLNLS LGKINSLLKE LLTKEYIVKI DIDKRNVKYE 
LTEAGIALLE NSLDKVKSTK LSIHKHKFHK VKQAIILGAG GKGDFGKPAG FLELEEFRII
DRIIDILKSN GIEKIVIVNG YKKEYYEELA KKDSTIFCVT NSNYKWTGTM SSLALAKEYI
DDDFILVEND LVFESRAVEQ IIKNDNRDCV LITNESGSGD EAFVEIRDGY LFKMSKDVHQ
FNKIDGEMIG ISKISYKLFK MMLEEFKHNI NPYMNYEYTM LDVARNYKVG YEKINDLVWG
EIDNKDQYEK VKKHIIPMIR RKEMQYKIDQ VKSAIVNGLK VSEEEVKEIV PVGGMTNKNY
KAFVNDKAYI VRIPGLGTSS MINRRDEMIN SKLAADEGID AKILFFDEES GVKIAELIEG
AETLNPATAK KKENMELVVG ALRTLHNSDI KMENRFNVFE KIEEYESLVK KVNGTLFEDY
YEIKTRVLKL EKVLEDNGME IKPCHNDTVP ENFVKDINER MYLIDWEYSG LNDPMWDLAA
HSIECDFSED DEELFLNLYF NNLIEDKHKI RILVYKICQD FLWSIWTILK EAQGDDFGTY
GIDRYNRGKK NLELLDKILM GQ