Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_0594 |
Symbol | |
ID | 4204911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 711069 |
End bp | 712937 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 642565154 |
Product | choline/ethanolamine kinase family protein |
Protein accession | YP_697921 |
Protein GI | 110801960 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0510] Predicted choline kinase involved in LPS biosynthesis [COG4750] CTP:phosphocholine cytidylyltransferase involved in choline phosphorylation for cell surface LPS epitopes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0311348 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTATCAT ATGATATTGA AGTTTTAAAG ATTATAAATG ATGAAGATAA TATAAGTCAA AGAAGATTAG CAGAAATGTT AAATCTTTCT TTAGGTAAAA TAAATTCTTT ACTTAAGGAA CTATTGACTA AAGAATACAT AGTTAAAATT GATATAGATA AGAGAAATGT TAAATATGAA TTAACTGAGG CAGGAATTGC ATTACTAGAA AATTCATTAG ATAAAGTTAA AAGCACAAAA TTATCAATAC ATAAGCATAA ATTTCATAAA GTTAAACAAG CTATTATTTT AGGAGCTGGT GGTAAAGGTG ATTTTGGAAA GCCAGCTGGT TTTTTAGAGC TTGAAGAATT CAGAATAATT GATAGAATAA TTGATATTTT GAAATCAAAT GGTATAGAGA AGATTGTTAT AGTAAATGGA TATAAGAAAG AATACTATGA AGAATTAGCT AAAAAAGATT CAACAATTTT TTGTGTAACT AATTCAAATT ATAAATGGAC AGGAACTATG AGTTCTTTAG CTTTAGCTAA GGAATATATA GATGATGATT TTATATTGGT AGAAAATGAT CTTGTTTTTG AAAGTAGAGC TGTAGAACAA ATTATTAAAA ATGATAATAG AGATTGTGTT TTAATTACTA ATGAAAGTGG ATCAGGGGAT GAGGCCTTCG TTGAAATAAG AGATGGCTAT CTTTTTAAAA TGTCAAAAGA TGTACACCAA TTTAATAAGA TTGATGGAGA AATGATTGGT ATAAGCAAAA TATCTTATAA ATTATTTAAA ATGATGTTAG AAGAATTTAA ACATAATATA AATCCATATA TGAATTATGA GTACACTATG TTAGATGTTG CTAGAAATTA TAAGGTTGGT TATGAAAAGA TAAATGACCT TGTTTGGGGA GAAATTGACA ATAAAGATCA ATATGAAAAA GTAAAGAAAC ACATTATACC TATGATAAGA AGAAAAGAAA TGCAATATAA AATAGATCAA GTAAAAAGTG CTATAGTTAA TGGATTAAAG GTTTCAGAAG AGGAAGTTAA AGAAATAGTA CCTGTAGGAG GTATGACTAA TAAAAACTAT AAGGCTTTTG TAAATGACAA AGCTTATATA GTAAGAATAC CAGGATTAGG TACAAGCTCT ATGATTAATA GAAGAGATGA AATGATAAAC TCTAAACTAG CAGCAGATGA AGGAATAGAT GCTAAAATAC TTTTCTTTGA TGAAGAGTCT GGAGTAAAGA TTGCAGAACT TATAGAGGGA GCAGAAACTC TTAATCCTGC AACAGCTAAA AAGAAAGAAA ATATGGAATT AGTGGTAGGA GCATTAAGAA CTTTACATAA TTCAGATATA AAAATGGAAA ATAGATTTAA TGTTTTTGAG AAAATTGAGG AGTATGAGAG TCTTGTTAAA AAGGTTAACG GTACTCTTTT TGAAGATTAT TATGAAATAA AAACAAGAGT ATTAAAGTTA GAAAAAGTTT TAGAAGATAA TGGAATGGAA ATAAAACCAT GTCATAATGA TACCGTTCCA GAGAACTTTG TTAAAGATAT TAATGAAAGA ATGTATCTAA TAGATTGGGA GTACAGTGGA TTAAATGATC CTATGTGGGA TTTAGCTGCT CACTCAATTG AATGTGATTT TTCAGAGGAT GATGAAGAAT TATTCTTAAA TCTATATTTT AACAATTTAA TTGAAGATAA GCATAAGATA AGAATTCTTG TCTATAAAAT ATGTCAGGAC TTTTTATGGA GCATATGGAC TATTTTAAAA GAAGCTCAAG GTGATGATTT TGGAACTTAT GGTATAGATA GATATAATAG AGGAAAGAAA AATTTAGAAT TATTAGATAA AATTTTAATG GGGCAATAA
|
Protein sequence | MLSYDIEVLK IINDEDNISQ RRLAEMLNLS LGKINSLLKE LLTKEYIVKI DIDKRNVKYE LTEAGIALLE NSLDKVKSTK LSIHKHKFHK VKQAIILGAG GKGDFGKPAG FLELEEFRII DRIIDILKSN GIEKIVIVNG YKKEYYEELA KKDSTIFCVT NSNYKWTGTM SSLALAKEYI DDDFILVEND LVFESRAVEQ IIKNDNRDCV LITNESGSGD EAFVEIRDGY LFKMSKDVHQ FNKIDGEMIG ISKISYKLFK MMLEEFKHNI NPYMNYEYTM LDVARNYKVG YEKINDLVWG EIDNKDQYEK VKKHIIPMIR RKEMQYKIDQ VKSAIVNGLK VSEEEVKEIV PVGGMTNKNY KAFVNDKAYI VRIPGLGTSS MINRRDEMIN SKLAADEGID AKILFFDEES GVKIAELIEG AETLNPATAK KKENMELVVG ALRTLHNSDI KMENRFNVFE KIEEYESLVK KVNGTLFEDY YEIKTRVLKL EKVLEDNGME IKPCHNDTVP ENFVKDINER MYLIDWEYSG LNDPMWDLAA HSIECDFSED DEELFLNLYF NNLIEDKHKI RILVYKICQD FLWSIWTILK EAQGDDFGTY GIDRYNRGKK NLELLDKILM GQ
|
| |