Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_119491 |
Symbol | EtnK |
ID | 5000455 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 434862 |
End bp | 436501 |
Gene Length | 1640 bp |
Protein Length | 345 aa |
Translation table | |
GC content | 42% |
IMG OID | 640415876 |
Product | predicted protein |
Protein accession | XP_001416386 |
Protein GI | 145343558 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0510] Predicted choline kinase involved in LPS biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0654217 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGCTT TCGTCCTTTC CCGACCGGAA CGACCAAGTT ATTGCAAAGT TCGACAAACT CGACGGAGTC CAAGCGCAAC TGCTTTCTCG AAAAAAGTAG CAACATCACT AAGTGTGGAA GATGTTGACG CCATCAAAAG TATTTTAAAT TACAAGCATT CGGTGAAGTA AGTTACTCAT ATCCGCCGCC GATCTTGGAT ATTCTCAATA CCTAAACTTT TTCTGTTAGA CTCGAGTGCG CACAGTTCAC AGAAGGCTTC TGCAACCAAG TTTTCAAGGT ATCCGATACG AAACGGCGCT ACATTATTTA AAAACTAAAA TTCACATCTG CACAAGGTCT CATGTGGAGG CGAAACCTTT GTAGTAAAAA AGTACAGCAA GTTGTCAAAA CTGCGCTGTG ACTTACTGAG GTAGGCGTGA AGTACACAGT ACATTTCAAC TGTCTCAAGA GCGCGCAGTT GCATCGAGAT GCATAAAACT TTGGCAGAAC GTGGCATTTG TCCGCATTAT ATTTCACACG CCGATGATAT CATCGTTACT GAGTACTTGG ATGGCCGAGT TCTCCGAGAA GAAGATATGA AGGAACTTTC GTTCTGCAAG TCAACTGCTA AATTGTAAGC ACACTCTCGA ATCTTTGAAC AAGAATGTTA AATCTTTTGT CACAATATCG TAGAATAAGT AGGCTGCACT CAGTGCACGT AGAGGGTGAA TCATTTGAGA TCTTGAAACC TTTACTTCCC CTGACTTGTG AAGGACGTGA GCGGGCACTC ATATGGAAAT GGTTTCATCA AATGCTGATG CAGTTGCGAC CACTTGAGGT AGTGTCGAAA TCCCCGGATG TTAATGAGAC TAAGATTTTC AAAGGGGGGC ATGATCGCTG GTGTTGGGGT CAAGGTGCGG CGCAAAGCGC AAAGTACCAG TTGGTGGTCG CTCAAGTTGA CAAGTTGACA TTGCTTTAGG AATTAGAGGA TGAAGTGTTC AAAGTGGAAT CTTTCTTCAA GGCGTGTATA TGAGTGTCCA AAATTCAATA GTATCTCACA AAATATCCAG AGCGTTCACT TACCTATCTG TTTCTGTCAC GGTGACCTGA AGCCATCGAA CGTAATCTAC CAACAGGACC GAAACTTCAA GGTACACCAT AAACTTCACG TTGAAGTCAC TTACTTACAC CTCATACCTT CGTTTCACTA AAGTTGATCG ACATTGATTT AGCCGGCCCA AATTATCGTG GATTCGATAC AATGAAATTG TTTCGAACTA CGAACTCCTT CTACGATGAA AGTTTGCTAT CATTTTTGCA AGAGTACCAG GCGGAAAAGA ATTCAGAAAT GAACGTTGAA GGTGTTCTAT TTCTGCTTGT AGTTAGGATT AGATTAACAC CTCTGCAGGA CTATTTTATG AGGCTCAAAT GTGCGAGGCA CTCACATGGC TCGAGGTAAG CTGCTTGCAC TCGAACTATT AAAGACGCAA TCGCCTATTA AGTATTCCCG AAGGCAGCTT TATTTTTCGC AACACTAGTT CCCATGAATG GAGAGACGTC TCAAAGAAAT CTTTCCCTAT TTGAAGATAG GTGGTTACAT TACAAGCAGA CACAGTGGAA ATTTGTTTAC TATGGAAAAC TTCTCAAGGA CATCGGTTAG
|
Protein sequence | MHAFVLSRPE RPSYCKVRQT RRSPSATAFS KKVATSLSVE DVDAIKSILN YKHSVKLECA QFTEGFCNQV FKVSCGGETF VVKKYSKLSK LRCDLLSCIE MHKTLAERGI CPHYISHADD IIVTEYLDGR VLREEDMKEL SFCKSTAKLI SRLHSVHVEG RERALIWKWF HQMLMQLRPL EGGMIAGVGV KELEDEVFKV ESFFKSVHLP ICFCHGDLKP SNVIYQQDRN FKLIDIDLAG PNYRGFDTMK LFRTTNSFYD ESLLSFLQEY QAEKNSEMNV EGLFYEAQMC EALTWLEAAL FFATLVPMNG ETSQRNLSLF EDRWLHYKQT QWKFVYYGKL LKDIG
|
| |