Gene Information |
Locus tag | OSTLU_42574 |
Symbol | |
ID | 5003330 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 10498 |
End bp | 11802 |
Gene Length | 1305 bp |
Protein Length | 421 aa |
Translation table | |
GC content | 58% |
IMG OID | 640418751 |
Product | predicted protein |
Protein accession | XP_001419042 |
Protein GI | 145349233 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0510] Predicted choline kinase involved in LPS biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.202848 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCACCG ACCGAAGACA CTCGAGCGCG AGCGAAGAAA ACTCGGGGGC GAGCACGACA GACTTTCAAG ACGAAGACGC GCCGACGTCG AGAAGCGCGA CGCCCGCGCC CGGACGATTG AACGTCGAGC TCGACGTCGA TCGCGAACAA CGCGATGAGT ATCAAGGCGT GAAATCCATC GTGCGAAACA CCGTGCGAGG GTGGGCAAAC GTCGAAAACG CGGCGCTCGA GGTGAGCCCG GTGCGAGGCG GGATCACGAA CGCGCTGTTC AAGGTTCGTC TGGCGCAAGA CGCGGCGCCG ACGACGACGA AGGATCCGAT CGCACGCGCG GTGGTGGTGA GAGTTTTTGG CAAGGGTACC GATCAATTCA TCACTCATCG CAAAGTACAA GGCGAGACGT CGCACGTTTT GAACGAACAC GGGTTCGGGG CAAAAGTGCT CGGCGTTTTT TCAAATGGGT TGGTTGAAGA GTTCATCGAA GCCGAGAGTG TGGCTCCGGA GGAGTTGGCG AACGGAGGGA TTTTGCTTCG ACGAGTCGCG GCGCAGATGC GACGCTTGCA CAAGGAAGTG GCGCCGGATT TAGTGCCTCG CGCCGCGGCT GGCGAGACCA TCGCGCGCGC CCGAGCCAAC GCTATCTGGG ACACGCTTCA GTTGTGGTTC GACTTGGCGT ACGGTGTTGC CAATGATCCG ACCATTTTCA AGAATGACGC GCGCAAAGAG TCGATTTTGG CATCGTTGAA GATCGATTCG GAATCGCGTC AAATGCTGTT CGAAGTCATT CGCGCGAGGT GCGAAGCCGT GAACAGTCAG ACAGTGTACT GTCACAACGA CATTCACGCC GGTAACTTTT TGCTGAACAG AAAGACGGAC AACCTGACGC TCATCGATTA CGAGTACGCC GACTACGGTC CCCGTGCGTT TGACATGGCC AATCTGTTTT GCGAATTCGC CGGGTTCGAG TGCAACTACG ATCAGTTTCC GACGTGCGAA CTTCGCCGCG AGTTTTACTC GGCGTACTTG CACACCACGG TCGATGCGGA GATTGACGCG CTCGAAGCGG AAGTCGCGGC GTGGACGCCC GTGACGCACG CATTCTGGGC GCTCTGGGCG GTGATTCAAG CCAAGTATAG CGCCATCGAT TTTGACTTTT TGGGTTTCGC CGCGATGCGC ATGAAGGTGT TTTACGCCTC TGCTCTCGCG CCGAGTGAGT GGGTGCCGAC GAACGCCGCG CTCGGTGGAC AGCACGGCAC GCCGGAGAAG AGTGTGGGTT GGAATGCCAC GGCTGAGGGA AACGTCGTGC TTTGA
|
Protein sequence | MVTDRRHSSA SEENSGASTT DFQDEDAPTS RSATPAPGRL NVELDVDREQ RDEYQGVKSI VRNTVRGWAN VENAALEVSP VRGGITNALF KVRLAQDAAP TTTKDPIARA VVVRVFGKGT DQFITHRKVQ GETSHVLNEH GFGAKVLGVF SNGLVEEFIE AESVAPEELA NGGILLRRVA AQMRRLHKET IARARANAIW DTLQLWFDLA YGVANDPTIF KNDARKESIL ASLKIDSESR QMLFEVIRAR CEAVNSQTVY CHNDIHAGNF LLNRKTDNLT LIDYEYADYG PRAFDMANLF CEFAGFECNY DQFPTCELRR EFYSAYLHTT VDAEIDALEA EVAAWTPVTH AFWALWAVIQ AKYSAIDFDF LGFAAMRMKV FYASALAPSE WVPTNAALGG QHGTPEKSVG WNATAEGNVV L
|
| |
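The record's coordinate and sequence fields can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the source database. It assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length of 1305 bp, and that the spaces in the sequence fields only separate 10-residue display blocks. The function names `gene_length`, `clean_sequence`, and `gc_fraction` are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-residue display blocks."""
    return "".join(grouped.split())


def gc_fraction(grouped: str) -> float:
    """Fraction of G and C bases in a (possibly space-grouped) DNA sequence."""
    seq = clean_sequence(grouped).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    print(gene_length(10498, 11802))

    # First two display blocks of the gene sequence, used as a short demo.
    demo = "ATGGTCACCG ACCGAAGACA"
    print(clean_sequence(demo))
    print(gc_fraction(demo))
```

Passing the full contents of the Gene sequence field to `gc_fraction` gives a value that can be compared against the GC content field. Because the gene is spliced, the gene length is not expected to equal three times the protein length.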