Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_24922 |
Symbol | |
ID | 5003284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 283360 |
End bp | 286450 |
Gene Length | 3091 bp |
Protein Length | 888 aa |
Translation table | |
GC content | 60% |
IMG OID | 640418705 |
Product | predicted protein |
Protein accession | XP_001419332 |
Protein GI | 145349834 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0784] FOG: CheY-like receiver |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.123273 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.39046 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCGCGACGC GCGACGCGCG ACGACGACGC GACGACGCGA CGGTGCGCGG AGAAAAGATT CACGCGTCGG ATTCAGGGAT CGGCGCGGGC GCGACGGCGG TCGCGCGCGG GAAGGCGCGC GCATTCATTC GTCCGCGGCA CCCGCGCGAC GGGCGAAAAG CGTCGGGGAT TCGATTCGAT TCGACGGCGG CGCGCGACGC GCGACGGGGT GAGGGCGACG CGCGATCGGT GGAGACAGGG AGAAATGATT TCGCGGCGTC ATGGCGCGCG CGAGCGATGT CGGAGAGGCG CCGAGCGAAC GCGAGATCGC GGGGGCGTAC GGCGCGGCGG CGACGCTGTT GGGGTCGAGC GCGGGGGCGC GCGGAGGCGC GCGCGCGCGG CGAGGGGAGA TTCCGACGTC GCGCGTGGGA GCGCTCGTCA GAGCGCTCGG GGCGCACGTG GGGGCGCACG AAGACGATAT GCTGCGACGG TGCGAAAAGA TTGTCGATCC CGAGGGGCGA GGGAGCTTCG CGCTCGGGCG GCTGCGAGAC GCGCTCGCGG CGCTCAGGGA CGAGGGGCGA TTGCCGACGC GCGAGGAGGA AGGAGGGAAA GGGACGGGAG AGGTGGCGAT GACGACGCCC GTGGTTTCGA CGGGGAGACG TGCGACGCGC GCGACGGCGA CGGCGACGAC GACTACGGAT GATCGGAAGT CGGTGTTGAT CGTGGACGAC GTGGCGTCGG TGCGAAAGTT TCAGGCGGGG CAAATCGGAA AGATGGGTTA TCAAAGCATA GAGGCGGGCG AGGGCGCGGA GGCGTTGCGA TTGTACGGAG AAAATCACGC GGAGATTCAA GGGATTTTGA TGGATTTGAT GATGCCCACG ATGGATGGAT ACGAAATCGC GACGGGGATT CGAGCGATGG AGGAGAAGCA GGCGCTCAGG CGTATGCCCA TCATCGCCGT GACGTCGCTC ACGGACCAAG AGCTCAAAGA ACACGCCAGT TCTGTGGCAT TTGACTGTCA CGTCGATAAA CCGACTTCAA TGTCCAAGCT GGCCACGATT TTTGAATCCA TGCGCATGGC GCCGAGCTTG AGTGCCGCCG AGTTGCAAGC CATCGGCGCG GCGAACGCCG CAAAGCTGGA TCCTCACGGG ACGAAGCGTT CGCACCTGCC GGTGTCGAGC AGCGACACGA CTTCGGACGA TGGCAAACAC GGCGGCTCTA ACGATGGGTC AAGCGACCAA AACGGTCGCA AGGGATCGAA TAGCGACGCA GATCCAGACG CGCGATCGAG CGGGGATTCG GGTAACGAAC AACGATACGT CTCGCAGCGC GCCGACGGGC CTTGCAATAG TTCGAAGCGA CCGGGCGCTA CCGCGTCTGG TGCGACAGCG GTCGCACCGA AAAAGACCGG TACGAGTGAC AATGGTAGTG GTGGTAACAG CAACAACGGC TCGAGCGACG CCTCGAGACA AGGTGTCGCC GACGACGCAG AAGAAAAGAG CACAGTGGAT AAGGCGGCTC CGGCGGACAG GAACACGACA AAACAGCTAT CACCGGCGCG AGGCACGCAG ACGGGGACGA AAGCGCACGA GTCAACAAAA ACCGACACAG CAGTTGCTGC GGAAGCGATG GGGAAAACGA CTACGACGCA GGATCGAACG ACGACTGGCG TCAACGCCGT CGACGCGGGT TCCAAGCCGG TGGAAAGTTC GCATCGCCCG TGTGCAAGAT GCGGTTCGGA AAAGACACGA TTTTGTTACT ACAACAATGG GTTACCGACA CAACCTCGAC ATTACTGTCG ATCGTGCCAA AGGTACTGGA CCGAGGGCGG AACGCAGCGC AATTTACCAA AGGGAAGTGG TCGCAGAAGG GTAGAACGAC CCGACGGTGC GGCAGCGCTC ACCGCTCCCG TGAAAGATGC CGCGAAGAAA AACGCTTTGA GTACCGCCGC TGCTATCGCG CACGGTGCCA ACGCACAAAA CGAGATGCAA CGCGCGATAT TGGTACTCAT GACTCAAGTT GTCGGCTTCG ATGTGGACCA TGCGGCGAAT AACGCCGGGA TGGTAGCGTC GCGCGCTGGA GAAGAAGTCG CGCAAATCGT GCTCCAAAAT CTTGGTCATA GCTCAGAAGC GATAGAAATC GCTGTGACAT TAGCCAAAAC CACGGGATGG CGCATCGGAA TATCCGTTTC AGCTGTCGCC ACCGCTGCAG TAAGTCAAGG CTTGAATCAG GCCGAGATTG CGTCTTTGAT TTCCACGCAG TTGCCATTTT TGGCCAACGC GCTTGTTCGC GAGGTTTCCG AACAGGTGAA AACTATGGAA CTTTCGAGGA AATCGCCGAC GAACGACTCT GACTCTGGAG GCTCTGGCGG AACGGGTGAA AATGGTGGAA AGAGCGCGAC GAGTGCGAAG TCCAATTCCA CCTTGACCGG AGTCAAGGCT GAACCAGCTA CTGTTGCGAG CGTTCAGGCA CTGCAGACGC AGGTGACGCA ACGTTGGCTG AATGACTTGC AATACGTAGA TGATGGGAAC GCCCGTCAAG CTGCGAGTGC GAGAGTGGGC GCAGTGACCG TGCAAAGACC GTCGGCGAGC GGCTCGCAAC CGCAGCCTCA AGGTTCTGGA GCAAGCAACG CATCCGCCAT GCCTTGGACG CAAGGGCAAA CCTCGCGCGT GACTCCAGTA TCATCCGCGT CTGGATTTCA ACCCGCCTTG ACAAGTGCTT TTACACCATC CATGGCAGCA CCGCGAATGG CTATGAACAT GTTTGGTGCC CACCCGATGA TGAATCAGCC CATGAACCCA GCTTGGCTAT CGATGATGAA TCGTTTTGGT TCTAGCGTTC AGCCGAGCGG CGCCCCGTCA CGTGGACCGG GAACGCCTCA GGCCCACGAG GCGTTCATGA AGGCATTGGA CACCTTGGGT GGTGGATCTT CCGAACCACC GAACTAAAAC CGTCACAAAT CAATAAATTC AATTTTCCGA ATTCTCTTCC AGAAGAGTTG TATACCGCCT TTTACTTTTA ACACCATCGA GCTTGGCGCG CATATATGTA TTTCGATCTC TATTCTTTCG TTCTGGGAGC CGCCATGGCT CTCCGGCGTG TTATTAATAA A
|
Protein sequence | MARASDVGEA PSEREIAGAY GAAATLLGSS AGARGGARAR RGEIPTSRVG ALVRALGAHV GAHEDDMLRR CEKIVDPEGR GSFALGRLRD ALAALRDEGR LPTREEEGGK GTGEVAMTTP VVSTGRRATR ATATATTTTD DRKSVLIVDD VASVRKFQAG QIGKMGYQSI EAGEGAEALR LYGENHAEIQ GILMDLMMPT MDGYEIATGI RAMEEKQALR RMPIIAVTSL TDQELKEHAS SVAFDCHVDK PTSMSKLATI FESMRMAPSL SAAELQAIGA ANAAKLDPHG TKRSHLPVSS SDTTSDDGKH GGSNDGSSDQ NGRKGSNSDA DPDARSSGDS GNEQRYVSQR ADGPCNSSKR PGATASGATA VAPKKTGTSD NGSGGNSNNG SSDASRQGVA DDAEEKSTVD KAAPADRNTT KQLSPARGTQ TGTKAHESTK TDTAVAAEAM GKTTTTQDRT TTGVNAVDAG SKPVESSHRP CARCGSEKTR FCYYNNGLPT QPRHYCRSCQ RYWTEGGTQR NLPKGSGRRR VERPDGAAAL TAPVKDAAKK NALSTAAAIA HGANAQNEMQ RAILVLMTQV VGFDVDHAAN NAGMVASRAG EEVAQIVLQN LGHSSEAIEI AVTLAKTTGW RIGISVSAVA TAAVSQGLNQ AEIASLISTQ LPFLANALVR EVSEQVKTME LSRKSPTNDS DSGGSGGTGE NGGKSATSAK SNSTLTGVKA EPATVASVQA LQTQVTQRWL NDLQYVDDGN ARQAASARVG AVTVQRPSAS GSQPQPQGSG ASNASAMPWT QGQTSRVTPV SSASGFQPAL TSAFTPSMAA PRMAMNMFGA HPMMNQPMNP AWLSMMNRFG SSVQPSGAPS RGPGTPQAHE AFMKALDTLG GGSSEPPN
|
| |