Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_42847 |
Symbol | |
ID | 5003266 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 233128 |
End bp | 236169 |
Gene Length | 3042 bp |
Protein Length | 984 aa |
Translation table | |
GC content | 62% |
IMG OID | 640418687 |
Product | predicted protein |
Protein accession | XP_001419319 |
Protein GI | 145349807 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.451507 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.305439 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATCG CGACGCCGGC GCGGGACGGC GGCGGCGACG GCGACGGGAC GAACGACGGC GACGGCGACG GACGCGAGAC GCGCGCGCCG AAGGCGCGCG CGGCGCGACG GGAGACGCGC GCGGATGAGG TGGTGTACGC GTGCGCGACG CGCGAGGGAG GGGTGTCGCT GACGCGGTCG ACGACGCGGC GAGGCGACGC GGCGACGACG TCGACGACGA CGGCGCTCGC GCGGGACGAG GGCGATGGGG AGGCGATGAC GTCGATCGCG ATGGATCGAA GCGGGACGAA GGTGTTTTGC GCGAGTCGGA GCGGACGCGT CGTGCGGTTG GACGCGATGG AGGATGGGAG GGGGGGGACG ACGATGCGGA GGACGAAGGC GTGGTCGCCG CACAAGACGT CGCCGGTGTT GGACATGTGC GTGGACGTCA CGGGGACGCT GTTGTGCACG GGGAGCGCCG ATCGAACGGC GCGAGTGTGG GACATCGAAC GAGGGTACTG CACGCACGCG TTTCGCGGGA AGCACGGAGG GGCGGTGACG GCGACGGCGT TTCATCCGAG CGTTAGAGAG GCGAGAGCGT TCACCGCGGC GGAGGATGGG TCGCTCGCGA TGTGGTCGCT CACGGGCGAG GCGGGCGTCG GGAAGAAGGG TAAGAAGGCC TCATCCGATG GCTGCGTCGC GTTCGTGGCG AACGCGCACG TGAGTGCGGT GACGTCGATT CGAATCGACG TCGAATCGAA CACGTTGCTC ACCGCGGGGC GGGACAAAAT AGTTCGAACG TTTGATTTAG ACACGCTCAA TCCGCGAACG ACGACGGCGG TTCACGAAAC GATCGAGGAT TGCGTGATTT TACGTCCGGA TTCAGCCATC GTTCGCGACT GCAAGGTGAA GCCGCCGCCG GGCGGGCGCG GAGTCATTTT CGCCGTCGTC GGCGACGGCG GACGCGTTCG AGTTTGGCGC GAGAACGCGG CGAAGCATTC GATCGAGTCC GCACCTCTCG TAGCGGTAAA TACGCTCACC AAAGGAGGCG ATGATAATGA CGAAGATTTT GAAGCCGCCG CGGGAACGTT CACGAAGTGT GCGCTCACAC ACGATGGGAA CCGTTTGATT GGCGTGAGCG GCGACGCGCG TTTGTTGACG TACCAGGCCA ACGCAGAGAC GACGTCGTTG GAGATTGAAC GCGAAATTGT TGCGAATACG GATGAAGTGA TCGGTTTGGC GTTCGTACCC GGTGCGAAAG AGCAAGCACT TCAAAAGAAG CGAAACATCG ATGGCGATAG CGACGAGAAC GAGAATGAAG ACGAGCGAAC GCTCGCGAGA CCGCCGAGAG AAGTAGCCGT GGTCACCAAC TCGCCCACGG TACGTATGTT TGATCCGACG ACGATGTCAT GTGTCGGATC TTTGAACGGT CACAGCGCCG TTGTGCTCTC GGTTGATGCC ACGATGACGA CGGACGGGAC AGCGCTCATT TTGACGGGCG CAAAGGATCA CACGGTACGA CTGTGGGACG CCGCCACGCG AGAGTGCATC GCCGTCGGCG AAGGCCATGT CGGCGCAGTT GCCGCGGTAG CGTTTCCCCC GAACTCGAAA AATGGCGCAC CGTTTGCCAT TTCGGGTGGC GTCGACCGCG TGCTTCGCGT ATGGGACATA GATGGAGTTC GGCGAAATGG CGACGGCGAA TTGAACGCTA CGGCGGCCAC AGTGGCGCAC GACAAGTCCC TCAACGGCGT TGCCGTTGCG CCGCACCTCC GCATGGTTGC CACGTGTTCG AGCGATAAGA CGGCGAAGAT TTGGAAAATG CCCGATTTAG TTCCGTTGGC CACGCTACGC GGCCATCGTC GTGGAGTTTG GGCGTGCGCG TTTTCTCCTT CGGATCGCGT ACTCGCCACC GCGGGCGGCG ACAAGATGGT GAAGATTTGG AGCGCCGATG ACCGTGCTGG GAGCGACACC AACGGTGCTT GCTTGCGCAC GCTCGAAGGT CATACCGCAG CGGTGTTGAG CATTAAATTT ATGTCTCGAG GTACCCAGCT TGTCACCACG GGTGGCGACG GGCTGTTGAA TTTATGGAAC GTCACCTCTG GGTCTTGCGC CGCATCCATC GATGCGCACG AAGACAAAGC TTGGGCGCTG GCCGTGGCAA GCGATGGCGA TTGGATCGCC ACTGGGGGCA CCGACGCGTC CATGGCGCTG TGGAAGGACT CCACGTCGAG CACCACCGCC GATGCGGCGA AGAAGCACGC CCTCGCCGTC GAACGCGAGC AAGCATTCTT CAACGCCGAG CGCTCGGGCG AAGTCACGAA GGCGATCGAT TTAGCGCTCA GACTCGAGCG CCCTGGTGCA CTTCTTCGTG TTTTGACGAA ACTTCTGGAG AGTGACTACG AAAATGGCGA CGCCAGACTC CGGAAATGCG TCGAGCCGTT GCACGAAGAC AAGCTCGCGC GAGTGCTCAA GTGCGTGCGC GAGTGGAACA CAAACGGACG CACGTGCCAC GTCGCGCAAC ACGTTCTCGC CGCCATCTTC CGCACACACA CCATGGAGGA ACTGAGCAAG GTTCCGGAGA TTTCTCAAAT CACTAGAGCG TGTCGGGCGT ACACCGAGCG TCATCGCTCG CGTCTCGAGC GTCTGTATCG CGGAACTTTT TTAGTCGACA CGCTCCTCTC GCGCACGGGC GCCTTGGTAG ACGACGAAGA GTCGATGGAA GAAGTCAGGC GCACCCACGA AACACTGGAT AACTTCGGTT TCATGCGCGC GGATGATGAC GCGCCGCCGC GACGTTTGCC TGCTCCGACG GCGAGCGAAG AAGACGAGCC CGCCGACGTC GCAGACGAAG AAATGGCGGA GCCGAGCGAG GACGACGAGC CCGCCGGCGA AGGGGAAGCC GCCGAAAAAG TGCGAGAAGA CGATGTCGTC ATGGGTCCGC CGAAAAAGCT CAAGCGATTA AACGCGATAA AAAGACTCGC GTCCGACGTA GAGGATCAAC GCAAACTCTT GCGCGACCCG TCTCCCCGTC ATACGCGTAG CGGCAAAAAA TTGAGCGGCT GA
|
Protein sequence | MAIATPARDG GGDGDGTNDG DGDGRETRAP KARAARRETR ADEVVYACAT REGGVSLTRS TTRRGDAATT STTTALARDE GDGEAMTSIA MDRSGTKVFC ASRSGRVVRL DAMEDGRGGT TMRRTKAWSP HKTSPVLDMC VDVTGTLLCT GSADRTARVW DIERGYCTHA FRGKHGGAVT ATAFHPSVRE ARAFTAAEDG SLAMWSLTGE AGVGKKGKKA SSDGCVAFVA NAHVSAVTSI RIDVESNTLL TAGRDKIVRT FDLDTLNPRT TTAVHETIED CVILRPDSAI VRDCKVKPPP GGRGVIFAVV GDGGRVRVWR ENAAKHSIES APLVAVNTLT KGGDDNDEDF EAAAGTFTKC ALTHDGNRLI GVSGDARLLT YQANAETTSL EIEREIVANT DEVIDERTLA RPPREVAVVT NSPTVRMFDP TTMSCVGSLN GHSAVVLSVD ATMTTDGTAL ILTGAKDHTV RLWDAATREC IAVGEGHVGA VAAVAFPPNS KNGAPFAISG GVDRVLRVWD IDGVRRNGDG ELNATAATVA HDKSLNGVAV APHLRMVATC SSDKTAKIWK MPDLVPLATL RGHRRGVWAC AFSPSDRVLA TAGGDKMVKI WSADDRAGSD TNGACLRTLE GHTAAVLSIK FMSRGTQLVT TGGDGLLNLW NVTSGSCAAS IDAHEDKAWA LAVASDGDWI ATGGTDASMA LWKDSTSSTT ADAAKKHALA VEREQAFFNA ERSGEVTKAI DLALRLERPG ALLRVLTKLL ESDYENGDAR LRKCVEPLHE DKLARVLKCV REWNTNGRTC HVAQHVLAAI FRTHTMEELS KVPEISQITR ACRAYTERHR SRLERLYRGT FLVDTLLSRT GALVDDEESM EEVRRTHETL DNFGFMRADD DAPPRRLPAP TASEEDEPAD VADEEMAEPS EDDEPAGEGE AAEKVREDDV VMGPPKKLKR LNAIKRLASD VEDQRKLLRD PSPRHTRSGK KLSG
|
| |