Gene Information |
Locus tag | OSTLU_17102 |
Symbol | |
ID | 5004024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 378838 |
End bp | 381804 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | |
GC content | 64% |
IMG OID | 640419445 |
Product | predicted protein |
Protein accession | XP_001420157 |
Protein GI | 145351596 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.121633 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.237025 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAGG CCAAGGCGCG CGCCGTCGCG CGGTCGTCCG CCGCGTCGTC CGCGCCGGCG ACCAAGCCGA GGCGCGGCGT CGGGTTCACG AGCGGGCCGA CGAAGCATCG CACCGGGAAG ACGCTCGGCG GCGGCGTGGA GTTCGCGAAG AAACGCCTGA AAGTCGGTCG AAAGGTGGCG AAACACGCGA ACGAGACGGA CAGCGCGGTG CGGAGCAAGC GCATACGGCT CGCGGCGCAG AATTTGCAGA GCGCGGGGGC GGGGACGAGC GGTGAGGGGA CGCGGGACGA GGACGCGGCG AGCGCGCGAG GGACGCCGCT GAACGAACTG TTGAATCAGT GCGGACACTA CGCGGCGAAG ACGCGATGCG ATGGGTTGAA TGGGTTGTTG GAGGTGTGCG AGAGGTATCC GGGCGCGGTG CGAGCGAGGG CGGGGGACGC GATCGAACGC GTGGGCGAAC GGTTGGCGGA CGAAGCGAGA GAGGCGAGAC GAGCGGCGAG GGAGTGCCTG GGGCGAGGCG TGGTGCCGGC GTTAGGGGTG GAGGGGCTGG CGCCGTTTGC GAAAACGTTG ATTTTGTACG CGGGGGCGGC GCTGACGCAC GTGGCGGACG ATGTACGGAG GGACGCGCCC GCGGCGCTGG ACGCGCTGTT GGAGGCGGCG CCGACGCTCG TGGCGGCGCA CGCGCCGGCG AGCACGCTGG GGCACTTGGG CGAGCTCTTG CGGCGCGGAG ACGACGGTGG CGTCGGTTCG GGGTCGTCCG TGCAGCGTGG CGTCGGATCG CAAAAGCCCG CGACGCGATT GGCGCTATTG CGAAGTTGTA GGCGCTTTTT GGAGACGCTC GCAGAGGGCG TAGGCACGAC GGACGGCGAT CGCGGTGGTC TTTCGTCGAC CTCGTCGATG ACGACGACGT TCGTTTGGGG AGAAGAGGCG CGCCGCGGCG CCGCCACGCG CTCTATCGGA TCGATGTATG CGGAAAGATC GCGCCAAGCG CCGCCGAGCG TGCTCGCGGC GACGACGGCG TCCAACGACG AATCTGAAGA TTCCGGGGGA CGAATCACCG GTGAAAGTCG TGCTGCGGTG CGCGTGAACG CCAAACGCCT TGTAGAGTTG GCGATGTGCG TCTGGGACGA CGCAGCGCAG ACGTTTACGG ATGAGAGAGG CGTCGACGTT GATCGCGTTC GAGTAATGGC GCAATCCATG GCGTGCGCTC GGTTAGCGCT CAGCCTCGCC GACGGTGCGG AAGAATCGGA GATGCTCGAA AACACTGATA GTGCATCGAC TGCGGTGAGT GTCGTTCCGG AAATCGCGCG GCGTACTCTC GGCATGTTCC CATCGACGTC GCCGGCGAGC GTGGCGGAGA AGCAAGACAT CTCGCGAACG CGCGAGGCGA TGGTGGATTT GAATTTCGAG ACGTGTCGGT TCTTGCTCGA CGCGTCCTCA TCGGTTGCTC ACGAAGCACT CGCGCAGCAT TTAGCGCCAG GAGTGATGGA TGCGCTTCCG CACGTTCTCG CGCGCGCTTT GCAATACGTC ACGTCGACGC TGCGGGGCGT CGCCCTCGAC GGCGGCGCCA TGGGCGAAGA CGTGCCGACG CCGGATGACG CGTACGGGGA CGTCTTGGCG CTCGCGCGTG ACGCACTTAC TTTACCGGCG TGGTGTTTCA GTGCGAGCAT CACGGGTAGC GCGTGTGCAG ATCTTCTCGT CGCCGTGACG GAGACTTGGG AGCGCGCCGT GGCGGATGAA GACATCGAAC GCATCACGCA ATGCGTCGCT TTGTTGACGG AAACTTTGCC TGAAGAAGCG 
CGACAAGGGT ACTTTCGCGT GCCGATCGAA ACCGCTGCGG GATGGGTGCG TCACATTCCT CGCGTACTGT GGGCGTTCAA GCATGAAAAC CCCTCTGCGA CGCAAAAGTT GCTGTCGTTG CTACACGACG TCGCGGCGAG AAATCCTCCG GGATCGCCGT TGGCTGACGT CTTATCGACG TGCGAGGCAG AAATGGCGGT GCTTTTCTTC ATGGTTCCAC CCGCGGGGTC ACCCGAAGGC GCGAAATCGA GACCCGGTCC GTTCGCCCGC CTTCCGTTCC CGTCGCAGTG TGCGGCGGTG CGTCTCGTAG GCGTCTTGCC GACGCTCACG CCGCCGATGA TTCGGGCGTT GGCCAAGATG TGTCTGGACG TTGACCGCGT GAATGAAGAA CTTTGCGTCA TCGCTATCGA GGCCATGCAA GCCAACGCGC TGGCGGCGCC TTTAGAGTTA ACGATGTCGT TCTACGCCAC TCTACTCGTC GGCGCCGCCG GGGTGAAATT TCTCGACAAG TCGAGCAAGC GCGACGTCGC GACGGTGGAA CAGAAATCCT GGCTCATCGC TCGTCGAGCG ATTCCGAGCG CGGCGGCCGC TTTAGTAGCG CTTAGTGATG CCGACGCGCC GTGGACCGGG GCATCTCTCG CCAGTGTGAC GCTCAAGCAC ATGTGGAGCA GTCGAGTGGA AAAGGGCGAC GTCGACGGCG CCACGCGAAC GGCGAGCGGG TTCGTCGCGC TCATCGCGAG CACGGCTGAA TTCGCAAGAT CGGTGTCGGG TCAATCGGGT CAATCGATCG ATGACGATAA CGTTTCCGGC GCCATTCCAG AGATGTTCGC TTGGTTCATC TTGCGCGCTG AAGACGGCGA TGGCGTTGAC GTCGACGTCG CGTGGCGAGC GTTGCGCGCC GCGCCTTCGA CGACGCCTGG TCCCGTCGCG AGCGCGGTCG TCGCGTCTTC CGAATCTTCC GCCGCGCTCA CCGATCGCGC GTTGGCTTTC GTGAGCAAGT TGATCACGGA AACCGCCGCC GGTTCGATTG AAATTTCCAA AGACGAGTTG CGCGACGTCG TCCGATCGAT TCAAAACAAA GCGTCGGCGC TCGAGGCGAA CGAGTCGACG AAACGCGCGC GGGCGCTCGA CGTTCATTGG AACGTCGCGT TCGGAGAGGC GATATAG
|
Protein sequence | MGKAKARAVA RSSAASSAPA TKPRRGVGFT SGPTKHRTGK TLGGGVEFAK KRLKVGRKVA KHANETDSAV RSKRIRLAAQ NLQSAGAGTS GEGTRDEDAA SARGTPLNEL LNQCGHYAAK TRCDGLNGLL EVCERYPGAV RARAGDAIER VGERLADEAR EARRAARECL GRGVVPALGV EGLAPFAKTL ILYAGAALTH VADDVRRDAP AALDALLEAA PTLVAAHAPA STLGHLGELL RRGDDGGVGS GSSVQRGVGS QKPATRLALL RSCRRFLETL AEGVGTTDGD RGGLSSTSSM TTTFVWGEEA RRGAATRSIG SMYAERSRQA PPSVLAATTA SNDESEDSGG RITGESRAAV RVNAKRLVEL AMCVWDDAAQ TFTDERGVDV DRVRVMAQSM ACARLALSLA DGAEESEMLE NTDSASTAVS VVPEIARRTL GMFPSTSPAS VAEKQDISRT REAMVDLNFE TCRFLLDASS SVAHEALAQH LAPGVMDALP HVLARALQYV TSTLRGVALD GGAMGEDVPT PDDAYGDVLA LARDALTLPA WCFSASITGS ACADLLVAVT ETWERAVADE DIERITQCVA LLTETLPEEA RQGYFRVPIE TAAGWVRHIP RVLWAFKHEN PSATQKLLSL LHDVAARNPP GSPLADVLST CEAEMAVLFF MVPPAGSPEG AKSRPGPFAR LPFPSQCAAV RLVGVLPTLT PPMIRALAKM CLDVDRVNEE LCVIAIEAMQ ANALAAPLEL TMSFYATLLV GAAGVKFLDK SSKRDVATVE QKSWLIARRA IPSAAAALVA LSDADAPWTG ASLASVTLKH MWSSRVEKGD VDGATRTASG FVALIASTAE FARSVSGQSG QSIDDDNVSG AIPEMFAWFI LRAEDGDGVD VDVAWRALRA APSTTPGPVA SAVVASSESS AALTDRALAF VSKLITETAA GSIEISKDEL RDVVRSIQNK ASALEANEST KRARALDVHW NVAFGEAI
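The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names (`span_length`, `expected_protein_length`, `gc_percent`) are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table in this record.
START_BP = 378838
END_BP = 381804
GENE_LENGTH_BP = 2967
PROTEIN_LENGTH_AA = 988


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Spaces and line breaks (as in the 10-base blocks of this record) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Coordinates and gene length agree under the inclusive-coordinate assumption.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # Gene length and protein length agree once the stop codon is subtracted.
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

To compare against the reported GC content, the full text of the Gene sequence field can be passed to `gc_percent`. The strand value (-) does not affect the result, because the GC fraction of a sequence equals that of its reverse complement.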
|