Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_40743 |
Symbol | |
ID | 5005746 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009369 |
Strand | + |
Start bp | 365468 |
End bp | 366952 |
Gene Length | 1485 bp |
Protein Length | 478 aa |
Translation table | |
GC content | 60% |
IMG OID | 640421167 |
Product | predicted protein |
Protein accession | XP_001421637 |
Protein GI | 145354744 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0562331 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTCCG CGCCCGCGCC GCCCGTGCGC GTCGGTCACG CGCTGAGTGG GATATTCGTC ATCAACCTGC GCGGAGACGT GCTGCTGATG CGCGCGTACC GCGAAGACAT CGAGCGACAC GTGTTGGATG CATTTCGCAC GCAGATCCTG AATCCGCGCG GTGGCGCGCG ACGCGACGCG CGATCGACGC CTCGACGCGG ACGCAAAGAC GACGGATTCG CCACGGAAGC GCCGGTGCGA AGAATCGGGA GCGTGACGTA CATGATGAAA CGGTCGAGAG ACGTGTACGT GGTGGGGATC GCGCGCGGGC AAGGCGAACG GGGCGGACCG GGCGACGCGA ATTTAATGCT CGGATTCACG TTCCTGGGAC ACGTCGTGCG GCTGTGCAAT CAATACTTTG GCGCGTGCGA TGAAAACGCG ATCCGTGGGA ACTTTGTGCT GATGTACGAG CTGCTGGATG AGATTTGCGA CGACGGGTAT CCGCAGATAA CCGCTGGGGA GACGCTGAAG ACGTACATCA CGCAGAAGGG TTCTAAACTT GAAGGTGCGA TCGGAAAAGA GGCGATGGAA CGGAGCGCGG CGGAGGACCA ACGCCGGGCG ATGGAGGCGG CGAAACAGGT GACGAGCGCG GTGCAATGGC GAAGAGAGGG GTTATCGTAT AAGAAGAATG AAGTGTATTT GGACATCGTG GAGAGCGTGA ATCTGATGAT GAGCGCGGAA GGCACGGTAT TGCGAGCGAA CGTGCAGGGT TCGATTTACA TGAGGACTTT TCTGAGTGGG ATGCCAAACC TCAGCGTCGG GCTGAACGAT CGCCTCGGGG AGACGACGCG CGTGACGTCG CGCGGCGAAG ACGCCGAGAC GAGCGCGGCT CGCGATCGAA GGCTGATCGA CCTGGACGAT TTACAGTTTC ATCAGTGCGT GCGACTGGAT AAATTTAGCG CGGAAAAAGT GATCGAGTTC ACCCCGCCCG ATGGCGAGTT CGAGCTCGTC AAGTATCGCG TGTCGGATAA CATCACGCTT CCGTTCAAGC TCATGCCCGT AGTGAAGGAA CTCGGTAGAA CGCGTCTGGC CGTCACCGTC AACCTACGCT CGCTCTACGG TCCCACGACC GTGGCGAACG AAATTAAAGT GCGAATCCCC GTCCCCAAGC TCACCGCGCG GGCGACGATC AACGTGAGCG GGGGCAAGGC CAAGTACGTA CCCGAGGAGG GCTGTCTTCG CTGGAAAATC AAAAAGTGCG CGGGTCACGA GGAATACCAG CTCGACGCCG AGGTCTTACT CGCCAACACG CTGGAGGACC ACAAACCTTG GGTGCAACCG CCGATAAACA TCGCGTTTCA CGTCCCGATG TTCACCGCCT CGGGCTTGCG AGTGCGCTTT CTCGAAGTCA AGGAGGCGTC CAACTACGAC GTCGTCAGGT GGGTGCGATA CTTGTGCCAG AGCGGCGGTT CGTCGTCGTC GTCGTACGAG ATTAGATGCG CGTGA
|
Protein sequence | MSSAPAPPVR VGHALSGIFV INLRGDVLLM RAYREDIERH VLDAFRTQIL NPRDDGFATE APVRRIGSVT YMMKRSRDVY VVGIARGQGE RGGPGDANLM LGFTFLGHVV RLCNQYFGAC DENAIRGNFV LMYELLDEIC DDGYPQITAG ETLKTYITQK GSKLEGAIGK EAMERSAAED QRRAMEAAKQ VTSAVQWRRE GLSYKKNEVY LDIVESVNLM MSAEGTVLRA NVQGSIYMRT FLSGMPNLSV GLNDRLGETT RVTSRGEDAE TSAARDRRLI DLDDLQFHQC VRLDKFSAEK VIEFTPPDGE FELVKYRVSD NITLPFKLMP VVKELGRTRL AVTVNLRSLY GPTTVANEIK VRIPVPKLTA RATINVSGGK AKYVPEEGCL RWKIKKCAGH EEYQLDAEVL LANTLEDHKP WVQPPINIAF HVPMFTASGL RVRFLEVKEA SNYDVVRWVR YLCQSGGSSS SSYEIRCA
|
| |