Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_12608 |
Symbol | |
ID | 5002223 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 280244 |
End bp | 281391 |
Gene Length | 1148 bp |
Protein Length | 345 aa |
Translation table | |
GC content | 59% |
IMG OID | 640417644 |
Product | predicted protein |
Protein accession | XP_001418430 |
Protein GI | 145347967 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.000194885 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0659856 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTGGG GACAGGCGGC GACGCAGGCG CCGGCGAACC CGAACGGGGA CTTTCTGGTG GCGAATCCGC CGAACGATGG GATAAGCTCG CTGTCGTGGT CGCCGACGGG GAATTTTTTG GTGGCGACGG CGTGGGACGG AGACGTGCGT GAACGCGCGA GGCGAACGCG AACGCGAACG CGAACGAGCG AACGAGCGAA CGAGCGAACG ACGGGGACTG ACGAGACGAC GCGGTGCGTC GAGACGCGAT GTAGGTGTAT TGCTATGAGG TGGCGAACAA TGGGCAGGCG ATGCCGAAGG CGTCGACGAA GCACGAGGCG CCGGTGCTGT GTAGCTCGTG GTCGAGCGAC GGCGCGAGCG TGTTCACGGG CGGGTGCGAT AACATCGCGA AGAAATGGGA CTTGGCGAGC GGGCAGGCGA CGCAGATCGC GCAGCACGAT GGGGCGATCA GACACATGGC GTGGATCGAA CAGGTGGGGT TGTTAGTCAC CGGATCTTGG GATCGAACGT TGAAGTATTG GGACACGCGT CAGCCGAATC CCGCGCTCCA GGTTCAGCTC CCCGAACGGT GTTACGCGTT AGATGTCACG CACCCTTTGC TCGTCGTGGG ATGTGCCGAG AGACAGATTC AGATTTTCAA CTTGAGCAAC CCGCAAGTGC CGTACAAGCA GTTGTTATCG CCGCTCAAGT ATCAGACGCG ATGCGTCGCC ACGTTTCCCG ATCGCTCGGG CTACTTGGTC GGCTCCATCG AAGGACGCGT CGCGGTTCAG CACGTGGAAG ACAATTTGCA AAGCAAGAAC TTTACTTTCA AATGTCACAG AGAAGGCACG CAAGACATTT ACGCCGTCAA CTCCATCTCG TTCCACCCGA CGTTTGGAAC GTTCGTCACC GCGGGCGCCG ACGGCAACTT CAACTTTTGG GACAAGGATA GTAAGCAACG CCTAAAGAAC ATGACCAAGT GTTCGGCACC CATCTCGTGC GGGAACTTCA ATCGCGATGG GACGATTTAC GCGTACGCGG TGTCGTACGA TTGGAGCAAG GGCGGTGACA ACCCGCTGTC AAACACGCCG AATAACATTT ACTTGCACGC CGTGAACGAA ACAGAAGTCA AGCCGCGGCC GGCGAAGAGC GGAATCGGTC GTCGATAA
|
Protein sequence | MSWGQAATQA PANPNGDFLV ANPPNDGISS LSWSPTGNFL VATAWDGDVY CYEVANNGQA MPKASTKHEA PVLCSSWSSD GASVFTGGCD NIAKKWDLAS GQATQIAQHD GAIRHMAWIE QVGLLVTGSW DRTLKYWDTR QPNPALQVQL PERCYALDVT HPLLVVGCAE RQIQIFNLSN PQVPYKQLLS PLKYQTRCVA TFPDRSGYLV GSIEGRVAVQ HVEDNLQSKN FTFKCHREGT QDIYAVNSIS FHPTFGTFVT AGADGNFNFW DKDSKQRLKN MTKCSAPISC GNFNRDGTIY AYAVSYDWSK GGDNPLSNTP NNIYLHAVNE TEVKPRPAKS GIGRR
|
| |