Gene Information |
Locus tag | OSTLU_30495 |
Symbol | |
ID | 5001033 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 286024 |
End bp | 287635 |
Gene Length | 1612 bp |
Protein Length | 478 aa |
Translation table | |
GC content | 57% |
IMG OID | 640416454 |
Product | predicted protein |
Protein accession | XP_001416612 |
Protein GI | 145344174 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.288694 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTGCG CGATCAGTGG GGCGGCGCCG GCGCGCCCGG TGGTGACGCC GAGAGGCGTG CTGTACGAGA GATCGTTGAT CGTGAAGGCG ATCGAGGTGC GCGCGCGAAC CGAGGGCGAG GGGAGGGCGA TGGGAGGACG CGCGCGGGCG ATCGCGGCGA ACGGGGCGCG GATGCGACGT GAATTGTGAC GCGCGAATGC TGACCGGAAC GCGCGATTTA CGACGATAGG AACGAGGCGA GTGCCCGGTG ACGAAAGAAT CGCTGAGCGT GGACGATTTG ATCGAACTCA AGGCGCAGAA ATGGGTGAAC CCGCGACCGG AGGCGACGAT GTCGGTGCCA GGGTTGTTGA GCGCGTTTCA CAACGAGTGG GACGCGTTGA TGCTAGAAAC GCACACGCTG CGGAAGGAAT TACAAACGAC CAGGCAAGAG TTGAGTCATG CGTTGTACCA ACACGATGCC GCGTGTCGAG TGATCGCACG GTTGATGAAG GAGAGAGATG ATGCCAGAGA CGCGCTCGCG AACGCGAAGG GGTCGGCGAA GCGAAGCGCC GCGGGGGACG CCGAACCGGA ATCGAAAAAG GTCAAGGCGG GACTTCCGGC GGCTGTCGTG GCGAAAATGA ATGACGTGCA GAAGGAACTT TCGAGTGGAC GCAAGAAGCG CGAAATTTCG AGCGAATTGG CGACGATGGG GGATATCTCG GCGTACGAAG CCAAGGTGAC GCAAGCTGTG CACAAGACGT CGCCTGCAGG CATCAACGCC GTGGCGATCA AAGCGGGCGA CGATAACGTC ATCGCCACGG CGGGTAACGA TCATACTGTG GCAATCTTTG ACAAAGTAAG CTCGCAGCGC GTTCAGCAGC TCAGCGGACA TTCCAAGAAG GTTCTCGACG TCAAGTTTGT CGGCGACAAC GTGTTGTCGT GTGGCGCGGA CAAAGTCGTC AAGCTTTGGG GTGCCGATGG CTCTGAGATT GCAACGTTCG GTGATCACAC TGGTGATGTA ACCAGCGTCA GCGTGCATCC CTCGAACAGC TATTTTGTGT CCACTTCAAC GGACAAGACT TGGGGATTTC ACGACCTTAC CACGAGCTCG TGCATCACCT TGGTGAACGA CGACACCGAT TCTGCCATCA CGTGCGCAAA TTTCCACCCC GATGGTGTGA TTTTGGGAGC GGGCACGAAG GATTCGGTAG TTAAGCTTTG GGATGCTAAA GATGCGCGCA AGTTGCTTCA ATTAGACGGC CATGCGAGCG AGATTACCGG TTTGTCTTTC TCGGAAAATG GCTACTATCT CGCCTCTGCG GCTAAGGACG GTGTGAAGAT TTGGGACCTT CGAAAGTCCA AGCTCGTTCA CGAGATCGAA TGTGCTGGCG CACAGGGCGT TGCTTTCGAC CACAGCGGGT CTTACATCGC CACTGGCGGT CACAATGCGT CCGTTTACCA GGTGAAAGGC AAGTGGGAAT TGGTCAAGGA GTTTGAGGTG AAGAAGGCTG TCAAGGCTGT CGCCTTCGGC GGCGACGCGC GGTCGCTCGT CGTCGCTAGT GCCGACCATA ACTTGCGTAT CTTCGCTTAG ATCTAGGATT CGATTAATTT ACACACTTCT AAAGTGAGCT AA
|
Protein sequence | MFCAISGAAP ARPVVTPRGV LYERSLIVKA IEERGECPVT KESLSVDDLI ELKAQKWVNP RPEATMSVPG LLSAFHNEWD ALMLETHTLR KELQTTRQEL SHALYQHDAA CRVIARLMKE RDDARDALAN AKGSAKRSAA GDAEPESKKV KAGLPAAVVA KMNDVQKELS SGRKKREISS ELATMGDISA YEAKVTQAVH KTSPAGINAV AIKAGDDNVI ATAGNDHTVA IFDKVSSQRV QQLSGHSKKV LDVKFVGDNV LSCGADKVVK LWGADGSEIA TFGDHTGDVT SVSVHPSNSY FVSTSTDKTW GFHDLTTSSC ITLVNDDTDS AITCANFHPD GVILGAGTKD SVVKLWDAKD ARKLLQLDGH ASEITGLSFS ENGYYLASAA KDGVKIWDLR KSKLVHEIEC AGAQGVAFDH SGSYIATGGH NASVYQVKGK WELVKEFEVK KAVKAVAFGG DARSLVVASA DHNLRIFA
|
| |
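The Gene Length and GC content fields can be checked against the coordinates and sequence in the record. The following is a minimal Python sketch, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon. The function names `span_length` and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table in this record.
START_BP = 286024
END_BP = 287635
PROTEIN_LENGTH_AA = 478


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; spaces between 10-base blocks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


gene_length = span_length(START_BP, END_BP)

# Assumption: 478 residues plus one stop codon, three nucleotides each.
coding_nt = 3 * (PROTEIN_LENGTH_AA + 1)

print(gene_length)  # 1612, matching the Gene Length field
print(coding_nt)    # 1437
print(gc_percent("ATGTTTTGCG CGATCAGTGG"))  # first 20 bases of the gene: 50.0
```

Under these assumptions the coding portion (1437 nt) is shorter than the 1612 bp gene span, which is consistent with the record marking the gene as spliced. Passing the full Gene sequence value to `gc_percent` gives a figure to compare with the reported GC content.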