Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_30782 |
Symbol | |
ID | 5000758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 738915 |
End bp | 740462 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | |
GC content | 60% |
IMG OID | 640416179 |
Product | predicted protein |
Protein accession | XP_001416748 |
Protein GI | 145344456 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.799885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.268797 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGA CGATCGCTTC GCGCGCCGTC GCGGCGACGA CGCCCCGCGC GCGCGCGCGC GCGCGAACGA ATCCCCGCGC GTCGTCCACC TCGCGTCGGA CCCGCGCGAT CGCCGACGCC AACGCCGAGG CGACGGCCGA GGACGCGCGC GAGCTCGCGG CGTGGTTATC CTACGACAAG GGCGTGGACG CGAGCGGATT AGTGTTCAAA GAGGGCGCGA GAGGCGAGGT GGAGGTGGCC CTGCGCGGAG ACGTCGACGC CGGGGCGCGC GTGCTCGCGG TGCCGCAAGA CTGCGCGGTG ACGTCCGTGG ACGTCGACGC GCACCCGATC GTCAGCGGAT TGGCCAAGGG GCGACCAGAG CTCGTCGGGT TGGCGCTGTG GTTGTGCGCG GAGCGCATCA AGGGTGGAGC GAGCGATTGG GCGCCGTACG TGAAGACGCT CGCGGCGAAT CCAGACGCGC CGTTGTTTTG GACCGAGGCG GAAGATTTCG CGCTGTTGAA GGGGTCGCCG ATCGTGAATG ATGCGGTAGA ACGCTCGAGG AGCGCGAGGG AGGAATATGC GGCTATCGTG GAGGTCATTA AGGGTGATCC CACGGCGTTT CCCGCGGAGG CGTACGAGTT TTTCACCGAG GAGCGCTTCG TGGACGCGCT GGCGACGGTG TGCGCCAAGG CGACTTGGTT ACCCACGGCA TCGTGCTATG CGTTGGTGCC TTTGTTAGAC GTCATCACGA TTGCTGGATC TCCGGTGCCG GGCGTTTCAC CGCCCTCGGC GAAGGATGGA ATCGCGCGGT GCGCTGCGGA TTACGACGTC GACAGCGCGT GCGTGGTTTT ATCCGCCGTC GTCAAGGCGC CGGCGAACTC GCGAGTCGTA CAGTTGGATC CGTTGCAAAG GAACAACGGT GAGCTGTTCT TGAACACTGG TCGCGTTGAT CAGAAGCATC CAGGGGATTA TTTGTACATG CGGACCGAGA TTCAGCCCTC CGATCGCTTA TTTTCGGCGA AGAAGCAAGT CCTCGAGGGT ATGGGTTTCA CCGCCGAAAA CCAGTACTTT CCGGTGTATG AAGATCGTAT GCCGACCCAG TTGTACTCTT ATCTTCGTTT TGCCCGCGTT CAAGATCCCG GTGAGATGAT GGCGGTGTCG TTTGAGGAGG ATAAAATCGT TTCGGTGATG AACGAATACG AAATTCTTCA GCTTCTCATG GGCGATTGTC GAGAGTTAAT GTCTGAATAC GACACGAATG AGGAAGACGA GCTGAACCTG TTGAAGCTCT CGGACACGAT GCGCGTGCGA GAGATCGAGG CGGCCAAGCT TAGAATGTCC GAGAAGAAGC TCATCGGTTG CACGATGACG GCGGTTCGCA AGCGCTTGGC GCCGATTCGT GGCATCCCAA CCAAGCAAGG CATGGAAGAC CCGAACCAGG ATCTTTTAGA CATTTTCAAC GCGATCGAGA GCATTCCAAA TAAGCCAAAG GAAATGATGG AAGACTTCAA GAAGTGGGCG CGTGGGGACT ACGAAGACAT CCCCAAAGGC GGGGGTGGCT GCGGGTAG
|
Protein sequence | MATTIASRAV AATTPRARAR ARTNPRASST SRRTRAIADA NAEATAEDAR ELAAWLSYDK GVDASGLVFK EGARGEVEVA LRGDVDAGAR VLAVPQDCAV TSVDVDAHPI VSGLAKGRPE LVGLALWLCA ERIKGGASDW APYVKTLAAN PDAPLFWTEA EDFALLKGSP IVNDAVERSR SAREEYAAIV EVIKGDPTAF PAEAYEFFTE ERFVDALATV CAKATWLPTA SCYALVPLLD VITIAGSPVP GVSPPSAKDG IARCAADYDV DSACVVLSAV VKAPANSRVV QLDPLQRNNG ELFLNTGRVD QKHPGDYLYM RTEIQPSDRL FSAKKQVLEG MGFTAENQYF PVYEDRMPTQ LYSYLRFARV QDPGEMMAVS FEEDKIVSVM NEYEILQLLM GDCRELMSEY DTNEEDELNL LKLSDTMRVR EIEAAKLRMS EKKLIGCTMT AVRKRLAPIR GIPTKQGMED PNQDLLDIFN AIESIPNKPK EMMEDFKKWA RGDYEDIPKG GGGCG
|
| |