Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_87742 |
Symbol | |
ID | 5002874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 289289 |
End bp | 290884 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | |
GC content | 63% |
IMG OID | 640418295 |
Product | predicted protein |
Protein accession | XP_001418663 |
Protein GI | 145348453 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.966257 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000597375 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCGTCGC GCGCGACGGC GCGCGCGGCG GCGTCGCACG ACGCGCGCGC GGCGACGCGG CGCGCGCGCG TCGGCGGTGC GGCGCGCGAA TCGGCGGCGC GGCGGCGCGC TTCGGGACGC GCGACGTCCG ACGTCGACGG CGCTTCGGGA CGCGCGACGT CCGACGTCGA CGGCGCTTCG GGACGCGCGA CGTCCGCGGA CGCGTTCGAT CTCGGACAGG CGATCGGGAC GCTGGCGAGG GGCGAGCGAG GGGCGACGCT GCCGCCGGGA CGCGTGGGCG CGCTCGGCGT GCGCGAGACG CTGGAGTACC TCGCGGACTC GAACGGCTTC GTGCGACGGC GCGTCGAGCG GTACGGACCG ATTTTTAAGA CGGCTTTGTT TTTCAAGCCG GCGATCGTGT TCGGAAGCCG AGAGGCGGTT CGAGAGTTTT TGAAATTTGA AGGCGAGCTC CCCGCGGACG AGGCGCTGCC GGAGACGTTT CGCGAATTGC ACACCGAGTA CGGGGCGCTG CGCATGACGG GGAGCAGACA CGCGGCGACG CGCGCGAATT TCGGCAAGGT GCTGGGACGC GCGGCGCTGG AGAGTTACGC GCCGGCGATC GGGGAAAGGA CGAGGGAGTT TGTGGAAGAC GTAGCGCGGC GATCTGGGAA GGAGACGTCG TCGTTCAGGC CGGGCGCGGA GTGCGTGGAT TTCGCGCTCG ATTTGTTGTT TGAGCTCTTC CTCGGTCACG TGCCCGAGGC AAAGTACAAA GACGCGATGA AAGCGTACAA CGGAGGATTA CTTTCGCTCG GAAAATGGTC GTCGGAGTTT AAGGCTGGAA AACTGGCGCT GGAGGATTTG ACGTCGTACG TCGAGGCGCA TTACAGAGGC GTCAAAGCGC GCGGCGAACT CGATCGCCCA GAGTACTTCT TTTACAAGCA ATACTCGCAA GCCGTGGACG AGTTCGACGA AGTGTTCAGC GACGATCGCA TCGCGACGAC GTGCGTGTTA ATGGTCTGGG GTTCGTACAT CGAAGCCGCC GCTCTCATGG GACACGCATG CGTTTTGCTG GGCGAGCACG ACGACGCGCG GCGAGCCGTT TTGCGCGAGT TCGAGCGGGT GTGCTGCGAC GATGAAAACG GTTGTCGTCG CATCGGTACT TTGGCGGACA TCATGTCCAT GCAGTACACG TCCGCGGTGG CGAAGGAGTC GCTTAGAGTC ATGCCGCAGA CGGCGGGCGG CTTGAGAGTG AACCCGTCGC CTCGCAAATT CGCCTCGTTC GACGTACCCG CCGGTTACGT TTTGACCGCA GACCCACGCA TTCCATTTCG CGACGAAGCC AACTTCCCCG ACCCGGACGC GTTCAAGCCT GAACGTTTCG TTCCAGGGAC GCACGAAGCG AAGCAAAACG ACGTCTCTTC GGAAACGTAT TATCCGGGCG GCATGGGCCA GCACCAGTGT CCTGGGATCT CTCTCGCGAC GGTCATGACG CAAATATTTC TCGCCGAACT CGTGTCCGCT TTCCCAAACG GGTGGCGAGG AAAGACGGCG CCGAAATACG TTCAGGTTCC CATAGTCATC CTCGACAGAG AGTATGAAAT CGAGTTTCTT CGCTGA
|
Protein sequence | MASRATARAA ASHDARAATR RARVGGAARE SAARRRASGR ATSDVDGASG RATSDVDGAS GRATSADAFD LGQAIGTLAR GERGATLPPG RVGALGVRET LEYLADSNGF VRRRVERYGP IFKTALFFKP AIVFGSREAV REFLKFEGEL PADEALPETF RELHTEYGAL RMTGSRHAAT RANFGKVLGR AALESYAPAI GERTREFVED VARRSGKETS SFRPGAECVD FALDLLFELF LGHVPEAKYK DAMKAYNGGL LSLGKWSSEF KAGKLALEDL TSYVEAHYRG VKARGELDRP EYFFYKQYSQ AVDEFDEVFS DDRIATTCVL MVWGSYIEAA ALMGHACVLL GEHDDARRAV LREFERVCCD DENGCRRIGT LADIMSMQYT SAVAKESLRV MPQTAGGLRV NPSPRKFASF DVPAGYVLTA DPRIPFRDEA NFPDPDAFKP ERFVPGTHEA KQNDVSSETY YPGGMGQHQC PGISLATVMT QIFLAELVSA FPNGWRGKTA PKYVQVPIVI LDREYEIEFL R
|
| |