Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_43800 |
Symbol | |
ID | 5006547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 116907 |
End bp | 118490 |
Gene Length | 1584 bp |
Protein Length | 528 aa |
Translation table | |
GC content | 60% |
IMG OID | 640421968 |
Product | predicted protein |
Protein accession | XP_001422489 |
Protein GI | 145356546 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | [TIGR01790] lycopene cyclase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCGC GTCGAGCGCC CGCGGCGCGC GTGACGCGCG CGATTCGCGC GCGAGGCGAC GCGGGAACGC GCGCGCGCGA CGTCGCGCCG GGCGCGACGC GGCGCGGGGC GTCGGCGACG CCGCGGGCGA CGCGACGGCC GAGCGCGAGG GAGACGCGGC CGGAGCTGTA CGGCTTGGAC GCCTCGTGGG ACCCGCTGAC GAGCGGCGAT CGGCGGGAGA GCGAGGAGTC GCGAACGCCG CTTCCAGAAA CGCTGCCGAA CGTGCGATGG GGGACGAGCG CGAGCGAGGC GTACGATTTG GTGATTGTCG GGTGCGGACC GGCGGGGCTG ACGGCGGCGG ACGAGGCGAG CAAGCGCGGA TTGCGCGTGG CGTTGATGGA TCCGTCGCCG CTCGCGCCGT GGATGAATAA TTACGGGGTG TGGTGCGACG AGTTTAAATC GCTCGGGTTC GATGATTGCT ATCGCGCGGT GTGGAACAAG GCGCGAGTTA TTATAGACGA CGGCGACGCC GACGGGAAGA TGCTCGACCG CGCGTACGCG CAGGTGGATC GGAAGAAGCT CAAGCAGAAG CTCATCGCGC GCAGCGTGAC GCAGGGCGTG GAGTTTGGTA TCGCCGCGGT CGATAGCTGC GATAACAGCG ATCCGAACCA TTCGGTGGTG ACTTTGAGCG ATGGACGCAA GGTCTATGCG AAGATGGTTT TGGACGCCAC TGGGCACTCT CGTAAGCTGG TGGACTTTGA TCGCGATTTT ACGCCGGGAT ATCAAGCCGC TTTCGGAATC GTGTGCACAG TGGAGAAGCA CGACTTTCCG TTGGACACGA TGCTGTTCAT GGACTGGCGA GACGAGCACT TGAGCCCAGA GTTTAAGCGA GCGAACGACA GGTTGCCGAC GTTTTTGTAC GCCATGCCTT TCTCGGAAAC TGAGGTGTTC CTCGAGGAAA CGAGCTTGGT GGCACGACCT GGCTTAGAGT TTGACGACTT GAAGCTCAAG TTGAAGGAGC GTTTGGATTA TTTGGGCGTG AAAGTAACCA AGGTACACGA AGAGGAGTAT TGTCTCATTC CCATGGGCGG CGTGTTGCCG ACGTTTCCGC AACGCACGCT CGGCATCGGT GGAACCGCCG GCATGGTCCA TCCTAGCACT GGATTTATGG TCGCAAAGAC GATGTTATGC GTTAGAACGC TCGTAGGCAC GCTTGATGAA GCCCTTAAGG CGGGTAAGCG AGGGGATATT ACCGGCGCCC TGGAAGCGGC GGAGGCGGCG CAAATGAACA ACGGTAAATT CGACGCCGAC GCCACCGCGG CATTAGTGTG GAACTCAATT TGGCCGGAGA ATGATTTGCG CATGCGCACT TTCATGTGCT TTGGAATGGA GACTCTTATG CAGCTCGATA TCGATGGAAC GCGTCAATTC TTTGACACGT TCTTCGACCT TCCCAAGGAC GTCTGGGCTG GCTTTTTGAG CTGGCGAATC CAGCCGGTGG GCTTGCTTTC GCTCGGGGTG AATCTGTTCG CGTTGTTTTC GAACTACATG CGAGTTAACT TTGTCAAATC CGCTCTGCCT TTCATGGGGT CGTTCTTCGC AAAC
|
Protein sequence | MRARRAPAAR VTRAIRARGD AGTRARDVAP GATRRGASAT PRATRRPSAR ETRPELYGLD ASWDPLTSGD RRESEESRTP LPETLPNVRW GTSASEAYDL VIVGCGPAGL TAADEASKRG LRVALMDPSP LAPWMNNYGV WCDEFKSLGF DDCYRAVWNK ARVIIDDGDA DGKMLDRAYA QVDRKKLKQK LIARSVTQGV EFGIAAVDSC DNSDPNHSVV TLSDGRKVYA KMVLDATGHS RKLVDFDRDF TPGYQAAFGI VCTVEKHDFP LDTMLFMDWR DEHLSPEFKR ANDRLPTFLY AMPFSETEVF LEETSLVARP GLEFDDLKLK LKERLDYLGV KVTKVHEEEY CLIPMGGVLP TFPQRTLGIG GTAGMVHPST GFMVAKTMLC VRTLVGTLDE ALKAGKRGDI TGALEAAEAA QMNNGKFDAD ATAALVWNSI WPENDLRMRT FMCFGMETLM QLDIDGTRQF FDTFFDLPKD VWAGFLSWRI QPVGLLSLGV NLFALFSNYM RVNFVKSALP FMGSFFAN
|
| |