Gene OSTLU_4520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_4520 
Symbol 
ID5002078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp86761 
End bp87750 
Gene Length990 bp 
Protein Length330 aa 
Translation table 
GC content61% 
IMG OID640417499 
Productpredicted protein 
Protein accessionXP_001417668 
Protein GI145346382 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID[TIGR01289] light-dependent protochlorophyllide reductase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.645511 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.871752 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ACTGAGACGA AGAAAAGGGT CGTGATTACC GGGTCGAACT CGGGGATCGG GCTCGACGCG 
GCGACGAAAC TCGCGGCGAG CGGGGACTGG GTCGTCGTGT TGGCGTGCCG AACGCGCGCG
AAGGCGGAGG CGGCCAAGGC GAATATATTG TCTGCGACGA ACGCGGACGG GGCGAACATC
GAGTGCGTCG AGTGCGATTT GTCGAGTTTG GACTCGGTGC GCGCTTTCGT GCGCGAGGTG
AGGAAAACGG GCGGTGTCGA CGCTTTGTGT CTGAACGCCG GCGTGGAATA CAGCGGCGAT
CCCGTGGTGC ATCGAACGAG GGACGGTTTC GAGGAGACGT TCGGTGTGAA CCATTTGGGG
CACTTTTTGC TCGCCAACTT GTTATTGGAG GATCTCGAAA AGTCGAGCGA GGCGCATCCG
CGAATCGTCG TGACGGCGAG CGAGGTGCAC GACCCGGCGT CGCCGGGAGG ATCGGTGGGC
AGCGGCGCGC ACATCGGCGA CTTGCGAGGC CTCGAACGCG ACGGCGCGGC GTTCGAGATG
GCGGACGGTG AAGCGTTCGA CGCCGATAAG GCGTACAAAG ACTCTAAGTT GGCGAACATG
CTCTTCATGT ACGAGCTCGA GCGACGCCTG CAGGCGAGAA ACTCGAAAAT CACGGTGAAC
GCGTTCGGTC CGGGACTCAT CACGCGCACC GGCTTATTTC GCAACCAAAA TCCTCTCTTC
GTCAAAGTCT TCGACTTCGC CACGAACGAG ATTTTCCACG TCGCAGAAAC CGTTTCCGGA
GGTGGGAACT GCTTAGTCTT CATGCTCACC GACCCTTCGC TCGAGGGCAG CGGGGGCGTG
TACTGGAACA ACGATTTGTC GCCCGGCGCG CCGCCGTCCC TCGTCGCCGC CGGACACAAA
TTCGCTCAAA CCAACTCTTC TGTCGAATCA AACGATGCCG TCGAAGCGCA AAAGCTTTGG
AAGCTCAGCG AATCGCTCGT CGGGTTGGCC
 
Protein sequence
TETKKRVVIT GSNSGIGLDA ATKLAASGDW VVVLACRTRA KAEAAKANIL SATNADGANI 
ECVECDLSSL DSVRAFVREV RKTGGVDALC LNAGVEYSGD PVVHRTRDGF EETFGVNHLG
HFLLANLLLE DLEKSSEAHP RIVVTASEVH DPASPGGSVG SGAHIGDLRG LERDGAAFEM
ADGEAFDADK AYKDSKLANM LFMYELERRL QARNSKITVN AFGPGLITRT GLFRNQNPLF
VKVFDFATNE IFHVAETVSG GGNCLVFMLT DPSLEGSGGV YWNNDLSPGA PPSLVAAGHK
FAQTNSSVES NDAVEAQKLW KLSESLVGLA