Gene OSTLU_4180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_4180 
Symbol 
ID5002635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp293807 
End bp294790 
Gene Length984 bp 
Protein Length328 aa 
Translation table 
GC content59% 
IMG OID640418056 
Productpredicted protein 
Protein accessionXP_001418894 
Protein GI145348929 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID[TIGR01289] light-dependent protochlorophyllide reductase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0280434 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000647505 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATCGTCACGG GCTCGTCCTC GGGACTCGGG CTGTACACGG CGAAGGCGCT GATCAAGAAT 
GGATACTTCG TCGTCAACGC GGTGCGCTCG CCGGAGAAGA TGCAGGTCAA GGCGAACGAG
CTGGGGCTGG ACGAGTCGTC GTACGCGATC ATGTACCTCG AACTCGGTGA CTTGCAGTCG
GTGCGCGATT TCGCGACCGA GTTCCGACGA TCCAAGTACG TGAAGAACTT TCAAGCGCTG
GTGTGCAACG CGGCGCTGTA TCTCCCGAAC GCCACGGTGC CGTCGTACAC CAAGGATGGG
TTCGAGGAGT GCGTGGGGGT GAATCATCTC GGTCACCACT TGTTGTCTTT GCTCTTGCTC
GACGATCTCG CGGAAGCGCC GGACGCGAAC ATGAAGCGTT TGATCATCGT CGGGTCGGTG
ACGGGCAACA CGAACACGCT CGCCGGTCAA GTGCCGCCGC GCGCGGGTCT CGGCGACATG
TCGGGCTTGA GAAACGGCTT CAAGAATAGC GACCGTAACC AAGGCGCGTT GATCGACGGC
ACTCGCTTCA TCGGCGCCAA GGCGTACAAG GATTCCAAGC TGTGCAACAT GCTCGACATC
AAGGCGTTCG CCGAGCGTTT CGGCGAATCC ACGGGGATCA AGTTCAGCAC GATGTACCCG
GGATGCATCG CGGATTCCAA CTTGTTCCGC AACCACACCG CGTTCTTCCG CTGGTTCTTC
CCGATTCTTC AAAAGAACGT CACCAAGGGT TACGTCAGCG AGGAAGAAGC CGGCGAACGC
CTCGCCTCCA TCGTGTACGA TCCGCGATAC AGCGAGCAAG GCGCGTACTG GGCCTGGAAG
GGTGGTGGCG ACCAGCTTTG GGACAACTAC AACAACAACA ACGACGACAC GCGCACGATT
GCGTTCAACA ACAAGCCGTC GAAGGAAGGC AGAGACATGG CCAAGGCGAA CGAAGTGTTT
GATATTTCCA CCGAGCTCGT CGGC
 
Protein sequence
IVTGSSSGLG LYTAKALIKN GYFVVNAVRS PEKMQVKANE LGLDESSYAI MYLELGDLQS 
VRDFATEFRR SKYVKNFQAL VCNAALYLPN ATVPSYTKDG FEECVGVNHL GHHLLSLLLL
DDLAEAPDAN MKRLIIVGSV TGNTNTLAGQ VPPRAGLGDM SGLRNGFKNS DRNQGALIDG
TRFIGAKAYK DSKLCNMLDI KAFAERFGES TGIKFSTMYP GCIADSNLFR NHTAFFRWFF
PILQKNVTKG YVSEEEAGER LASIVYDPRY SEQGAYWAWK GGGDQLWDNY NNNNDDTRTI
AFNNKPSKEG RDMAKANEVF DISTELVG