Gene OSTLU_43511 details

Gene Information

Locus tag: OSTLU_43511
Symbol:
ID: 5006548
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009374
Strand:
Start bp: 118707
End bp: 120098
Gene Length: 1392 bp
Protein Length: 442 aa
Translation table:
GC content: 55%
IMG OID: 640421969
Product: predicted protein
Protein accession: XP_001422490
Protein GI: 145356548
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID: [TIGR01790] lycopene cyclase family protein
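
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention, although the listed values agree with it). The helper names `span_length` and `untranslated_bases` are hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def untranslated_bases(gene_length_bp: int, protein_length_aa: int) -> int:
    """Bases in the gene span that are not covered by the protein's codons."""
    return gene_length_bp - 3 * protein_length_aa


gene_len = span_length(118707, 120098)
extra = untranslated_bases(gene_len, 442)

print(gene_len)  # 1392, matching the listed gene length
print(extra)     # 66 bases not represented in the 442 aa protein
```

The 66 bases left over would be consistent with the gene being marked as spliced, but the record gives no exon boundaries, so this check cannot say where those bases lie.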


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.746424
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGATG ATCGCGAATG GATTGCGTTT CAACAGCGCA AGGTGTTTAG TGAGCAAAAG 
CAAATCAAAG AGTACCTCAG TGCTTTGAAC GACCGCGACA AGGTCGACGT TCTCGTTGTC
GGTGCGGGCC CCGCAGGTCT GGCGATCGCA GCGGAGACGG CGAAGAAGGG TCTTTCTGTT
GGTCTCGTCG CACCAGACAC CCCGTTCGTG AACAACTACG GAGTATGGCT CGACGAGTTC
AAAGATCTAG GGCTCGAACA CTGCTTGCTT CATAAGTATG ACGACGCATT GGTTTGGTTC
GATGATTCTG ATCCTGCGAG TGGAACTGAA CTCGGTCGAC CTTACGGTCA AGTGTGCCGC
AGGCGTCTTC GCGACCATTT GTTGAAGGAG TGCGCGGCGG CTGGCGTCAA GTATTTACCA
GGCCTGGTAG ATTTTGTGCG TCACGGTGAC GTCGAAAAGA ACGAGTTAGC CGAAGTTCGA
GGCACCATCA TTAGCGATTC CGATATTACC GCAAGCGAGA AGAAGTTGAT CGAAGAAGAG
GCAAACAGAG GCCAGCAATT CACGTTGAAT TCGCGTCTCG TCGTTGCCGG CACCGGTCAC
AACCGCGACA TGCTCAGCTA CGAAGAGGGT GCGCCGCCGG GCTGGCAGAC TGCGTATGGC
GTTGAGGTGC GCATTCCGAA CCACGGTTTT CCCGTGAACA AGGCCGTGTT CATGGATTTT
CGTCAAAGCG ATCCGGAGGC GATGAAAGAG GAACAAGACG AGGGCGTTTG GCGCGTGCCG
TCTTTCCTTT ACGTGTTACC CGTGGACAAG GATGTGGTGT TCGTCGAGGA GACGTGCCTC
GTCGCGCGCG TACAAGTGCC GTTCGATGAA CTCAAACGGC GATTGTATCG TCGTATGAAG
CGGATGGGTA TGGAAATCGT CGAAGAAGAC ATCTTGGAAG TCGAGGCGAG TTGGATTCCA
CTGGGCGGTA CCCCGCCGGT TGCCCCGCAA CGCACCATCG CGTACGGTGC AGCAGCCGGC
ATGGTCCACC CTGCGTCTGG CTACTCCGTC GTAAACAGTA TTAGCAAAGC TCCGCGTGTT
GCGACGGCCA TGGCCGAAGG CTTGAAGGAG GGTGGCGAGA TTGAGGCGAG CCGAAGAGCG
TGGGAAATCC TTTGGGGTGC GGAGCCACGA AGACAAATCG GTTTCTACCA GTTCGGTATG
GAGCTTCTCA TGTCGCTTCG CATCGAGCAG ATGCGCAACT TCTTTAGTAC CTTCTTTGCG
CTTCCAACAA ATCTGAGCAG AGGATTTTTG GGTAACAGAT TGTCGAGCTC AGAGTTGATC
ATGTTTGCTC TCACTACGTT CGCAATTGGT AACAACGAAC TTCGTGGGTT GTTGCTCGCT
CACCTGGTTT CA
 
Protein sequence
MKDDREWIAF QQRKVFSEQK QIKEYLSALN DRDKVDVLVV GAGPAGLAIA AETAKKGLSV 
GLVAPDTPFV NNYGVWLDEF KDLGLEHCLL HKYDDALVWF DDSDPASGTE LGRPYGQVCR
RRLRDHLLKE CAAAGVKYLP GLVDFVRHGD VEKNELAEAN RGQQFTLNSR LVVAGTGHNR
DMLSYEEGAP PGWQTAYGVE VRIPNHGFPV NKAVFMDFRQ SDPEAMKEEQ DEGVWRVPSF
LYVLPVDKDV VFVEETCLVA RVQVPFDELK RRLYRRMKRM GMEIVEEDIL EVEASWIPLG
GTPPVAPQRT IAYGAAAGMV HPASGYSVVN SISKAPRVAT AMAEGLKEGG EIEASRRAWE
ILWGAEPRRQ IGFYQFGMEL LMSLRIEQMR NFFSTFFALP TNLSRGFLGN RLSSSELIMF
ALTTFAIGNN ELRGLLLAHL VS
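
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step; the function names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first row of the gene sequence, so its GC value is not expected to match the whole-gene figure listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAAGGATG ATCGCGAATG GATTGCGTTT CAACAGCGCA AGGTGTTTAG TGAGCAAAAG"

seq = clean_sequence(first_row)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this row only
```

Pasting the full gene sequence into `clean_sequence` should give a string whose length can be compared with the listed gene length, and the same function works for the protein sequence when checking the listed protein length.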