Gene OSTLU_43800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_43800 
Symbol 
ID5006547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp116907 
End bp118490 
Gene Length1584 bp 
Protein Length528 aa 
Translation table 
GC content60% 
IMG OID640421968 
Productpredicted protein 
Protein accessionXP_001422489 
Protein GI145356546 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID[TIGR01790] lycopene cyclase family protein 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCGC GTCGAGCGCC CGCGGCGCGC GTGACGCGCG CGATTCGCGC GCGAGGCGAC 
GCGGGAACGC GCGCGCGCGA CGTCGCGCCG GGCGCGACGC GGCGCGGGGC GTCGGCGACG
CCGCGGGCGA CGCGACGGCC GAGCGCGAGG GAGACGCGGC CGGAGCTGTA CGGCTTGGAC
GCCTCGTGGG ACCCGCTGAC GAGCGGCGAT CGGCGGGAGA GCGAGGAGTC GCGAACGCCG
CTTCCAGAAA CGCTGCCGAA CGTGCGATGG GGGACGAGCG CGAGCGAGGC GTACGATTTG
GTGATTGTCG GGTGCGGACC GGCGGGGCTG ACGGCGGCGG ACGAGGCGAG CAAGCGCGGA
TTGCGCGTGG CGTTGATGGA TCCGTCGCCG CTCGCGCCGT GGATGAATAA TTACGGGGTG
TGGTGCGACG AGTTTAAATC GCTCGGGTTC GATGATTGCT ATCGCGCGGT GTGGAACAAG
GCGCGAGTTA TTATAGACGA CGGCGACGCC GACGGGAAGA TGCTCGACCG CGCGTACGCG
CAGGTGGATC GGAAGAAGCT CAAGCAGAAG CTCATCGCGC GCAGCGTGAC GCAGGGCGTG
GAGTTTGGTA TCGCCGCGGT CGATAGCTGC GATAACAGCG ATCCGAACCA TTCGGTGGTG
ACTTTGAGCG ATGGACGCAA GGTCTATGCG AAGATGGTTT TGGACGCCAC TGGGCACTCT
CGTAAGCTGG TGGACTTTGA TCGCGATTTT ACGCCGGGAT ATCAAGCCGC TTTCGGAATC
GTGTGCACAG TGGAGAAGCA CGACTTTCCG TTGGACACGA TGCTGTTCAT GGACTGGCGA
GACGAGCACT TGAGCCCAGA GTTTAAGCGA GCGAACGACA GGTTGCCGAC GTTTTTGTAC
GCCATGCCTT TCTCGGAAAC TGAGGTGTTC CTCGAGGAAA CGAGCTTGGT GGCACGACCT
GGCTTAGAGT TTGACGACTT GAAGCTCAAG TTGAAGGAGC GTTTGGATTA TTTGGGCGTG
AAAGTAACCA AGGTACACGA AGAGGAGTAT TGTCTCATTC CCATGGGCGG CGTGTTGCCG
ACGTTTCCGC AACGCACGCT CGGCATCGGT GGAACCGCCG GCATGGTCCA TCCTAGCACT
GGATTTATGG TCGCAAAGAC GATGTTATGC GTTAGAACGC TCGTAGGCAC GCTTGATGAA
GCCCTTAAGG CGGGTAAGCG AGGGGATATT ACCGGCGCCC TGGAAGCGGC GGAGGCGGCG
CAAATGAACA ACGGTAAATT CGACGCCGAC GCCACCGCGG CATTAGTGTG GAACTCAATT
TGGCCGGAGA ATGATTTGCG CATGCGCACT TTCATGTGCT TTGGAATGGA GACTCTTATG
CAGCTCGATA TCGATGGAAC GCGTCAATTC TTTGACACGT TCTTCGACCT TCCCAAGGAC
GTCTGGGCTG GCTTTTTGAG CTGGCGAATC CAGCCGGTGG GCTTGCTTTC GCTCGGGGTG
AATCTGTTCG CGTTGTTTTC GAACTACATG CGAGTTAACT TTGTCAAATC CGCTCTGCCT
TTCATGGGGT CGTTCTTCGC AAAC
 
Protein sequence
MRARRAPAAR VTRAIRARGD AGTRARDVAP GATRRGASAT PRATRRPSAR ETRPELYGLD 
ASWDPLTSGD RRESEESRTP LPETLPNVRW GTSASEAYDL VIVGCGPAGL TAADEASKRG
LRVALMDPSP LAPWMNNYGV WCDEFKSLGF DDCYRAVWNK ARVIIDDGDA DGKMLDRAYA
QVDRKKLKQK LIARSVTQGV EFGIAAVDSC DNSDPNHSVV TLSDGRKVYA KMVLDATGHS
RKLVDFDRDF TPGYQAAFGI VCTVEKHDFP LDTMLFMDWR DEHLSPEFKR ANDRLPTFLY
AMPFSETEVF LEETSLVARP GLEFDDLKLK LKERLDYLGV KVTKVHEEEY CLIPMGGVLP
TFPQRTLGIG GTAGMVHPST GFMVAKTMLC VRTLVGTLDE ALKAGKRGDI TGALEAAEAA
QMNNGKFDAD ATAALVWNSI WPENDLRMRT FMCFGMETLM QLDIDGTRQF FDTFFDLPKD
VWAGFLSWRI QPVGLLSLGV NLFALFSNYM RVNFVKSALP FMGSFFAN