Gene OSTLU_41212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_41212 
Symbol 
ID5002508 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp658510 
End bp660210 
Gene Length1701 bp 
Protein Length555 aa 
Translation table 
GC content60% 
IMG OID640417929 
Productpredicted protein 
Protein accessionXP_001418296 
Protein GI145347692 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0240958 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.449043 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCCGT GGACGCGACA GGTGGTGTCT GGGGTGGAAC TGCTGCGGTC GGGGAAGTAT 
AACAAAGGGA TGAGCTTCAC GCGGGACGAA CGCGATAGGT TGAACTTGCG CGGGTTGTTG
CCGCCGGCGG TGTTCGATCA GAGCGTGCAG GTCGAGCGCG TGATCGAGCG CTTGCGTCGG
GTGACGAACG ACGTCGAAAA ACATGCGTGG CTCGCGTCGC TGTACGAACG AAACGAGCGG
CTGTTTTATC GCGTGGTGAA GGACCATTTG GAGGAGTTGT TGCCGATTTT ATCGGCGCCG
ACGGTTTGGC AAGTGTGCGC CGAGTTTGGT TTGATGTACA GACGACCGCG CGGGTTGTAC
ATCTCCATCA AAGACCGCGG GTCCGTGTAT AGGCTGTTGA AAAATTGGCC CGTGCGCGAC
GTCAAGGCGA TCGTGCTCAC GGACGGGCAA CGCGTCACGG GACTCGGCGA TTTGGGCGTC
CAAGGCATGG GCACGGCGGT GGGCAAGTCG ACGTTATTCA CCGCGCTCGG CGGCCTCGAC
CCGGCGGACG TGTTGCCAAT TTGTATCGAT GTGGGTACGG ACAACCAGGC GTTGCTCGAG
GATAAGTTTT ACATCGGTCT GAGGCAAAAG CGAACCGGCG GCGAGGAATA CGACGACTTG
CTCGACGAAG TCGTCTACGG GTTGAAGCGG CGTTTCGGCC CGCGCGTGTT GCTATGCTTC
GAAGAATTTT CCAACAAGAA CGCAAAGCGA TTGCTTGATC GATACAACAG CAATTCTGTC
GCGTATTGCG ATGATTTGCA GGGCATCGCT GCGACGACGT TGGCGGCGAT TATCTCGGCG
CTGCCGCAAA TGGGCGGATC ATTGAGAGAG CAGCGGTTTC TTTTCGCCGG CGCCGGGGAG
ACCGGTGCAC ACACCGCAGA TTTATTGGCG ACGTATATTT CGCAACAACA CGGCATATCG
TTGCCCGAGG CGCGAGAGAA CATTTATTTC ATAGATAGAA AAGGTTTGGT GACGCGCGAT
CGGGCGCAAC GCGAGGACGA TTTGGAAATT CACAAACTAC CCTACGCGCA CGACATGGAA
GGTGCGGCGT CCGTGCGCGA ATCCGTCGAG CTCATCAAAC CGACGGCGTT GATTGGCGTC
CGTCGACACC GATTTTCCTT CTTCGAAGGC TCAGTGCTGA AGGATGAAAA GCTCTTCACC
GAGGACGTGT TGCGCGCGAT GGCGAAACAT AGCGAGAAAC CGCTCATCAT GGCGCTTTCG
CGACCGAGCG CGCTGCGCGA GTGCACGGCC AAGGAAGCGT ACGAGGCGAC GAACGGGAAG
TGCATCTTCG TCGGCGGGTG CAAGTCGACG CCGTTCGAGT ACGCGGGTAG AGAAATCGCG
CCGTCCGAAT GCAGCACCGA ATACGTCTTT CCGGGATTGG GACTCGGCTT GACCATCGCC
GAAGGCACGC GCGTGCGAGA CTCTTTACTC ATGGAAGCCG CCGAAGTCGT GGCCAACAGC
GCCACTCCGG GCGATATCGC TCGCGGCGCC GTGTTCCCGC GAAAGCGCCA CATTCCCGAC
GTCTCCGCGC GCGTCGCCGC GCGCGTCGCC GGTAAGGCGT TCGCGAGCGG TTTGTCCGCG
CTCCCGGGCA AGCCCATGGA CTGGCTTCGC TTGGCGAAAT CGTGGATGTT CGACCCGACG
TATCGCCCCT ACACGCCTTG A
 
Protein sequence
MVPWTRQVVS GVELLRSGKY NKGMSFTRDE RDRLNLRGLL PPAVFDQSVQ VERVIERLRR 
VTNDVEKHAW LASLYERNER LFYRVVKDHL EELLPILSAP TVWQVCAEFG LMYRRPRGLY
ISIKDRGSVY RLLKNWPVRD VKAIVLTDGQ RVTGLGDLGV QGMGTAVGKS TLFTALGGLD
PADVLPICID VGTDNQALLE DKFYIGLRQK RTGGEEYDDL LDEVVYGLKR RFGPRVLLCF
EEFSNKNAKR LLDRYNSNSV AYCDDLQGIA ATTLAAIISA LPQMGGSLRE QRFLFAGAGE
TGAHTADLLA TYISQQHGIS LPEARENIYF IDRKGLVTRD RAQREDDLEI HKLPYAHDME
GAASVRESVE LIKPTALIGV LLKDEKLFTE DVLRAMAKHS EKPLIMALSR PSALRECTAK
EAYEATNGKC IFVGGCKSTP FEYAGREIAP SECSTEYVFP GLGLGLTIAE GTRVRDSLLM
EAAEVVANSA TPGDIARGAV FPRKRHIPDV SARVAARVAG KAFASGLSAL PGKPMDWLRL
AKSWMFDPTY RPYTP