Gene OSTLU_42336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42336 
Symbol 
ID5003350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp148333 
End bp149625 
Gene Length1293 bp 
Protein Length430 aa 
Translation table 
GC content62% 
IMG OID640418771 
Productpredicted protein 
Protein accessionXP_001419083 
Protein GI145349318 
COG category[C] Energy production and conversion 
COG ID[COG0039] Malate/lactate dehydrogenases 
TIGRFAM ID[TIGR01757] malate dehydrogenase, NADP-dependent
[TIGR01759] malate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.22262 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.238463 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGC GCGCGACGAC GACGACGCGC GCGCGAACGA CGACGACGAC GACGACGACG 
CGGGCGGCGA TAGGTCGACG ACGCGCGCGC GAACCGAACG CGCGACGACG CGGAGACGCG
TGCGTGCGGT ACAACACGCC CGCGGGCGTG GGGAGGAAGG CGGACGATCC TCTGGGGGTG
TTCAGGCTGG AGTACGACAT CTCGATGGAC GAGGCGCATC GACCGAAGAC GTGGAAGCCG
ACGGTCACGG TGGCGGTGAG CGGCGCGGCG GGGCAGATTT CGAATCACTT GTTGTTTAAG
ATCGCGAGTG GGTCGGTGTT CGGACACGAT CAGCCGGTGG TGTTGAGATT GCTCGGGAGC
GAGCGGTCGA GACAGGCGCT GGAGGGGGTG GCGATGGAGC TGGAGGATTG CTTGTTTCCG
TTGTTGCGCG AGGTCGACAT CGGCATCGAC TGCAGGAAAG TCTTCGCGGG CGCGGATTGG
GCGCTGTTGA TCGGGGCGAA GCCGCGTGGA CCGGGGATGG AGCGCGGAGA TTTGCTTGAG
ATGAATGGGG CGATTTTCGT CGATCAAGGC AAGGCGTTGA ACGAGGTGGC GAAGCCGACG
TGCAAGGTCA TCGTCGTCGG GAACCCTTGC AACACGAACG CGCTCATCGC GCTGTCGCAC
GCGCCCAACT TGGATCCGCG CAACTTCCAC GCGTTGACCA AGCTCGACGA AAACAGAGCA
AAGTGTCAAC TCGCGCTCAA GGCGGGCGTG TTCTACGAAA CCGTGAGTAA CGTCGTCATT
TGGGGCAACC ACTCCACGAC GCAGGTGCCG GATTTCGTCA ACGCCAAGAT CGACGGTAAG
AAAGCCACCG AAGTCATCAC CGATCAAGAC TGGCTCGAGA ACGACTTCAC TCCCGCGATT
CAAACCCGCG GCGGGTTGCT GATCAAAAAG TGGGGTCGCT CTTCCGCGGC GTCCACGGCG
GTGTCCATCG CCGATCACAT CAGAAATTTG GTCAACCCGA CGCCGGAGGG CGACTGGTTC
TCCACAGCCG TGCTCAGTAA CGGTAACCCG TACGGCATCC AAGACGGCAT CGTTTACTCC
TTCCCGTGCC GCTCCAAGGG CGATGGTTCG TACGAAATCG TTCCCGGTTT AGAAGTGAAC
GACTGGCTTC GCGAGCGCAT GAAGAAGAGC GAAGAAGAGC TCACCAGCGA AAAGGGCTGC
GTCGGCCACC TCGTCGGGGA AGCGCACGTT GACGTCCCAG ACGCAGGGTG CCCGGTCGAT
CTCGAAGACA CTCTTTTGCC AGGTGAAATG TAA
 
Protein sequence
MTARATTTTR ARTTTTTTTT RAAIGRRRAR EPNARRRGDA CVRYNTPAGV GRKADDPLGV 
FRLEYDISMD EAHRPKTWKP TVTVAVSGAA GQISNHLLFK IASGSVFGHD QPVVLRLLGS
ERSRQALEGV AMELEDCLFP LLREVDIGID CRKVFAGADW ALLIGAKPRG PGMERGDLLE
MNGAIFVDQG KALNEVAKPT CKVIVVGNPC NTNALIALSH APNLDPRNFH ALTKLDENRA
KCQLALKAGV FYETVSNVVI WGNHSTTQVP DFVNAKIDGK KATEVITDQD WLENDFTPAI
QTRGGLLIKK WGRSSAASTA VSIADHIRNL VNPTPEGDWF STAVLSNGNP YGIQDGIVYS
FPCRSKGDGS YEIVPGLEVN DWLRERMKKS EEELTSEKGC VGHLVGEAHV DVPDAGCPVD
LEDTLLPGEM