Gene OSTLU_32875 details

Gene Information

Locus tag: OSTLU_32875
Symbol:
ID: 5003458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009362
Strand:
Start bp: 555
End bp: 1823
Gene Length: 1269 bp
Protein Length: 370 aa
Translation table:
GC content: 61%
IMG OID: 640418879
Product: predicted protein
Protein accession: XP_001419252
Protein GI: 145349672
COG category: [C] Energy production and conversion
COG ID: [COG0039] Malate/lactate dehydrogenases
TIGRFAM ID: [TIGR01772] malate dehydrogenase, NAD-dependent


Plasmid Coverage information

Num covering plasmid clones: 71
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAACG CCGTCGGTGC CGCCGCGCGT CGCGTCGGCG CGTTGCGCGC GCAAGTCGTC 
GATCCCGCCT CATCGTCGCG GCAGACGCCT CAGGAAGCGC CATCCGCGCG CGTCGGGCTC
GTGGATTGGT TCTTCGGTGC GTCCGGGATC GGTGGTAAGC GCGCGTCGTT CACCGTCGCC
GTCCTCGGCG CGGCGGGCGG TATCGGGCAA ACGCTCTCCG CATTCATCAA GGCGAATCCA
AAGGTGGCGG AACTGCGACT CTACGACGTC GCGCCCGTCG TTCGAGGCGT CGCCGCGGAC
GTCTCTCACG TGAACACGCG AGCGAAGGTG AGCGGATACG TCGGTGATGA CGAACTTGAG
GCGTGTTTAC GAGGATGTGA CCTCGTCATC ATTCCCGCGG GCGTGCCGCG CAAACCGGGC
ATGTCGCGCG ACGACTTGTT CGGCGTGAAC GCCGGGATCG TCCGCACACT GTGCGAGGGT
GTGGCAAAGA CGTGCCCGAA CGCGATCGTA AATATCATAT CCAACCCCGT GAATTCAACG
GTTCCCATCG CGGCGGAAGT GTTTAAAAAT CACGGTTGTT ACGATGCGCG CAAACTTTTG
GGCGTGACGC ACCTCGACGT GATGCGGGCG AAGACGTTCG TCGCCGCGGC AAAAGGGTTC
GACGACCCGA CTTTGGTGGA CGTCCCGGTG ATCGGTGGAC ACGCGGGGAC GACGATTTTG
CCGTTACTGT CTCAAACCAC TCCGCGTTGC TCGTTTACGC CCGAGGAAGT GAGCGCGTTG
ACGAGTCGAA TCCAAAACGG TGGCACCGAA GTCGTCGAAG CGAAGGGAGG CGCCGGAAGC
GCCACGCTCT CCATGGCCGC TGCCGCGGCG GAGTTCGCGG ATGCGTGTCT CAGAGGATTG
AGCGGTGAGT CTGGAATATG GGCGTGTGCG TACGTCGAGA GCAAGGCGAC GCGGGCGCCT
TTTTTTGCCA CCAAGGTGCT CCTCGGACGA AACGGCGTGG AGCGCGTGGC GGGCACTGGA
ACGCTATCAT CGTACGAGAA GCGCGCGTTG GAGAGCATGT TACCAGAACT GGAAGCTAGC
ATTAAAAAGG GGATCAATTT CCTTCATTCC TAATCGACGA GCGCGACCGA CTGACCGCGA
CCACCTTTAG AGACGGCGTC TCGTCGTCGA ATGTAAAGAA ACAATGCACG TCAAGATGCG
AATGTGTGAC CGAACATGTA CGATCGCGCT CTAGAAAGCT AAGGAATAGG AATACGAAGC
GATTCACCC
 
Protein sequence
MSNAVGAAAR RVGALRAQVV DPASSSRQTP QEAPSARVGL VDWFFGASGI GGKRASFTVA 
VLGAAGGIGQ TLSAFIKANP KVAELRLYDV APVVRGVAAD VSHVNTRAKV SGYVGDDELE
ACLRGCDLVI IPAGVPRKPG MSRDDLFGVN AGIVRTLCEG VAKTCPNAIV NIISNPVNST
VPIAAEVFKN HGCYDARKLL GVTHLDVMRA KTFVAAAKGF DDPTLVDVPV IGGHAGTTIL
PLLSQTTPRC SFTPEEVSAL TSRIQNGGTE VVEAKGGAGS ATLSMAAAAA EFADACLRGL
SGESGIWACA YVESKATRAP FFATKVLLGR NGVERVAGTG TLSSYEKRAL ESMLPELEAS
IKKGINFLHS
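
Consistency check (illustrative)

The record lists a gene length of 1269 bp but a protein of only 370 aa, which can look inconsistent at first glance. The sketch below is one way to reconcile the two figures. It assumes 1-based inclusive coordinates and the standard stop codons (TAA, TAG, TGA); the Translation table field is blank in the record, so the choice of stop codons is an assumption. The function names `clean`, `span_length`, and `first_stop` are hypothetical helpers, not part of any database tool.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard nuclear genetic code


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to lay the sequence out in blocks of 10."""
    return "".join(seq_text.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def first_stop(dna: str) -> int:
    """Return the 0-based index of the first in-frame stop codon, or -1 if there is none."""
    for i in range(0, len(dna) - 2, 3):
        if dna[i:i + 3] in STOP_CODONS:
            return i
    return -1


# Values copied from the record above.
gene_length = span_length(555, 1823)  # Start bp and End bp
coding_nt = (370 + 1) * 3             # 370 codons plus one stop codon
trailing_nt = gene_length - coding_nt  # nucleotides listed after the stop codon

# First 40 nt of the 19th sequence line (gene positions 1081-1120).
tail = clean("ATTAAAAAGG GGATCAATTT CCTTCATTCC TAATCGACGA")
stop_offset = first_stop(tail)

print(gene_length, coding_nt, trailing_nt, stop_offset)
```

Under these assumptions, the first 30 nt of that block encode the final ten residues of the protein (IKKGINFLHS) and are followed by a TAA codon, so the listed gene sequence appears to contain the 1113 nt coding region followed by 156 nt of downstream sequence.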