Gene OSTLU_35343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_35343 
Symbol 
ID5002981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp416762 
End bp418024 
Gene Length1263 bp 
Protein Length420 aa 
Translation table 
GC content59% 
IMG OID640418402 
Productpredicted protein 
Protein accessionXP_001418696 
Protein GI145348522 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.0838869 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0559638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCTGT TGGTGTTCGT GAACTCGAAG AGCGGGGGGC AGATGGGAAC GTACATGCTC 
GAGTCGCTGC GGTCGAATTT AAATCCGCTG CAAGTGGTGG ATTTACACAA CACCGGTCCA
AAGGCGGCGT TGAAGCTCTT CGCCAACGTG CCAAACGTAC GAATTTTAGT CGCGGGGGGA
GACGGAACGG TGGCGTGGAT TTTGCAAACG CTCGACGAAA TCGACGTTCC GAAGAAGCCA
CCGGTGGGTG TTTTGCCTCT GGGCACCGGG AACGACTTGG CGAGAGTACT GGGATGGGGT
GGTGGGTACT CGAACGAGCT CATCTCTGAA CTTTTAGTGC AAGTGCTCGA GGCGCACCCC
GCGCTGCTGG ATCGCTGGCA GGTGGAAATC ACCGCCAACG AGCCGCCGAA AACGCCAAGC
AAGTTCGCAT CCGCGGCTGG GTTACCGGCG GCGCCGCCCT TGCCGAAGAA GAAGGAGATT
GTTTTCCAAA ACTATCTCGG CATCGGCGTA GACGCGCAGG CGGCGCTGCG CTTCCATAGA
ACTCGCAACT TGCGACCTCA ATTGTTTTTT AGCGCGATGA CGAACAAACT GCTGTATGGA
GCGTTCGGGG CAAAAGATGT CCTCGAGCAC TCTTGCGCAG GTTTGCACCG AAGCATTAGA
ATTTACGCCG ATGGCGTGCG ACAGACGATT CCGCCCGAAG CCGAGGGAAT CATTTTGCTC
AACATCAACT CCTTCGCGGG CGGCGTGCGG ATGTGGGAAC GCGACGGGTC GTACGGTGTG
TCTTCGATGC AGGACGGCAT GGTGGACATC GTCGTCGTGC ACGGTGCCTT GCACTTGGGT
CAGCTGAACA TTGGCGTCGA CAAACCCGTG CGCATTTGTC AAGCGCGGGA AGTCCGCGTC
GTCGTCGATC GCAAAATCCC CATGCACGTC GACGGCGAGC CGTGGGAGCA ACCCGCGTGC
ACGATGGATA TTAAACTGAG AAATAAAGCC ACGATGCTCC GTCGAACCGC GGACGTGCGC
GGGATGACGG TGATCGAGAT GCAAAACACC CTCGATTGGG CGTGCAAAGA GGACATAATC
TCCGAGCCCC AGCGCGAGCA AATCATGGTC GAAGCCTACC GCCGCGCCGA CGCCCGCTCG
ATGGAAAACG GCCATCGCCG TTCGGGTTTG CACCGCCGAT CGGGCAGCAT CGGCAACCTC
TTAAACGCCA AGAGTTCGTC TTACAGCCAG CTCTTCGCCG GCGACGGCTT CGGTCTCGGG
TAG
 
Protein sequence
MPLLVFVNSK SGGQMGTYML ESLRSNLNPL QVVDLHNTGP KAALKLFANV PNVRILVAGG 
DGTVAWILQT LDEIDVPKKP PVGVLPLGTG NDLARVLGWG GGYSNELISE LLVQVLEAHP
ALLDRWQVEI TANEPPKTPS KFASAAGLPA APPLPKKKEI VFQNYLGIGV DAQAALRFHR
TRNLRPQLFF SAMTNKLLYG AFGAKDVLEH SCAGLHRSIR IYADGVRQTI PPEAEGIILL
NINSFAGGVR MWERDGSYGV SSMQDGMVDI VVVHGALHLG QLNIGVDKPV RICQAREVRV
VVDRKIPMHV DGEPWEQPAC TMDIKLRNKA TMLRRTADVR GMTVIEMQNT LDWACKEDII
SEPQREQIMV EAYRRADARS MENGHRRSGL HRRSGSIGNL LNAKSSSYSQ LFAGDGFGLG