Gene OSTLU_35336 details


Gene Information

Locus tag: OSTLU_35336
Symbol:
ID: 5002963
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009361
Strand:
Start bp: 224999
End bp: 226459
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table:
GC content: 61%
IMG OID: 640418384
Product: predicted protein
Protein accession: XP_001418644
Protein GI: 145348415
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
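
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 224999
end_bp = 226459
gene_length_bp = 1461
protein_length_aa = 486


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumption)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


print(span_length(start_bp, end_bp))            # 1461
print(expected_protein_length(gene_length_bp))  # 486
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.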


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.00292419
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACCG ACGACGCGTA CGGACGATGG AAGTCGTTGG TCCCGTTCGT GTACGACTGG 
TTCGCGCACA CGCGGACGTC GTGGCCGTCG CTGTGCGCGC GCTGGGGCGA GGTGCTCGAC
GCGAACGACC ATCGCTCGCG ACAGCGCGTC TATCTCACCG AACAGACCGA AGGGACGACG
GCGAGCGGGA AGCCGACGCC GAACACGATA TTGGTGTGCC AGGCGGAGGT GGTGCGACCG
CGCGTGGCGG CGGCGGAGCA CATGATTTTC GATGAACACG CAAAGTCGCC AATTTTAAAG
AAGGAAAAGG CGCTGTGGCA CCCGGGAGAG GTGAATCGAA TGCGGTGCGT GCCGGGGAAA
GAAAACGTGC TGTTGACGCA CACGGATGCG CCGGAGGTGT TCGTGTTCGA CGCGAACGGG
CCGGGAGGGA AGCAGAGCGC GTGTAAGAGA GCAGACGGGA CGCAGTACAC GCCGCCGACG
GCGTGCTTGC GAGGACACAC GGAAAACGCG GAATACGCGC TGGCGGTGTC GACGGTGGGA
GAGGTGGTGG CGAGTGGAGG TAAGGATGAA AAGGTGATGA TTTGGGAGCT CGGAGATGCG
AGCACGGGGG GCGGGGCGAG AGGAAAGGAG GAGAAGGAGG GAAGCGGCGC GCCCGTGGTG
GGCGGCGGGT TGAGCTCGAC GGAACTCGCG AGACACACGT CTATTTGGGC GCGCGTCGAG
TTTTCGGGGC ACACCGATAC GATCGAGGAT GTGTGCTTTA ACCCACGGAA CGAGCGGGAG
CTGTGCTCGG TCGGGGATGA TCGGAATATG TTTTTTTGGG ACACGCGAAC GAAGAAGGCG
GCGGGGTTCG CGAAGGGGGC GCACGCGGAC GACGTGCACT GCGTCGCGTG GAGCGCGTTC
GAAGAGCACG TCATCGTTAC TGGTGGAAAA GACACCACCG TTAAGGTTTG GGATCGTCGA
ACGCTGTCCG ATAGCTCGAA CGAGGCAATG CACACGTTCG ACGACCACAC CGACAGTGTT
TTGTGCGTGG ACATGCACCC GCAGGCAAAG GGGGTTTTCA TGACAGCCGA CGAAGTAGGC
CGCGTGAACG TGTTTGATTA CTCGAAAGTC GGCGCTGAAC AGAGTGCGGA ACAAGCAAAA
GCTGGTCCGG CGCACTTGGT CTTTCAGCAC AGCGGCCATC GTGGGACGGT TTGGGATATT
CAGTGGAACC CTTACGACTC CTGGACCGCG TGCTCGACCT CGGTCGGGGA CTTTCAGAAT
ACTTTGCAAC TCTGGCGCGT GAACGATTTG ATCTATCGCG ACGAAGAGGA GTGCATTCGT
GAGCTCGAAC AACATCGGGA TATCATATGT GGTCGCGCGG CGCTGAAACA GTCAGAGCCG
TCGGTGAAGG AGGAAAAAAC GGACGCCGAC ACCGACGGCG GCTCTATTAT CATCGAAGAC
GACCGCGTGG ACGAAGACTA G
 
Protein sequence
MITDDAYGRW KSLVPFVYDW FAHTRTSWPS LCARWGEVLD ANDHRSRQRV YLTEQTEGTT 
ASGKPTPNTI LVCQAEVVRP RVAAAEHMIF DEHAKSPILK KEKALWHPGE VNRMRCVPGK
ENVLLTHTDA PEVFVFDANG PGGKQSACKR ADGTQYTPPT ACLRGHTENA EYALAVSTVG
EVVASGGKDE KVMIWELGDA STGGGARGKE EKEGSGAPVV GGGLSSTELA RHTSIWARVE
FSGHTDTIED VCFNPRNERE LCSVGDDRNM FFWDTRTKKA AGFAKGAHAD DVHCVAWSAF
EEHVIVTGGK DTTVKVWDRR TLSDSSNEAM HTFDDHTDSV LCVDMHPQAK GVFMTADEVG
RVNVFDYSKV GAEQSAEQAK AGPAHLVFQH SGHRGTVWDI QWNPYDSWTA CSTSVGDFQN
TLQLWRVNDL IYRDEEECIR ELEQHRDIIC GRAALKQSEP SVKEEKTDAD TDGGSIIIED
DRVDED
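
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocked gene sequence could be cleaned, translated, and scored for GC content. It assumes the standard genetic code, since the Translation table field in the record is empty; the helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Standard genetic code (assumed; the record leaves "Translation table" blank).
# Codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; a stop codon is rendered as '*'."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and the final line of the gene sequence above.
print(translate("ATGATCACCG ACGACGCGTA CGGACGATGG"))  # MITDDAYGRW
print(translate("GACCGCGTGG ACGAAGACTA G"))           # DRVDED*
```

The two translated fragments match the first ten and last six residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.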