Gene OSTLU_33241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33241 
Symbol 
ID5003481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp610361 
End bp611878 
Gene Length1518 bp 
Protein Length505 aa 
Translation table 
GC content64% 
IMG OID640418902 
Productpredicted protein 
Protein accessionXP_001419218 
Protein GI145349602 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0310383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.250936 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCG AGGCGGTGAC GCGCGACGAC GACGACGCGC GCGCGAACGA CGATGGATCG 
AACGATGGAC TCACGGCGTT CGAACGCGCG CGAGAGGCGC ACATCGCGCG GAATAAGGCG
AGGATGGAGG CGTTGAACAT CAACGCGCTG AGCGCGGGCG TCGGCGCGGC GTCTCGCGGG
AGCGCGGCGT CGTCGCGAGG GATCACGGGG AAGCGAAATC GGGATAAAGC GAGCGCGCCG
AGCGTGCCGA CGCGACGGTC GAGCCGGGCG AGGAAGATCG CGCCGGAGTT GGCGAGCGGG
GTGGATCGAG AGCGGCGCGA CGGGACGGTG GTGCTCGCGA ACGGGGCGAC GTATCATCCG
AGCGGCGAAC TCACGGCGGC GCCGACGAGG ACGCGACCGA GCGGTGAGGT GGCGATGCGG
AGTGAAAACG GGAGCGAGAA GACGGACGCG GCGTTCATCG AGGAGTTGAA GACGAAGATG
GGGGCGCACG TCGGCGAGAC GAAGGAGGCG AAGGAGGCGA CGGCGAGCGT GCGAGAGATG
ATCAAGAATA AGCTCACGCT GCGCGACGTG GACGTCGCGA AGTGCGTGCC CAAGGGTGTG
ACGCACTTGG ACTTTTCCCC CGACGAGTCC ATGCTTCTCG TGGCGTCGGG AGATAAGGAG
GGACACATCG GTTTGTGGCG CGTCGATAAG ACGACGTCGG AGGAGGAGGA CGAGGACGAC
GGGGTGTTGT ACTATAAAGC CCACGGGTCG TACATCTCGC ACTGCAAATG GGGCCGCGGC
GCCTTGCGAG GGAAGCTCTT CACGTGCGCG TACGACGGCG CGGTTCGCGT CCTCGATCCG
CAAACCGGTT CGTTTCAGGA AACCGTCTAC TCCGAGGAGG ACGAATTCTC GGCGTGCGAC
CAATTCGCCG ACGGCAACAC CGCATTGGTG TGCGATAACG TCGGTAACTT GCATCAACTG
GATCTGCGCG TGGGTAAATT CACGTCCCCG AGCCTGTCGA TTCACGAGAA GAAGATCAAC
ACCGTGCACA TCGATCCCGG CAACGAGCAC CGATTCGCCA CTTCCACGAA TCAGCTCGTC
AGCGTCTGGG ACGCGCGAAA GTTGAAAAAG AACGCCAAGT CCACGCACGA TCTGGTCCAT
CGAAAATCGT CCCAAGCGGC GTATTGGTGT CCCGACGGCT CCGGCGCGCT CCTCACGACG
TGTTACGACG ACGCCCTCCG CGTCTGGCAC CCCGACCGAT CCGCCGCCGC GCCCACCGCC
ACCATCAAGC ACAACAATCA AACCGGTCGT TGGGTCCTTC CCTTCCGCGC CGTCTGGTCC
GCCGCCGGCG ACGGCGTCTT GTGCGGTTCC ATGACGCGTC AAGTCGAAAT TTTCAACCCC
GCCACCGGCG CGTCGCTCGC GCGTTACGCC TCTCCCGACC ACATGACCGC CATCGCCAGT
CGTCTGGCGT GTCATCGCTC GCTCAATTAC GTCGCCGCCG GCACCGCGAG CGGTCGCGTG
CACGTGTACC GCGCGTGA
 
Protein sequence
MKTEAVTRDD DDARANDDGS NDGLTAFERA REAHIARNKA RMEALNINAL SAGVGAASRG 
SAASSRGITG KRNRDKASAP SVPTRRSSRA RKIAPELASG VDRERRDGTV VLANGATYHP
SGELTAAPTR TRPSGEVAMR SENGSEKTDA AFIEELKTKM GAHVGETKEA KEATASVREM
IKNKLTLRDV DVAKCVPKGV THLDFSPDES MLLVASGDKE GHIGLWRVDK TTSEEEDEDD
GVLYYKAHGS YISHCKWGRG ALRGKLFTCA YDGAVRVLDP QTGSFQETVY SEEDEFSACD
QFADGNTALV CDNVGNLHQL DLRVGKFTSP SLSIHEKKIN TVHIDPGNEH RFATSTNQLV
SVWDARKLKK NAKSTHDLVH RKSSQAAYWC PDGSGALLTT CYDDALRVWH PDRSAAAPTA
TIKHNNQTGR WVLPFRAVWS AAGDGVLCGS MTRQVEIFNP ATGASLARYA SPDHMTAIAS
RLACHRSLNY VAAGTASGRV HVYRA