Gene OSTLU_119571 details


Gene Information

Locus tag: OSTLU_119571
Symbol: Wdr50
ID: 5000349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009356
Strand:
Start bp: 586721
End bp: 587956
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table:
GC content: 48%
IMG OID: 640415770
Product: WD-repeat protein
Protein accession: XP_001416430
Protein GI: 145343655
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
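
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the assumption reproduces the listed gene length) and that the coding sequence ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # Assumption: the terminal codon is a stop codon and encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(586721, 587956)
print(length)                  # 1236
print(protein_length(length))  # 411
```

Both values agree with the Gene Length (1236 bp) and Protein Length (411 aa) fields of the record.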


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAGGCT CACGTGGCCG TGCCTGGATA GACGACCATG GTGACGGCGG TTTCTGCGGA 
AATTTTGCGA ACAAGCGACG GGTGGAGGCA GTTGAGTTTG CGGGTTCTGA ATTGGCTCTT
TCTCAGCCAG ACGAAGAATC TATTTCCTTC ATCGGGTCGA AAGTAGTCAA GCGTTGGTCC
GATGGTACCC ATACCAGCAC CGGTCGTGCT TTCCCAGAAG TTGGAAACTC CGCAGTTCGT
GTAGGGCTGG AAGCTCGGCG CAACCTGAAT CTTCTTGATT TTACGAGGGT GAAAGATTTT
GATTTTGACG GCCCCCCTGA GAGTGCAGCT ATTTGTGTGC ATTTTCACCA CAAACGTGAG
TTGGCCTTGA GTGCTAGTGC CGACTCTCGG GTGAATATAT TTAGGGTGGA CGGTAAAAAC
AACAAGCTGT TTCAGACACT GTCGATAGAA AATGTATCAA TTAAGAACGT AGAATTCAGC
AGGAGTAGCG ATTACACTAT TATATGCGGC AAAGGGAACT TATTATTGGG CCATCTTGAG
AGAAACACCG TTGAGAGGGT CCACATGCGT AGAGAGACGC TCGTCCGCGG TACCTTTGCT
CAAGCTCCAG GCACAGACTT GTTAGGTATT CCAGGATCAT CCGAAGTGAA TATAATTTCA
CAAACGAGCC GCACGTGTGT TATGAAACTC AATACTACTG GTACGTCGGT TCGTGCATGT
GTATTTTCAC ACGCCGGGAT GGAGATAATC GGCGTCACAG ATGATGGATT GCTTTATTGT
TGGGATGTTC GGATGCAGCG TTGTCTGAAA AAAGCAGCTG GTTTTGATGA CACAAAATGT
ATATGTTTGA CACCTGACGG TGAGAAAATC ATCACGGGGC AAGGCAATGG CATAGTCAGC
GTCCATAGAT TCGCAGATGT GGACTGGAGT GATCCATATA AGAGAAGAAC GCAGAGTCCA
GCAAAGCGCA TATCAAGTCT TTCGACATCG GTCACATCAC TAAGTTCAGA TCCTCATGGT
GATATCATCA TCATGTCGTC ATCATTCAAG AAGAACGCTT TACGAGCCTT CCACGTCCCT
ACGTTATCGA TGGTCAAAGC TTGGCCAACT AGCTCAACCC CTTTACACTA CGTCTCGTCG
ACAGCTTTCA ACAACGATGG CACACTTCTG GCCGTCGCGA ATGCGAGGGG TCGCATACTC
ACGTATCGAG TTTGCTCTCG GCAAACAAAT GCATGA
 
Protein sequence
MGGSRGRAWI DDHGDGGFCG NFANKRRVEA VEFAGSELAL SQPDEESISF IGSKVVKRWS 
DGTHTSTGRA FPEVGNSAVR VGLEARRNLN LLDFTRVKDF DFDGPPESAA ICVHFHHKRE
LALSASADSR VNIFRVDGKN NKLFQTLSIE NVSIKNVEFS RSSDYTIICG KGNLLLGHLE
RNTVERVHMR RETLVRGTFA QAPGTDLLGI PGSSEVNIIS QTSRTCVMKL NTTGTSVRAC
VFSHAGMEII GVTDDGLLYC WDVRMQRCLK KAAGFDDTKC ICLTPDGEKI ITGQGNGIVS
VHRFADVDWS DPYKRRTQSP AKRISSLSTS VTSLSSDPHG DIIIMSSSFK KNALRAFHVP
TLSMVKAWPT SSTPLHYVSS TAFNNDGTLL AVANARGRIL TYRVCSRQTN A
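
The sequences above are laid out in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a rough sketch of how the blocks might be joined, the GC content computed, and the gene sequence translated. It assumes the standard genetic code (NCBI translation table 1); the Translation table field of this record is empty, so that choice is an assumption, although it does reproduce the first residues of the listed protein. The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGGAGGCT CACGTGGCCG TGCCTGGATA"
print(translate(first_blocks))  # MGGSRGRAWI
```

Passing the full gene sequence to `gc_content` and `translate` would allow a comparison against the GC content field and the protein sequence of the record.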