Gene OSTLU_35655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_35655 
Symbol 
ID5002888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp584222 
End bp585388 
Gene Length1167 bp 
Protein Length388 aa 
Translation table 
GC content66% 
IMG OID640418309 
Productpredicted protein 
Protein accessionXP_001418978 
Protein GI145349100 
COG category[A] RNA processing and modification 
COG ID[COG0430] RNA 3'-terminal phosphate cyclase 
TIGRFAM ID[TIGR03400] 18S rRNA biogenesis protein RCL1 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.286653 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0571099 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCCT CCCGCGCGCC CGCGCGCTTC ACCGGCGCCA AGGACTTCCG CGCGCGCATC 
CTGATCAGCG TCCTGAGCGG TAAGCCGTGC GTGATTCGCG ACATTCGCGT CAAAACCGCC
GCGCGAGGCG GCGGTGACGA CGCCGTCGGG CTGCGCGATT ACGAAGTCTC GCTCCTGCGC
CTGATCGATA AGCTCACGAA CGGCACGCGC GTGGACATCA GCGAAGACGG CACCGCGGTG
CGGTTCGACC CGGGCGTGGT GAAGGGAGGA CGCGCGCTGA CGCACGAGTG CGCGACGAGC
CGGGGCGTCG GGTATTACGT CGAGCCGACG CTGGCGTTGG GATTGTTCGC GAAGAAACCG
ATCGAATTGA CGCTCATGGG GGTGACGAAC GACGACGCGG ACGTGAGCGT GGACGTGTTT
AGGACGGTGA CGCTGCCGAT GCTGAAGAAA CACTTCGGCG TGGACGATGG ATTGGCGCTG
GAGGTGGAAC GGCGAGGGTG TCCGCCGAAC GGCGGCGGTC GCGTGCGGTT GACGTTGCCG
ATTGTGAAAA CGCTGCCGAC GCTGGATTGG TGCGACGAGG GCTTGGTGAA ACGGGTGCGA
GGGGTGACGT TTACGTGCAA GGTGTCGCCG CAGAATGGAA ACCGCATGGT GGACGCTGCG
AGAGGGGTGT TGAACGCGTT CATTCCAGAC GTGTACATTT TCACCGACCA TCACGTCGGT
CCGGAGGCGG GGAAGAGCCC AGGGTACGGA TTATCCCTCG TCGCCGAAAC CACCACGGGT
TGCGTGCTCG GCGCCGACGC CGCGTCCACG GCGTGCGCGT CCGCGATGAG CGAGGCGGCG
GATTTAGAAT GGGCCGACGA CGCCGAGGCG CGCGTGCCCG AAGACGTCGG CCGCCGCGTC
GCCGAGGCGC TCGTCGCCGA GATCCAACGC GGCGGCGTCG TCGACAGCAC CCATCAATCC
CTCGCCCTCA TCCTCCTCGC CATCGGTCCC GAGCAAGTGT CCAGAATCCG TCTCGGTCAG
CTCACCCCTC GAGCGATCGA AACCTTGCGC GCGCTCAAAG CCTTCTTCGG CGTCACCTTT
CACGTGCAGC CCGAGCCCGA GAGCGGCACC GTGTTCTGCT CCGTCGTCGG CGTCGGTCTG
AAGAACGTCG CCAGGCGCAG CACGTGA
 
Protein sequence
MPPSRAPARF TGAKDFRARI LISVLSGKPC VIRDIRVKTA ARGGGDDAVG LRDYEVSLLR 
LIDKLTNGTR VDISEDGTAV RFDPGVVKGG RALTHECATS RGVGYYVEPT LALGLFAKKP
IELTLMGVTN DDADVSVDVF RTVTLPMLKK HFGVDDGLAL EVERRGCPPN GGGRVRLTLP
IVKTLPTLDW CDEGLVKRVR GVTFTCKVSP QNGNRMVDAA RGVLNAFIPD VYIFTDHHVG
PEAGKSPGYG LSLVAETTTG CVLGADAAST ACASAMSEAA DLEWADDAEA RVPEDVGRRV
AEALVAEIQR GGVVDSTHQS LALILLAIGP EQVSRIRLGQ LTPRAIETLR ALKAFFGVTF
HVQPEPESGT VFCSVVGVGL KNVARRST