Gene OSTLU_28572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_28572 
Symbol 
ID5006491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009373 
Strand
Start bp96881 
End bp98080 
Gene Length1200 bp 
Protein Length286 aa 
Translation table 
GC content64% 
IMG OID640421912 
Productpredicted protein 
Protein accessionXP_001422390 
Protein GI145356339 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0101] Pseudouridylate synthase 
TIGRFAM ID[TIGR00071] pseudouridylate synthase I 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value0.0301527 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.000000212396 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
CGGGCGTCCA TGCGGCGCGC GCGACGCGAT GGCGACGTGT GAATGGACGC GCGCGATCGT 
TCGAGCGCGC GCCGCGCGCG CGGAGAGGCG CGCAGACGAC GCGCGGGGAG GATTTCGAGC
GCAAAAAGTG CGCTCGCGCG CTCGGGCTGG CGCGGGAAAG CGACGCGCGC GGGCCAACGC
GACGGTTCGC GCGAGCGAAA ACGACGAAAG CGGCGCGGCG TCGAACGCGA GCGTCCGCGA
GAGCGAGCCG TATGGGGAAA ACGCGGCGAA AAACGCGGCG TATAAGGCTT CGATGTCGGA
CGCGCAGCTC GAGCTGCATC GCGCTTCGGG ACGCGACGTC ACGTACGCGC TGAAGATTTC
GTACGATGGC GAGCGTTATA ATGGATTTCA GTACCAAGGC GAAGACGTGC CGACGATTCA
GCGCGAACTC GAGCGCGCGC TCGCGAAGCT GACCGGGATC GATCGAGAAC GGCTTCGATT
GGGCGCCGCC GGTCGCACCG ACGCCGGCGT GCACGCGCGA GGGCAAGTCG CGCACTTTTA
TTGCGAAAAA TCGTTAGGAG AAGACTTGAC GCGGTGTCAA AAGGCGATGA ACGGGATGTT
ACCGAAAGAC ATTCGCGTCG ACGCGTTCTG GGAACCCCAT CCGTTGTTTC ATTCGAGATT
TCACGCGAGC GGCAAGACGT ACCACTACTA CGTGGACGCG CGCGCGACGT CGAGCCCGTT
CACGAGAAAG TACGCGCATC AGGTCGGATG GCGCCCGTGC GACGTCGAAC TGTTGCGTCA
AGCCGCGCAG TTGTTCGTCG GGACGATGGA TTACAAAGGG TTTTGCAACA CGTCGAGGGA
TAAATCAAAC GAGGACAGGA ACACGACGCG AACGATTCGT CGCTTCGACG TCTTCGAGGA
CGCCGCGGAC GACGGATTGA TTCGTCTCGA GGTCGAGGGC GACGGATTTT TGTATCGTCA
AGTGCGCAAC ATGGTCGGCG CGCTCTTGGT CGTCGCGAGC GGCAAGCACG ACCTGGCGTA
CCTGCGAACG CTCATCGAAA CCAAGGATCG CTCGCGCGCG CCCATGGGCG CGCCCGCGCG
CGGCTTGTTT CTTCACGAAG TCTTCTACCC GACCGAGGTC TTAGCGAGAC CTCGTTCGGA
CGACACCGCT TAAACGCGCG CGCGGCGTCG CTCGCGCGTT CGCTCGCCGC GCGCGCCCGC
 
Protein sequence
MSDAQLELHR ASGRDVTYAL KISYDGERYN GFQYQGEDVP TIQRELERAL AKLTGIDRER 
LRLGAAGRTD AGVHARGQVA HFYCEKSLGE DLTRCQKAMN GMLPKDIRVD AFWEPHPLFH
SRFHASGKTY HYYVDARATS SPFTRKYAHQ VGWRPCDVEL LRQAAQLFVG TMDYKGFCNT
SRDKSNEDRN TTRTIRRFDV FEDAADDGLI RLEVEGDGFL YRQVRNMVGA LLVVASGKHD
LAYLRTLIET KDRSRAPMGA PARGLFLHEV FYPTEVLARP RSDDTA