Gene OSTLU_33894 details


Gene Information

Locus tag: OSTLU_33894
Symbol:
ID: 5000920
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 202806
End bp: 204263
Gene Length: 1458 bp
Protein Length: 398 aa
Translation table:
GC content: 57%
IMG OID: 640416341
Product: predicted protein
Protein accession: XP_001416582
Protein GI: 145344112
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1222] ATP-dependent 26S proteasome regulatory subunit
TIGRFAM ID: [TIGR01242] 26S proteasome subunit P45 family
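
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes 1-based inclusive coordinates and that the gene span runs from the start codon through the stop codon; the function name `span_length` is hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_len = span_length(202806, 204263)

# Assumption: the coding sequence is 3 nt per residue plus one stop codon.
protein_len_aa = 398
coding_len = protein_len_aa * 3 + 3

# The gene is marked as spliced, so the difference would be intronic sequence
# under the assumptions stated above.
non_coding_len = gene_len - coding_len

print(gene_len, coding_len, non_coding_len)
```

Under these assumptions the computed span matches the listed gene length of 1458 bp, and the remainder is consistent with the record's "Is gene spliced: Yes" entry.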


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00183921
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0445175
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGTCG ACGCGCGACC GCAAACGGGT CTGCGCGCGT ATTACGACGC GAAAATCGAA 
GAGTTGGAGG TTCGCCTGCG CGACAAGACG CAGAATCTTC GACGCTTAGA GGCGCAGAGA
AATGAATTGA ACGGACGAGG TGCGCGGAGC GCGCGTGAGA AGGGCGTCGT CGTCGTCGAT
CGCGCGAGCG CGGTGCGCGA ACGGACGCGA AAGCGTGATG AGAGGAACGA TCGAGGCGAG
ATCGCGAACG ACGCGCGAAC GACGAGCGAA GACTGACTTT GAATTTCGTT CGGTTTACCG
CGCGCGCGCA GTTCGATCGC TTCGCGAGGA ATTGCACATG TTGCAGGAAC CCGGATCGTA
CGTCGGGGAG GTGGTGAAGG TGATGGGGAA GAAGAAAGTG TTGGTCAAGG TGAGCGGACG
ACTCGCCCGC GCGGCGCGCC GAGCGCGCGG AGACGGCGAG AGAGGAAAGA ATACTGACAG
TTTGTGTTTC GCGCGCAGGT GCACCCGGAG GGAAAATACG TGTGTGATAT GGACAAGAGC
ATCGATGTGA CGAAACTGAC GGCGGGGACT CGGGTGGCGT TGAGGAACGA TTCGTACACG
CTGCACGTGA TCCTTCCGTC GAACATCGAC CCGCTCGTGT CGCTCATGAA GGTTGAAAAG
GTTCCCGATT CCACGTTCGA TATGATTGGT GGATTGGATC AGCAAGTGAA GGAGATCAAG
GAAGTCGTGG AGTTACCGAT CAAGCACCCA GAGCTTTTCG ACGCGCTCGG GATCGCGCAA
CCGAAGGGGG TCATCCTTTA CGGTCCCCCG GGTACCGGGA AGACGCTCTT GGCTCGTGCC
GTTGCGCACC ACACCGATTG CTGCTTCATT CGCGTGTCTG GTTCGGAATT AGTTCAAAAG
TACATAGGAG AAGGGGCGCG GATGGTTCGT GAACTGTTCG TCATGGCTCG CGAGCACGCG
CCGAGCATCT TGTTCATGGA TGAAGTAGAT TCTATCGGTA GCGCTCGCGA CGGAGGCGGC
GGAGGTGGAG GCGACAGCGA AGTGCAGCGT ACGATGCTTG AACTGCTCAA CCAGCTCGAC
GGTTTCGAGG CGACGAACAA GATTAAGGTG ATCATGGCCA CGAACCGCCT CGATATCCTC
GATCAGGCGC TTCTTCGTCC GGGCCGCATC GATCGTAAAA TCGAGTTCCC CAATCCATCT
GAAGACAGCC GCGTCGATAT TCTCAAGATT CATAGCCGCA AGATGAACCT CGTTCGCGGG
ATCGATCTTA AGAAGATCGC GAGCAAGATG GGTGGGGCTT CCGGGGCAGA ATCCAAGGCG
GTGTGCACCG AGGCCGGAAT GTTCGCGCTT CGCGAACGTC GCGTCCACGT CACGCAAGAA
GACTTTGAAA TGGCCGTATC CAAGGTGATG CAAAAGGATA GCGAAAAGAA CATTTCCGTG
AAAAAGCTCT TTTCGTAA
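
The gene sequence above is laid out in space-separated blocks of 10 bases. As a minimal sketch (not part of the original record), a GC percentage like the one listed under Gene Information could be recomputed from such text as follows. The function name `gc_percent` is hypothetical, and the short input string is a toy example rather than the gene itself.

```python
def gc_percent(seq_text: str) -> float:
    """Percent G+C in a nucleotide sequence, ignoring whitespace and case."""
    # Join on split() to drop the spaces and line breaks used for layout.
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same block layout; 6 of 8 bases are G or C.
toy = "ATGC\nGGCC "
print(gc_percent(toy))
```

Pasting the full gene sequence into `gc_percent` should give a value comparable to the listed GC content, assuming the database counts only G and C over the whole gene span.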
 
Protein sequence
MDVDARPQTG LRAYYDAKIE ELEVRLRDKT QNLRRLEAQR NELNGRVRSL REELHMLQEP 
GSYVGEVVKV MGKKKVLVKV HPEGKYVCDM DKSIDVTKLT AGTRVALRND SYTLHVILPS
NIDPLVSLMK VEKVPDSTFD MIGGLDQQVK EIKEVVELPI KHPELFDALG IAQPKGVILY
GPPGTGKTLL ARAVAHHTDC CFIRVSGSEL VQKYIGEGAR MVRELFVMAR EHAPSILFMD
EVDSIGSARD GGGGGGGDSE VQRTMLELLN QLDGFEATNK IKVIMATNRL DILDQALLRP
GRIDRKIEFP NPSEDSRVDI LKIHSRKMNL VRGIDLKKIA SKMGGASGAE SKAVCTEAGM
FALRERRVHV TQEDFEMAVS KVMQKDSEKN ISVKKLFS