Gene OSTLU_35070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_35070 
Symbol 
ID5003561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp337727 
End bp338980 
Gene Length1254 bp 
Protein Length417 aa 
Translation table 
GC content57% 
IMG OID640418982 
Productpredicted protein 
Protein accessionXP_001419562 
Protein GI145350327 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1222] ATP-dependent 26S proteasome regulatory subunit 
TIGRFAM ID[TIGR01242] 26S proteasome subunit P45 family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.0746786 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACGAC TGACGACGAC GCGCGCGGTG TGCGATGACG CGCAGATGCC GGCGACGCAC 
GCGAACGCGA GCGGGGCGAC GACGACGAGC GAGGGATCCT CGACGACGAC GGATCTGTAC
GCGCGACTCA AGTCGCTTCA ACGCGAACTG GAGCTGGTGG AGATTCAAGA GGAGTACATC
AAGGATGAAC AAAAGAACTT GAAGATTGAA TTGCTCAGGG CGCAGGAAGA GGTGAAGCGG
ATACAGAGCG TGCCGTTGGT GATTGGACAG TTTTTGGAGA TGGTGGACGC GGAGACGGGG
ATCATATCGT CGACGACGGG GTCGAATTAT TACGTGCGGA TTTTGTCGAC GCTGAACCGG
GAGCTGTTGA AACCGTCGAG CTCGGTGGCG TTGCACAGAC ATTCGAACGC GCTGGTGGAG
ATTTTACCTC CCGAGGCGGA TTCGTCGATT TCTTTGTTGA GCGACGCGGA ACGGCCGGAT
GTGAAGTACA GCGACATCGG GGGGGCGGAT GTGCAAAAGC AAGAGATTCG CGAGGCGGTC
GAGCTTCCGT TGACGCACTT CGATTTGTAT AGGCAGATTG GAATCGATCC ACCGCGTGGG
GTCTTGCTGT ACGGACCACC CGGGACGGGG AAAACGATGT TGGCCAAGGC GGTGGCGCAC
CACACCACGG CGGCGTTTAT TCGCGTCGTC GGGAGCGAGT TCGTGCAGAA GTACCTCGGC
GAAGGGCCGA GAATGGTGAG AGATGTGTTT CGATTGGCGA AGGAAAACGC CCCAGCGATC
ATCTTCATCG ACGAGGTCGA TTCCATCGCG ACTGCGCGTT TCGACGCGCA CACCGGCGCG
GATCGTGAGG TGCAGCGTAT TTTGATGGAG CTCTTGAACC AAATGGACGG ATTCGATCAA
ACGGTCAACG TCAAAGTAAT CATGGCGACG AACCGTGCGG ATACCCTCGA TCCGGCGTTA
TTGCGCCCCG GTCGTCTCGA TCGAAAGATT GAGTGCCCGC ATCCCGATCG TCGTCAAAAG
CGTTTGGTGT TCCAGGTGTG CGTGAACAAG ATGAGCCTCA GCGACGAAGT AGATTTGGAG
GATTACGTCA GTCGACCGGA CAAGATCTCC GCCGCGGACA TTCGCTCCAT CTGCCAAGAA
GCCGGGTTGC AAGCCGTTCG GAAGAATCGA TACGTGGTTT TACCGAAAGA CTTTGAAGTC
GCGTACAAGA CGAACGTGCG CAAACCTGAC AACGACTTTG AATTTTACCG ATAG
 
Protein sequence
MERLTTTRAV CDDAQMPATH ANASGATTTS EGSSTTTDLY ARLKSLQREL ELVEIQEEYI 
KDEQKNLKIE LLRAQEEVKR IQSVPLVIGQ FLEMVDAETG IISSTTGSNY YVRILSTLNR
ELLKPSSSVA LHRHSNALVE ILPPEADSSI SLLSDAERPD VKYSDIGGAD VQKQEIREAV
ELPLTHFDLY RQIGIDPPRG VLLYGPPGTG KTMLAKAVAH HTTAAFIRVV GSEFVQKYLG
EGPRMVRDVF RLAKENAPAI IFIDEVDSIA TARFDAHTGA DREVQRILME LLNQMDGFDQ
TVNVKVIMAT NRADTLDPAL LRPGRLDRKI ECPHPDRRQK RLVFQVCVNK MSLSDEVDLE
DYVSRPDKIS AADIRSICQE AGLQAVRKNR YVVLPKDFEV AYKTNVRKPD NDFEFYR