Gene OSTLU_33401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33401 
Symbol 
ID5003728 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp152242 
End bp155190 
Gene Length2949 bp 
Protein Length923 aa 
Translation table 
GC content52% 
IMG OID640419149 
Productpredicted protein 
Protein accessionXP_001419727 
Protein GI145350679 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5110] 26S proteasome regulatory complex component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.461496 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TCTCGTCGAC GCGTCGTCAC CCGACGCGAC GCTTTAACCG ACTCACGCGA TGGTGACGAA 
GAAACCCAAC GACGCCGTCG CCGCGGCGAG CGCGAAGAAA GGAAAGAACG CCGCGGGCGA
CGACAAAGAC GACAAAAACG GCGATGGGGT CGTTGGCGAT GGAAAGTCAT CGACGACGAA
GAATAGAAAG GATAAGAAGA ACGAGGTGGA GTTGAGCGAA GAAGACGCGG CGCTGAAGGA
GAACTTGGAA CTGATGGTGA TGCGAGCGAG CGATCCGAAG GCGGGGGTGG CGAAATTGGC
GCTTGAAACG ATGAGACGCG AGATCCGAAC GGCGACGAGG TGCGCGAGTG CGAGAGCGAC
GAGGTCGCAT GAACGATTAC GACGCGATCG TGGATCACGC GACGATTGAG TGAGTGATAT
GCGCGATACT GACGAAATGA TACGGTACTT TACGGCGATG CGAACAGTTC GATGACTTCG
GTCCCAAAGC CTTTAAAGTT TTTGAGACCG CACTTGCAAA CCTTGAAGGA GGTGTACGAC
AAAACGAAGA ATGGAGAAAA CAAGTTACTG CTGGCGGATA TCATTTCCTT GTTGTCCATG
ACCAACGCTC CGGTGGCTGG GGAAATTCCA GAGTCGTTGA AGTACAGATT ATTAGGGTCA
AAGGAGGACA TCGGGAACTG GGGACACGAG TATGTGCGTA ATCTCGCCGC AGAGATCGGC
ACTGAATACC ATCGTCGCAT AGAGCAAGAT GGGGACAAGG CTTCGATGGA AGATCTCATG
GGTTTGGTGC ACGAAATCGT TCCTTTTCAC ATGCAACACA ACGCCGAGCC AGAGGCTGTC
GATTTACTAT TGGAAGTTGA GAAGCTCGAC ATTTTGCTCG ACAACGTGAA CGACGCCAAC
TATTCGAGGA CATGCTTGTA CCTGTTCAGC TGTGCCAACT ATCTTCCTGA GCCAGAAGAT
GCAATCGTGT TGAAGACGGC GCATGCGATT TTCATGAAGG TTGGGAAGAT GCCAGACGCT
ATGCGAGTTG CGCTCAAGCT CTGCGAGCAG AGCATCATCG AAGAGACGTT CAACGCGTGC
ACCAACTTGG CCATCAAGAA GCAACTTTGC TACATGTTGG CCCGTCACGG CCATCCGTTG
AAGCTCGATG AAGGCCCGTG TGAGGTTGAA GAAGATAACC TGGATATGCT GCAGAACATC
ATGAGTCACT CAGACTTGAC AAAGAACTAC TTGATGCTGG CGCGTGACTT GGATGTCATG
GAGGCGAAGT TACCCGAGGA CATTTACAAA TCGCATTTAA TGGAAGTTCG CGCTCCTTCG
GGCGCCGCAG TGGATAGTGC TCGTGCTAAC CTCGCAGCGA CATTCGTGAA CGCGTTCGTG
AACGCCGGTT TCGGACAAGA TAAGCTTTTA ACTTCTTCAG AAGCCGCGGA TGGTTCCACA
TCTAATGTGA GCTGGATTTT CAAAAACAAA GATCACGGTA AAATGTCAGC AGCTGCTAGC
CTTGGGAGCA TCTTGCTTTG GGACGTTGAA GGCGGTCTAC CGCAAGTCGA TGCTTATCTC
TATAGCGAGG AACCGAACAT CGTTGCGGGT GGCTTGCTTG CCGTTGGTCT CATCAACACT
AATGTGCGGA ACGATTGCGA TCCCGCTTAC GGTTTGTTGT ACGAGAGCGT GACCAAGGAA
AACTCCGCAG TAAGAATCGG TGCGATAATG GGACTTGGTT TGGCATATGC TGGTACTCAA
AAAGAAGAAG TTTCCGAGCT TCTCACGGAG GTGATTCACG ATGACAGTGC TCCGTTGGAA
GTCGTTGCTT TCGCAGCGCT GTCGCTTGGC CTCGTCTTCT GCGGCACGTG CCACGAAGAG
TCTGTGTCGA CCATCGTCCA AACGCTGATG ATGCGGCCTG AGAAAGAACT CGACAATACG
TTTGTGCACT TTTTGTGCCT AGGTTTAGGT TTACTTTTCT TACAGCGTCA AGCGGAAGTG
GAAGCGACAT TGGAAGTTGC GAAAACTCTC CCGGCGCGAA TCAGCGGATA TTTACAAACT
GTGCTGGACG TGTGCGCGTA CGCCGGTAGT GGTAATGTGT TGAAAATTCA GTCTCTTTTG
GCGAAGTGCG GCGAGCACCC GGAGGCTGAC GAAGGCGATG AATGGATTGC AGATCCCCAG
AGTGTCGCTG TGTTGGGCAT CGCTCTCGTT GCCATGGGCG AAGAACTTGG AGCAGACATG
GCGGTCCGTG CGTACGATCA CCTTATTCAG TATGGTGACG CGGCAGTAAA GAAAGCCGTT
CCTCTCGCAT TTGCGCTGTT ACACACCTCC AACCCGAAGC TTGATGTCAC GGATTTACTT
GGTAGACTCA GTCACGATAG CAATGAAGAG GTCGCGCAGT CCGCGTGCCT TGCCCTCGGT
ATTGTTGGAG CGGGAACCAA CAACGCGCGC TTGGCCTCGC AGCTACGACA ACTTAGTAGT
TACTACTACA AGGAGCCGTC GTGTTTGTTC CTCGTGCGTG TCTCACAGGG TCTTGTGCAT
ATGGGTAAGG GCTTGCTCAC GCTTTCTCCG GCGCACTCGG ACCGAGCGCT GGTGTCAAAC
GTCGCACTTG CTGGTTTGAT CATCACCGCC TTTGCGGGAC TCGACATGAA GCATACGATT
TTGGGTAAGC ATCACTACAT GCTGTACTAT TTGTTTGTCG CTGCCCAACC GCGCATGCTC
ATGACTGTGG ATGAACAAGG GGAACCGTTG CAAGTGTCTG TTCGAGTCGG TCAAGCTGTT
GACGTCGTGG GACAGGCTGG GCGACCAAAG ACTATCACTG GCTTCCAAAC ACATAACACT
CCCGTACTTC TGTCAGTTGG CGATCGAGCG GAGCTGGCGA GCGAGAAATA CATTCCTTTG
ACTCCTGTTT TAGAGGGAGT TGTCATCTTA AAGAAGAACC CGGAGTGGGT CGAGGAAATG
GAAAAGTAG
 
Protein sequence
MVTKKPNDAV AAASAKKGKN AAGDDKDDKN GDGVVGDGKS STTKNRKDKK NEVELSEEDA 
ALKENLELMV MRASDPKAGV AKLALETMRR EIRTATSSMT SVPKPLKFLR PHLQTLKEVY
DKTKNGENKL LLADIISLLS MTNAPVAGEI PESLKYRLLG SKEDIGNWGH EYVRNLAAEI
GTEYHRRIEQ DGDKASMEDL MGLVHEIVPF HMQHNAEPEA VDLLLEVEKL DILLDNVNDA
NYSRTCLYLF SCANYLPEPE DAIVLKTAHA IFMKVGKMPD AMRVALKLCE QSIIEETFNA
CTNLAIKKQL CYMLARHGHP LKLDEGPCEV EEDNLDMLQN IMSHSDLTKN YLMLARDLDV
MEAKLPEDIY KSHLMEVRAP SGAAVDSARA NLAATFVNAF VNAGFGQDKL LTSSEAADGS
TSNVSWIFKN KDHGKMSAAA SLGSILLWDV EGGLPQVDAY LYSEEPNIVA GGLLAVGLIN
TNVRNDCDPA YGLLYESVTK ENSAVRIGAI MGLGLAYAGT QKEEVSELLT EVIHDDSAPL
EVVAFAALSL GLVFCGTCHE ESVSTIVQTL MMRPEKELDN TFVHFLCLGL GLLFLQRQAE
VEATLEVAKT LPARISGYLQ TVLDVCAYAG SGNVLKIQSL LAKCGEHPEA DEGDEWIADP
QSVAVLGIAL VAMGEELGAD MAVRAYDHLI QYGDAAVKKA VPLAFALLHT SNPKLDVTDL
LGRLSHDSNE EVAQSACLAL GIVGAGTNNA RLASQLRQLS SYYYKEPSCL FLVRVSQGLV
HMGKGLLTLS PAHSDRALVS NVALAGLIIT AFAGLDMKHT ILGKHHYMLY YLFVAAQPRM
LMTVDEQGEP LQVSVRVGQA VDVVGQAGRP KTITGFQTHN TPVLLSVGDR AELASEKYIP
LTPVLEGVVI LKKNPEWVEE MEK