Gene Information |
Locus tag | OSTLU_33401 |
Symbol | |
ID | 5003728 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 152242 |
End bp | 155190 |
Gene Length | 2949 bp |
Protein Length | 923 aa |
Translation table | |
GC content | 52% |
IMG OID | 640419149 |
Product | predicted protein |
Protein accession | XP_001419727 |
Protein GI | 145350679 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5110] 26S proteasome regulatory complex component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.461496 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCTCGTCGAC GCGTCGTCAC CCGACGCGAC GCTTTAACCG ACTCACGCGA TGGTGACGAA GAAACCCAAC GACGCCGTCG CCGCGGCGAG CGCGAAGAAA GGAAAGAACG CCGCGGGCGA CGACAAAGAC GACAAAAACG GCGATGGGGT CGTTGGCGAT GGAAAGTCAT CGACGACGAA GAATAGAAAG GATAAGAAGA ACGAGGTGGA GTTGAGCGAA GAAGACGCGG CGCTGAAGGA GAACTTGGAA CTGATGGTGA TGCGAGCGAG CGATCCGAAG GCGGGGGTGG CGAAATTGGC GCTTGAAACG ATGAGACGCG AGATCCGAAC GGCGACGAGG TGCGCGAGTG CGAGAGCGAC GAGGTCGCAT GAACGATTAC GACGCGATCG TGGATCACGC GACGATTGAG TGAGTGATAT GCGCGATACT GACGAAATGA TACGGTACTT TACGGCGATG CGAACAGTTC GATGACTTCG GTCCCAAAGC CTTTAAAGTT TTTGAGACCG CACTTGCAAA CCTTGAAGGA GGTGTACGAC AAAACGAAGA ATGGAGAAAA CAAGTTACTG CTGGCGGATA TCATTTCCTT GTTGTCCATG ACCAACGCTC CGGTGGCTGG GGAAATTCCA GAGTCGTTGA AGTACAGATT ATTAGGGTCA AAGGAGGACA TCGGGAACTG GGGACACGAG TATGTGCGTA ATCTCGCCGC AGAGATCGGC ACTGAATACC ATCGTCGCAT AGAGCAAGAT GGGGACAAGG CTTCGATGGA AGATCTCATG GGTTTGGTGC ACGAAATCGT TCCTTTTCAC ATGCAACACA ACGCCGAGCC AGAGGCTGTC GATTTACTAT TGGAAGTTGA GAAGCTCGAC ATTTTGCTCG ACAACGTGAA CGACGCCAAC TATTCGAGGA CATGCTTGTA CCTGTTCAGC TGTGCCAACT ATCTTCCTGA GCCAGAAGAT GCAATCGTGT TGAAGACGGC GCATGCGATT TTCATGAAGG TTGGGAAGAT GCCAGACGCT ATGCGAGTTG CGCTCAAGCT CTGCGAGCAG AGCATCATCG AAGAGACGTT CAACGCGTGC ACCAACTTGG CCATCAAGAA GCAACTTTGC TACATGTTGG CCCGTCACGG CCATCCGTTG AAGCTCGATG AAGGCCCGTG TGAGGTTGAA GAAGATAACC TGGATATGCT GCAGAACATC ATGAGTCACT CAGACTTGAC AAAGAACTAC TTGATGCTGG CGCGTGACTT GGATGTCATG GAGGCGAAGT TACCCGAGGA CATTTACAAA TCGCATTTAA TGGAAGTTCG CGCTCCTTCG GGCGCCGCAG TGGATAGTGC TCGTGCTAAC CTCGCAGCGA CATTCGTGAA CGCGTTCGTG AACGCCGGTT TCGGACAAGA TAAGCTTTTA ACTTCTTCAG AAGCCGCGGA TGGTTCCACA TCTAATGTGA GCTGGATTTT CAAAAACAAA GATCACGGTA AAATGTCAGC AGCTGCTAGC CTTGGGAGCA TCTTGCTTTG GGACGTTGAA GGCGGTCTAC CGCAAGTCGA TGCTTATCTC TATAGCGAGG AACCGAACAT CGTTGCGGGT GGCTTGCTTG CCGTTGGTCT CATCAACACT AATGTGCGGA ACGATTGCGA TCCCGCTTAC GGTTTGTTGT ACGAGAGCGT GACCAAGGAA AACTCCGCAG TAAGAATCGG TGCGATAATG GGACTTGGTT TGGCATATGC TGGTACTCAA AAAGAAGAAG TTTCCGAGCT TCTCACGGAG GTGATTCACG ATGACAGTGC TCCGTTGGAA 
GTCGTTGCTT TCGCAGCGCT GTCGCTTGGC CTCGTCTTCT GCGGCACGTG CCACGAAGAG TCTGTGTCGA CCATCGTCCA AACGCTGATG ATGCGGCCTG AGAAAGAACT CGACAATACG TTTGTGCACT TTTTGTGCCT AGGTTTAGGT TTACTTTTCT TACAGCGTCA AGCGGAAGTG GAAGCGACAT TGGAAGTTGC GAAAACTCTC CCGGCGCGAA TCAGCGGATA TTTACAAACT GTGCTGGACG TGTGCGCGTA CGCCGGTAGT GGTAATGTGT TGAAAATTCA GTCTCTTTTG GCGAAGTGCG GCGAGCACCC GGAGGCTGAC GAAGGCGATG AATGGATTGC AGATCCCCAG AGTGTCGCTG TGTTGGGCAT CGCTCTCGTT GCCATGGGCG AAGAACTTGG AGCAGACATG GCGGTCCGTG CGTACGATCA CCTTATTCAG TATGGTGACG CGGCAGTAAA GAAAGCCGTT CCTCTCGCAT TTGCGCTGTT ACACACCTCC AACCCGAAGC TTGATGTCAC GGATTTACTT GGTAGACTCA GTCACGATAG CAATGAAGAG GTCGCGCAGT CCGCGTGCCT TGCCCTCGGT ATTGTTGGAG CGGGAACCAA CAACGCGCGC TTGGCCTCGC AGCTACGACA ACTTAGTAGT TACTACTACA AGGAGCCGTC GTGTTTGTTC CTCGTGCGTG TCTCACAGGG TCTTGTGCAT ATGGGTAAGG GCTTGCTCAC GCTTTCTCCG GCGCACTCGG ACCGAGCGCT GGTGTCAAAC GTCGCACTTG CTGGTTTGAT CATCACCGCC TTTGCGGGAC TCGACATGAA GCATACGATT TTGGGTAAGC ATCACTACAT GCTGTACTAT TTGTTTGTCG CTGCCCAACC GCGCATGCTC ATGACTGTGG ATGAACAAGG GGAACCGTTG CAAGTGTCTG TTCGAGTCGG TCAAGCTGTT GACGTCGTGG GACAGGCTGG GCGACCAAAG ACTATCACTG GCTTCCAAAC ACATAACACT CCCGTACTTC TGTCAGTTGG CGATCGAGCG GAGCTGGCGA GCGAGAAATA CATTCCTTTG ACTCCTGTTT TAGAGGGAGT TGTCATCTTA AAGAAGAACC CGGAGTGGGT CGAGGAAATG GAAAAGTAG
|
Protein sequence | MVTKKPNDAV AAASAKKGKN AAGDDKDDKN GDGVVGDGKS STTKNRKDKK NEVELSEEDA ALKENLELMV MRASDPKAGV AKLALETMRR EIRTATSSMT SVPKPLKFLR PHLQTLKEVY DKTKNGENKL LLADIISLLS MTNAPVAGEI PESLKYRLLG SKEDIGNWGH EYVRNLAAEI GTEYHRRIEQ DGDKASMEDL MGLVHEIVPF HMQHNAEPEA VDLLLEVEKL DILLDNVNDA NYSRTCLYLF SCANYLPEPE DAIVLKTAHA IFMKVGKMPD AMRVALKLCE QSIIEETFNA CTNLAIKKQL CYMLARHGHP LKLDEGPCEV EEDNLDMLQN IMSHSDLTKN YLMLARDLDV MEAKLPEDIY KSHLMEVRAP SGAAVDSARA NLAATFVNAF VNAGFGQDKL LTSSEAADGS TSNVSWIFKN KDHGKMSAAA SLGSILLWDV EGGLPQVDAY LYSEEPNIVA GGLLAVGLIN TNVRNDCDPA YGLLYESVTK ENSAVRIGAI MGLGLAYAGT QKEEVSELLT EVIHDDSAPL EVVAFAALSL GLVFCGTCHE ESVSTIVQTL MMRPEKELDN TFVHFLCLGL GLLFLQRQAE VEATLEVAKT LPARISGYLQ TVLDVCAYAG SGNVLKIQSL LAKCGEHPEA DEGDEWIADP QSVAVLGIAL VAMGEELGAD MAVRAYDHLI QYGDAAVKKA VPLAFALLHT SNPKLDVTDL LGRLSHDSNE EVAQSACLAL GIVGAGTNNA RLASQLRQLS SYYYKEPSCL FLVRVSQGLV HMGKGLLTLS PAHSDRALVS NVALAGLIIT AFAGLDMKHT ILGKHHYMLY YLFVAAQPRM LMTVDEQGEP LQVSVRVGQA VDVVGQAGRP KTITGFQTHN TPVLLSVGDR AELASEKYIP LTPVLEGVVI LKKNPEWVEE MEK
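The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates, and that the coding sequence consists of one codon per amino acid plus one stop codon. The function names `span_length` and `coding_bases` are hypothetical helpers introduced for illustration and are not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bases(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein: 3 per residue, plus an optional stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the record above.
gene_len = span_length(152242, 155190)   # 2949, matching "Gene Length"
cds_len = coding_bases(923)              # 2772 bases for 923 aa plus a stop codon
untranslated = gene_len - cds_len        # 177 bases in the span that are not translated

print(gene_len, cds_len, untranslated)   # prints: 2949 2772 177
```

Under these assumptions, 177 bases of the gene span are not translated into the listed protein. That is consistent with the record marking the gene as spliced, although the record itself does not give exon boundaries.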