Gene Information |
Locus tag | OSTLU_33894 |
Symbol | |
ID | 5000920 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 202806 |
End bp | 204263 |
Gene Length | 1458 bp |
Protein Length | 398 aa |
Translation table | |
GC content | 57% |
IMG OID | 640416341 |
Product | predicted protein |
Protein accession | XP_001416582 |
Protein GI | 145344112 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1222] ATP-dependent 26S proteasome regulatory subunit |
TIGRFAM ID | [TIGR01242] 26S proteasome subunit P45 family |
| |
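The coordinates in this record are consistent with inclusive, 1-based positions: 204263 - 202806 + 1 = 1458 bp. Because the gene is marked as spliced, the 398 aa protein accounts for fewer bases than the full span. The sketch below works through that arithmetic. It assumes inclusive coordinates and one stop codon counted in the coding length; the function names are illustrative and not part of any database API.

```python
def gene_span_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


span = gene_span_bp(202806, 204263)  # Start bp and End bp from the record
cds = coding_bp(398)                 # Protein Length; assumes one stop codon
non_coding = span - cds              # bases in the span not explained by the CDS
print(span, cds, non_coding)         # prints: 1458 1197 261
```

Under these assumptions, about 261 bp of the gene span are not accounted for by the coding sequence, which is what one would expect for a spliced gene with introns.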
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00183921 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0445175 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTCG ACGCGCGACC GCAAACGGGT CTGCGCGCGT ATTACGACGC GAAAATCGAA GAGTTGGAGG TTCGCCTGCG CGACAAGACG CAGAATCTTC GACGCTTAGA GGCGCAGAGA AATGAATTGA ACGGACGAGG TGCGCGGAGC GCGCGTGAGA AGGGCGTCGT CGTCGTCGAT CGCGCGAGCG CGGTGCGCGA ACGGACGCGA AAGCGTGATG AGAGGAACGA TCGAGGCGAG ATCGCGAACG ACGCGCGAAC GACGAGCGAA GACTGACTTT GAATTTCGTT CGGTTTACCG CGCGCGCGCA GTTCGATCGC TTCGCGAGGA ATTGCACATG TTGCAGGAAC CCGGATCGTA CGTCGGGGAG GTGGTGAAGG TGATGGGGAA GAAGAAAGTG TTGGTCAAGG TGAGCGGACG ACTCGCCCGC GCGGCGCGCC GAGCGCGCGG AGACGGCGAG AGAGGAAAGA ATACTGACAG TTTGTGTTTC GCGCGCAGGT GCACCCGGAG GGAAAATACG TGTGTGATAT GGACAAGAGC ATCGATGTGA CGAAACTGAC GGCGGGGACT CGGGTGGCGT TGAGGAACGA TTCGTACACG CTGCACGTGA TCCTTCCGTC GAACATCGAC CCGCTCGTGT CGCTCATGAA GGTTGAAAAG GTTCCCGATT CCACGTTCGA TATGATTGGT GGATTGGATC AGCAAGTGAA GGAGATCAAG GAAGTCGTGG AGTTACCGAT CAAGCACCCA GAGCTTTTCG ACGCGCTCGG GATCGCGCAA CCGAAGGGGG TCATCCTTTA CGGTCCCCCG GGTACCGGGA AGACGCTCTT GGCTCGTGCC GTTGCGCACC ACACCGATTG CTGCTTCATT CGCGTGTCTG GTTCGGAATT AGTTCAAAAG TACATAGGAG AAGGGGCGCG GATGGTTCGT GAACTGTTCG TCATGGCTCG CGAGCACGCG CCGAGCATCT TGTTCATGGA TGAAGTAGAT TCTATCGGTA GCGCTCGCGA CGGAGGCGGC GGAGGTGGAG GCGACAGCGA AGTGCAGCGT ACGATGCTTG AACTGCTCAA CCAGCTCGAC GGTTTCGAGG CGACGAACAA GATTAAGGTG ATCATGGCCA CGAACCGCCT CGATATCCTC GATCAGGCGC TTCTTCGTCC GGGCCGCATC GATCGTAAAA TCGAGTTCCC CAATCCATCT GAAGACAGCC GCGTCGATAT TCTCAAGATT CATAGCCGCA AGATGAACCT CGTTCGCGGG ATCGATCTTA AGAAGATCGC GAGCAAGATG GGTGGGGCTT CCGGGGCAGA ATCCAAGGCG GTGTGCACCG AGGCCGGAAT GTTCGCGCTT CGCGAACGTC GCGTCCACGT CACGCAAGAA GACTTTGAAA TGGCCGTATC CAAGGTGATG CAAAAGGATA GCGAAAAGAA CATTTCCGTG AAAAAGCTCT TTTCGTAA
Protein sequence | MDVDARPQTG LRAYYDAKIE ELEVRLRDKT QNLRRLEAQR NELNGRVRSL REELHMLQEP GSYVGEVVKV MGKKKVLVKV HPEGKYVCDM DKSIDVTKLT AGTRVALRND SYTLHVILPS NIDPLVSLMK VEKVPDSTFD MIGGLDQQVK EIKEVVELPI KHPELFDALG IAQPKGVILY GPPGTGKTLL ARAVAHHTDC CFIRVSGSEL VQKYIGEGAR MVRELFVMAR EHAPSILFMD EVDSIGSARD GGGGGGGDSE VQRTMLELLN QLDGFEATNK IKVIMATNRL DILDQALLRP GRIDRKIEFP NPSEDSRVDI LKIHSRKMNL VRGIDLKKIA SKMGGASGAE SKAVCTEAGM FALRERRVHV TQEDFEMAVS KVMQKDSEKN ISVKKLFS
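The sequences above are printed in space-separated groups of 10 characters, so the spaces have to be removed before computing any statistic. The sketch below joins the groups and computes length and GC percentage. The helper names are hypothetical, and the exact method the database uses to compute its reported GC content is not stated in the record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits the sequence into 10-character groups."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three groups of the gene sequence above, used as a short demonstration.
fragment = clean_sequence("ATGGACGTCG ACGCGCGACC GCAAACGGGT")
print(len(fragment), round(gc_percent(fragment), 1))
```

Running the same two functions on the full gene sequence should give values comparable to the reported Gene Length and GC content fields, provided the database counts bases the same way.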