Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_30617 |
Symbol | |
ID | 5001034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 467129 |
End bp | 468573 |
Gene Length | 1445 bp |
Protein Length | 443 aa |
Translation table | |
GC content | 59% |
IMG OID | 640416455 |
Product | predicted protein |
Protein accession | XP_001416670 |
Protein GI | 145344292 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1222] ATP-dependent 26S proteasome regulatory subunit |
TIGRFAM ID | [TIGR01242] 26S proteasome subunit P45 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.580577 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ACACGGCGAC GCGTCTCGAC GCGCGACGGC GACGGCGCGA GCGACGGCGC GAGCGACGGC GACGATCGAC GCGCGACGCG CGCGCGCGAG CGAACATGGG CCAGGGACAG AGCGCGGACG GCGGCGACGG CGCCGACGGA CGCCACGGCC GGGGCAAGAA GAAGGAAAAG AAGAAGTACG TCCCGCCCGC GCCGCCGATG CGCGTGGGAA AGAAGAAGAA GAAGACCGGG ATCGAGGGCA GCACGCGATT GCCGAACGTC GCGCCGCAGT CGAAGTGTAA GCTGCGGATG CTGAAGCTGG AGCGGGTGAA GGATTATTTG CTGATGGAGG AGGAGTTCGT GGGGAATCAG GAGCGGTTGA AGCCGCGAGA GGAGCGGGAC GAGGACGAGC AGAGCAAGAT TGACGAGATG CGGGGGGCGC CGATGAGCGT GGGGTCGTTG GAGGAGATCA TCGATGACAC GCACGGGATC GTGTCGTCGT CGATCGGGCC GGAGTATTAC GTGAACATCG CGTCGTTCGT GGACAAGAGT CAGCTCGAAC CGGGGTGCGC GGTGCTGTTG CATCACAAGA ATTCTGCCGT CGTGGGGACT CTGGCGGACG ACGTCGATCC CATGGTGAGC GTGATGAAGG TTGATAAGGC GCCGTTGGAG TCGTACGCCG ATGTTGGGGG ATTAGAGGAT CAGATTCAAG AGATCAAGGA AGCCGTGGAG TTGCCGCTGA CGCACCCCGA ACTGTACGAA GACATCGGCA TCAAGCCGCC GAAAGGGGTG ATCTTGTACG GAGCTCCGGG AACTGGGAAG ACGCTGTTAG CTAAGGCGGT GGCGAACTCA ACGAGCGCGA CTTTTTTGCG CATCGTTGGA TCTGAATTGA TTCAAAAATA CTTGGGCGAC GGCCCGAAGC TCGTGCGCGA GCTCTTCCGC GTCGCCGACG AGATGAGTCC CTCTATCGTT TTCATGGATG AGATCGACGC CGTCGGTACG AAGCGATACG ATTCTCAATC GGGCGGCGAG CGCGAGATCC AACGTACGAT GTTAGAGTTA CTGAACCAGA TGGATGGTTT TGACTCGCGC GGCGACGTCA AGGTCATCAT GGCTACGAAT AGAATCGAAT CGCTCGACCC CGCGCTCTTA CGCCCGGGTC GAATAGATCG AAAGATTGAA TTCCCTTTAC CGGACGTCAA GACAAAGCGA CACATTTTCA ACATTCACAC CGGGCGCATG AACCTTTCCG CCGACGTACA GTTGGAGGAA TTTGTCATGG CCAAGGACGA ACTCTCGGGC GCCGACATCA AGGCGCTTTG CACCGAAGCC GGTTTGCTCG CCTTACGTGA GCGCCGAATG CAAGTAACGC ACGCCGACTT CAGCAAGGCT AAAGAAAAGG TTTTGTACAA GAAGAAGGAA GGCGTGCCGG AGGGAATGTT TACGTGATTA GAACCGTTTT AGGAG
|
Protein sequence | MGQGQSADGG DGADGRHGRG KKKEKKKYVP PAPPMRVGKK KKKTGIEGST RLPNVAPQSK CKLRMLKLER VKDYLLMEEE FVGNQERLKP REERDEDEQS KIDEMRGAPM SVGSLEEIID DTHGIVSSSI GPEYYVNIAS FVDKSQLEPG CAVLLHHKNS AVVGTLADDV DPMVSVMKVD KAPLESYADV GGLEDQIQEI KEAVELPLTH PELYEDIGIK PPKGVILYGA PGTGKTLLAK AVANSTSATF LRIVGSELIQ KYLGDGPKLV RELFRVADEM SPSIVFMDEI DAVGTKRYDS QSGGEREIQR TMLELLNQMD GFDSRGDVKV IMATNRIESL DPALLRPGRI DRKIEFPLPD VKTKRHIFNI HTGRMNLSAD VQLEEFVMAK DELSGADIKA LCTEAGLLAL RERRMQVTHA DFSKAKEKVL YKKKEGVPEG MFT
|
| |