Gene Information |
Locus tag | OSTLU_41449 |
Symbol | |
ID | 5002299 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | + |
Start bp | 51974 |
End bp | 53125 |
Gene Length | 1152 bp |
Protein Length | 246 aa |
Translation table | |
GC content | 60% |
IMG OID | 640417720 |
Product | predicted protein |
Protein accession | XP_001418143 |
Protein GI | 145347374 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0638] 20S proteasome, alpha and beta subunits |
TIGRFAM ID | |
|
|
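The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (which matches the reported 1152 bp), and that the gene span holds only coding exons, introns, and the stop codon, with no UTRs. The function names are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
gene_len = span_length(51974, 53125)   # genomic span of the gene
cds_len = coding_length(246)           # 246 aa plus one stop codon
non_coding = gene_len - cds_len        # bases left over for introns, under the stated assumptions

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span exceeds the coding length, which is consistent with the "Is gene spliced | Yes" field: the difference would correspond to intron sequence.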
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.220218 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGCG GTGCCGGCGC CGGATACGAT CGGCACATCA CCGTATTCTC TCCCGAAGGA CGTCTGTACC AAGTCGGTGC GTGTTCGCGC GCGCGCCGAC GCCTCCTCGC GCCGTCACCT CGCCCGCGAC GCGCGACGTT CACCGACGAA ACGATCTCAC CGACTGACCG CGATTCGACG ACTTTCGACG CGCAGAGTAC GCGTTCAAGG CGATCAAATC CGTCGGCGTC ACCACCATCG GCGTGCGAGG GAAAGACTCC GTGTGCGTCG TGACGCAGAA GAAGATTCCG GTGCGCGAAA CGGGCGAACG CGAGGGCGAA GAACGATCTC GTCGGCGAAC GCGAAGGCGC GAACGCGCTC GAGACGACGC GGGAGGGATC GACCGCGCTG GATCCCGCGA ACGCGCGCGC GAGAGGAAGA CTGACGCGTG GTGTGTGATC GGACGTCGCG CGAGCGCAGG ATAAATTGAT CGATGCGTCG GATGTGACGC ACATGTATAA GATTACGAAA ACCGTGGGCA TGTGCGCGAC GGGAAAAGGA CGTACGTTTT CGTTTTCGTC GCTCGGCGGC TTTGTTTGAA TGAACACGTA CGCGCGCCCG CGCGAGCGCG ACTTTTCGCT CGACCGATCG ACTGACGACT GCTCGCGTCG ATTTCGTTTC GCAGCGGATA TCCGAGACAT AGTACAAAAG GCGCGCAGAA AGGCGGCGGA TTTCAAGCAA CACTACGGGT ACGAGGTCCC GGTGGACGTG TTGGCGAACA TACTCGCCGA TGAGTTCCAG GTGTACACGC AGCACGCATA CATGCGTCCG CTCGCGGTGA TGGTGATATT AATCGCCGTA GATGAGGATC GCGGGCCGAG TCTGTTCAAG TGCGATCCGG CAGGATACTT TGTCGGTTAC AGCGCGACGA GCGCGGGGGC GAAGGAGGTC GAGGCGGTGA ACTTCTTGGA GAAGAAGGTC AAGAGCGGCG CATCGTTCGA TGTGAATCAG ACGGCCCAGC TCGCGATCAG CGCTCTTCAG CACGTGCTCG GGGAAGAGGT CAAGGCGAGC GAGTTAGAAG TCGCCGTCGT CACGGCGGAC AATCCCAATT TCCGCGTCAT CAGCGAGAGC GAAGTGGAAG ATCATTTGAC ATCGATTTCT GAGAGAGACT AG
|
Protein sequence | MSRGAGAGYD RHITVFSPEG RLYQVEYAFK AIKSVGVTTI GVRGKDSVCV VTQKKIPDKL IDASDVTHMY KITKTVGMCA TGKGPDIRDI VQKARRKAAD FKQHYGYEVP VDVLANILAD EFQVYTQHAY MRPLAVMVIL IAVDEDRGPS LFKCDPAGYF VGYSATSAGA KEVEAVNFLE KKVKSGASFD VNQTAQLAIS ALQHVLGEEV KASELEVAVV TADNPNFRVI SESEVEDHLT SISERD
|
| |
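The sequence fields above are printed in space-separated blocks of ten residues. A minimal sketch for joining those blocks and computing a GC fraction, to compare against the reported GC content field, might look like the following. The helper names are hypothetical, and the short example string is just the first two blocks of the gene sequence.

```python
def clean_sequence(field: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the Gene sequence field, used only as a small example.
example = clean_sequence("ATGTCCCGCG GTGCCGGCGC")
print(len(example), gc_fraction(example))
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` gives a value that can be checked against the GC content listed in the record.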