Gene Information |
Locus tag | OSTLU_27330 |
Symbol | |
ID | 5005530 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 77269 |
End bp | 78364 |
Gene Length | 1096 bp |
Protein Length | 297 aa |
Translation table | |
GC content | 59% |
IMG OID | 640420951 |
Product | predicted protein |
Protein accession | XP_001421201 |
Protein GI | 145353826 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0638] 20S proteasome, alpha and beta subunits |
TIGRFAM ID | |
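
The length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the span includes the stop codon (the gene sequence in this record does end in TGA). The function names `span_length` and `coding_length` are hypothetical helpers for illustration, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of coding bases implied by a protein length (3 bases per codon)."""
    return protein_aa * 3 + (3 if include_stop else 0)


# Values taken from the record above.
gene_len = span_length(77269, 78364)   # genomic span of the gene
cds_len = coding_length(297)           # coding bases, stop codon included

print(gene_len)            # 1096, matching the Gene Length field
print(cds_len)             # 894
print(gene_len - cds_len)  # 202 bases in the span that are not coding
```

The 202 bp difference is consistent with the "Is gene spliced | Yes" field: under the assumptions above, the genomic span is longer than the coding sequence because it contains intronic bases.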
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.00217957 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.014135 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGGGC CGACTTTAGA TTTCAGCTTC CTCGACGCCA CCGCGCGCGG CTCGATGGCG TCGAAACACG TCGACGAAAG GTTCGTTTCA GATCGCGTCA ACGCGCGATC GACGACTGAC GCCGCGTTTT AACGCGTTCA CAGCCTCGAC GCGATCGATG GAAACATGCA CGAGTGTAAC TTTAAAGCCC CTGCGGTGGA AGACGTGCGT GAGCGACGCG AACGCGACGC GCACGACGAG TGAGCGCTGG CGATCGGGAG AGCAACGGTT TGGTTAGGTC GCGGCGCTCG CGAAGAGTGA AAATGGACGA CTGACGATGA CGGTGGACGA TTTGCGCGCG CAGTTTGAAG GATTCCAGCG AGAGGTGATC AATTACGTCA AACCGAACCA CGGGACGACG ACGCTGGCGT TTATTTTTGA GCACGGTATC GTCGTCGCGG TGGACTCTCG CGCGTCGCAA GGACCGTACA TTTCTTCGCA GACGGTGAAA AAGGTGATCG AGATTAATCC GTTCTTGCTC GGGACCATGG CCGGGGGGGC GGCAGATTGT CAGTTTTGGC AGCGAAACCT CGGGATTCAG TGCCGGTTGC ACGAGTTGGA AAATGGGAAG CGAATCACGG TGCGCGCGGC GAGCAAGCTG TTGGCGAACA CGCTGTACAG TTACAAGGGC AAGGGATTGT CCATGGGGAC GATGGTGGCT GGGTGGGATT TGAACGGGCC TGGGCTGTAT TACGTCGATA GCGAGGGCAC ACGGTTGAAG GGGCAGCGGT TTAGCGTCGG TTCGGGGTCG TTGTTTGCAT ACGGGGTGTT GGATCAAGGA TACAAGTGGG ACTTGACGGT TGAGGAAGCG TGCGAACTCG GACGACGCGC GATTTATCAC GCCACGTTTC GCGACGCATT TTCTGGTGGT ACTGTCAGTG TGTACCACGT CGGTGCGAAT GGGTGGACTA AGGTGACCGG GGACGATGTC GGCGAGTTAC ACTTCTCGTA TTACCCGGCG ACGCCGGTCG ACGACGTCGA TGCGCACTGC GGCGGAGAAG GCAAGAAGGA AGCCGAGGCG AGAGCGGCTA CGGAAGCGAG CGCGATGGAG ACGTGA
|
Protein sequence | MNGPTLDFSF LDATARGSMA SKHVDESLDA IDGNMHECNF KAPAVEDFEG FQREVINYVK PNHGTTTLAF IFEHGIVVAV DSRASQGPYI SSQTVKKVIE INPFLLGTMA GGAADCQFWQ RNLGIQCRLH ELENGKRITV RAASKLLANT LYSYKGKGLS MGTMVAGWDL NGPGLYYVDS EGTRLKGQRF SVGSGSLFAY GVLDQGYKWD LTVEEACELG RRAIYHATFR DAFSGGTVSV YHVGANGWTK VTGDDVGELH FSYYPATPVD DVDAHCGGEG KKEAEARAAT EASAMET
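
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample string is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three display blocks of the gene sequence, used as a short sample.
sample = "ATGAACGGGC CGACTTTAGA TTTCAGCTTC"
seq = clean_sequence(sample)

print(len(seq))                     # 30
print(round(gc_fraction(seq), 3))   # 0.467 for this 30-base sample only
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared against the Gene Length field and `gc_fraction` against the GC content field.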