Gene Information |
Locus tag | OSTLU_44133 |
Symbol | |
ID | 5004285 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 88249 |
End bp | 89550 |
Gene Length | 1302 bp |
Protein Length | 405 aa |
Translation table | |
GC content | 55% |
IMG OID | 640419706 |
Product | predicted protein |
Protein accession | XP_001420410 |
Protein GI | 145352130 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5243] HRD ubiquitin ligase complex, ER membrane component |
TIGRFAM ID | |
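The length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the protein is followed by a single stop codon. Both are assumptions, not statements made by the record. Under those assumptions, the gap between the gene span and the coding length is consistent with the "Is gene spliced | Yes" field.

```python
# Values copied from the record above.
start_bp = 88249
end_bp = 89550
protein_length_aa = 405

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Coding bases implied by the protein, assuming one terminal stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not translated
# (assumption: these belong to intron sequence).
noncoding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, noncoding_bp)  # 1302 1218 84
```

The first value matches the reported gene length of 1302 bp. The remaining 84 bp is an inferred figure, not one reported in the record.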
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTGATTT GGGTGCGAAA CGCGCTCAAC GAAGACGAGC GCGAGGGGAT GATTTCATCG ATGCGACACG CCACGCAAAA CACGAGGTTC GCGCAGTGGA TGAACACGTG GTACAGCGGA AAGGTGGAAG ACTTACAGAC GATCGGAGAC GATTGGGTGA AGTCTACGGC GATGGACGAC GACGACGCGT CCGCGACGAC GACGAAGACG AACGCCGCCG ATACCGCCGC GGTTTCGTCC CAAGACGGCA TGCGACAGGT ACACGCGTAC CTCAGCCGTC GATCTGGCGA TGTGGATAAG GAAACATTCA AGCCCGGTTG GGAAGATATT TTTCGCATGA ATCAGATCCA GCTCGAAGAC GCCGTGCGCA CGGTGAATCG CGACGACACG CTCGCGCCGA CGAAGAAGGC ATACTTGATT CAAAACTTGC TCGCGTCGCG ATGGATCGTG GGAAATCAAT TGCAAGCGCA AAAAGATAAG ACGACGGAGC TGTCAAGCGT GGGTAGCGCA CAAAAATCGG GTGCGCCTGC GATTCTGTGC CAGCCCGTCG ACGAAGACGA GGAGAACGGC AAAGACGGTT GCAAGCATTA CAAGCGTCGG TGCAAAATCG TCGCGCCGTG CTGCGACCAA GCTTTCACGT GCAGATTTTG TCACGACGAC GCGAGTGATC ACACCGTGAA TCGCTACGCG GTGAAAGAAA TGGTGTGTAA CGAGTGCAAG AAGCGTCAAC CCGTGAACGA GGTATGCGTG GGGTGCTCGA CGTCGATGGC AAAGTATCAC TGCAACGTGT GCAACTTGTT TGACGACTCG AGCGAAGCCA TCTATCATTG CCCGTTTTGC AACGTCTGTC GACGCGGTAA AGGTCTGGGC GTTGACTTCT TCCATTGCAT GAAGTGTAAC GCGTGCGTGA GCTTACAGCA CGGCAAGCAT GAGTGCTCAG AACGAGGCAT GGATAGCGAA TGTCCAGTGT GCAAAGAATT TCTTGCCGAG TCGGAGACAC CGGTTAAGGA GCTTCCGTGC GGGCATATCA TGCACGCCAC GTGCTTTACC ACGTATACGC GGCACTACTA CACGTGTCCT TTGTGTCGGA AATCGCTCGG CGACTTTTCC ATGTACTTTA GAATGCTCGA CGCCATCCTA GCGGATGAGA GCGACGACAG CGTTCCAGAG GCGCTGCGGG GAAAGACGCA AAAGGTGTCG TGCAACGACT GTGCGAAGGA TTCGGATGCA AAGTTTCATT TTGTATACCA CGCGTGCGCG CACTGTAGAA GCTATAATAC GCGCGTGCTC ACGTTTGATT AG
Protein sequence | MLIWVRNALN EDEREGMISS MRHATQNTRF AQWMNTWYSG KTNAADTAAV SSQDGMRQVH AYLSRRSGDV DKETFKPGWE DIFRMNQIQL EDAVRTVNRD DTLAPTKKAY LIQNLLASRW IVGNQLQAQK DKTTELSSVG SAQKSGAPAI LCQPVDEDEE NGKDGCKHYK RRCKIVAPCC DQAFTCRFCH DDASDHTVNR YAVKEMVCNE CKKRQPVNEV CVGCSTSMAK YHCNVCNLFD DSSEAIYHCP FCNVCRRGKG LGVDFFHCMK CNACVSLQHG KHECSERGMD SECPVCKEFL AESETPVKEL PCGHIMHATC FTTYTRHYYT CPLCRKSLGD FSMYFRMLDA ILADESDDSV PEALRGKTQK VSCNDCAKDS DAKFHFVYHA CAHCRSYNTR VLTFD
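As a rough consistency check between the two sequences, the start and end of the gene sequence can be translated and compared with the protein sequence. The "Translation table" field in the record is empty, so the sketch below assumes the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration. Only short fragments are translated, because the gene is marked as spliced and a straight translation of the whole 1302 bp would run through non-coding sequence.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# laid out in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence above.
print(translate("ATGTTGATTT GGGTGCGAAA CGCGCTCAAC"))  # MLIWVRNALN

# Last three codons of the gene sequence above ("*" marks the stop codon).
print(translate("TTTGATTAG"))  # FD*
```

Both fragments agree with the start (MLIWVRNALN) and end (...VLTFD) of the protein sequence listed above.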