Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_36349 |
Symbol | |
ID | 5000294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 295784 |
End bp | 299167 |
Gene Length | 3384 bp |
Protein Length | 1127 aa |
Translation table | |
GC content | 51% |
IMG OID | 640415715 |
Product | predicted protein |
Protein accession | XP_001416361 |
Protein GI | 145343504 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5077] Ubiquitin carboxyl-terminal hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0304624 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGAGCG GTTCGGAAGA TAGCGAGTTG GCGTTCTCGA CGGCGGAGCC GACGACGTTC TCGTGGCGCG CCGAGTTCTC GCGATGGAAG AAACGAGACG CGAAGGTGGT GAGTCAAACG TTTGAGTGCG GAGATACACT CTTTCGGTTG GCGATGTATC CGTTCGGGAG TAATTTGAAC TCGAAATCGG AGACGCCGGC GCAGGTGAGC TTGTTCTTGG ACACGGGGGC GACAAAGCCG CGCCGAATCG AGGACGACAT GAGTAGAGAG TGGAGAAGGC ACGCGAAGTT CGAATTGCAG CTGCTTCATC CGACGGATGC GTCGGTGGTA GAATCGAAGG AAACGTCGCA CACGTTCGAC AGACGCGAAG CGGATTGGGG GTTCGCGTCG TTCATCACGC GCGAAGACGT TTTTGAAAAG GGCTACGTTG ACGCCGAAGG CTGTGTGAAT TTCCGAGTGC ATGTGACGCC GATTGAGGAG CACGAAGTCG ACCGACCGAT GCAGAGTGCG TTTTATAGCG AATACGACTC GCGCAAAGAG ACAGGGCTGA TCGGTCTGAA GAATCAAGGG GCGACGTGTT ACATGAACTC GCTCTTGCAG ACGCTCTATC ACATTCCCTC CTTTCGTCGC GCGGTCTATC ACATGCCTAC GAACGAAACG GAAGAGGCGC ACACATCGAT GCCATTAGCG TTACAATCGG TGTTCTACTG TCTTCAGTAC GCCAAAGAAG GCGACGTGAG CACGGAGGAT TTGACGCGAT CGTTTGGATG GGACTCTTAC GATTCCTTCA TGCAACACGA CGTACAGGAA CTCAACCGTG TACTTCAGGA TAAGCTTGAA GAGGCCATGA AACAAACGTG CGTCGAGGGC ACGATTCAGA AGCTCTTCGA AGGGCACACG ACGAACTTCA TCGAGTGCAT CAACGTTGAT TACAAGAGCG AACGCAAGGA GGAGTTTCTC GATCTTCAGT TAGACGTCAA GGGATGCAAA GATATTTATG CGTCCTTTGA TCGTTACACT GAGATTGAAA AACTTGATGG CGAGAACAAG TATCGCGCCG AGGGACACGG ATTGCAAGAC GCTCGCAAAG GCACGCTGTT CCACGACTTT CCTCCCGTGT TGCAGATTCA GCTGAAGCGT TTTGAGTACG ATTACCAACG AGACACCATG GTGAAGATCC ATGATAGATA TGAGTTCCCC GAAGAGCTCG ATCTCGACGT GGGTGATCGT AAGTACCTCG TTCCCGAGTC CGACAAGAGT GTTCGCAACA AGTATAAGCT TCATAGCGTC TTAGTACACA GCGGGGGGAT AAATGGTGGA CATTATTACG CGTTTGTCAA GCCCAATTTG CAGGCGGAAG ATGCGCAGTG GTTCAAGTTT GATGACGAAC ACGTGACGAA AGAAACCGCA GAAAAGTCGG TGGTGGAACA GTACGGAAGC GGCGGCGCAG CCGCGGTCGA TAGCGATATG GATGCAGACG ATGACTCGAC AAACGTCCGC GTGGCGCCGA ACTTGCGGTT CCAAAAAGTG AGTAGCGCCT ACATGCTCGT GTACATTCGA GAGGATGACA TGGATCAAAT CATGTGCGTC GCCAATAAGT CTCATCTCAC GGAATACCTC CAGGCCCGCT TCGCTGAGGA GCAGAAGGCA AAGGAGAAAG AGGCGCAGGA AAAGAAGGAG GCGCATCTGT ACACCATCAT CAAGGTTTTG ACCAGGCAAG ATTTAGAGCG GCAGATCGCT TCGGAGAAGT TTTTTGATCT GGGCAATTTT GAGAGCGCGC AAAGGTTCCG ACTGCATAAA AAGTCGACAT TTACAAAGTT TAGGGAGCTC GTTTCCGAGA AGCTCGGAAT ACCAGCTGAG AGGCAGCGGT ACTGGACTTA CTCGCCACGA CGCAACAAAA CATCTCGGCC AGCCACCGCG CTTCCAGACC ACGCAAATAC CCCACCGACC TGGACGGTTG AAAAAACACG GCTGAAATAC ACGGTGCCTA ACTCACAGCA TGCTTCATCA GGCGAATTCA GACTCTACCT GGAAGAGCTT GACGATGATG CGTTCGCGAA CAGTGACCCA GAAAGAGACA TTTGGTTGCA CGTCAAGCTT TACAATCCGC ATGAGGCGCG GTTGAGTTAC TGTGGCACGC TTTACGCGAG TCCCGAGGAG ACGCTCAGTA CCTACATGCC GAAGATCAAA TCCATGGCAG GGTTTGCCAG TACCGCGTCA ACGCTCATGT TTGAGGAAAT TGCGTTCGTT CGCGAAAGTA AGATTCAAAT TGATCAATTA TCGGACAAAC AAGTGAAAAC ATATCCGTTG AGCGACCCCG ATGACGGTAG CAAGACGTTG CAGCTCGGTA ACGGTGATAT CTTGCTTATT CAACCAGAAA TCACTGAAGA CATGGAAGAT TCACTAAAGT TCCCCAACGT GGTTCAATAT GCGGACTTCA GACACAATCA TCAAATCGTT CACTTCAGAG AGCTGGAGGC CCCTAAGGTG GACAAAGTGA CTCTGGAGTT GACGAAAATG ATGAGTTACG ATCAAGTCGC TGATGTTTTG GCTTCGGCGA TCGGTTTGGA CGATCCTCTT CGATTGCGCT TCACCGCGCA TCACGTGTAC ACGAACGGTC CAAAGAGTGC AAGCTTCCAA TTTAGAGGCG CGGATACTTT GATAAAAATG CTGGAAAACC AGCAAAGCGA CGTGCTCTAT TACGAAGTCT TAGACATGCC GCTGCCCGAA CTCCAGGAAT TGAAGACGCT AAAAGTTTTC TTCCATGGAC TGAACACAAA GCTCGTTGAA GAATTCCAAC TGCGTCTGTC GAAGAGCGCA GCGGTGAAGG ATGTCCTCGA AGAGGTCAGA TCTAGGCTCG GTACTCGAGT CGGCGGTCGC AAATTACGTC TGCTTGAACT TTTCTACTCG CAAATCTACA AAGTTTTCGA GGAGGAAAAG GATATCGCAG ATATCAACGA CCAATATTGG ACGCTTCGTG CGGAGGAAGT TCCCGATGAC GAGTCAGAGG AAGACAGACT CTTGCGTGTG TACAACATTT CCAAAGACTT GTCCAATCCT AACCAGTTCT ATGCCTACGA TGAACCGATG TTACTTCGCA CGTGTGAAGG TGAGACGCTC GGGGAAGTGA AGGCTCGGAT CAAGACGAGG CTCGAAGCGA CGGATGAGGA CTTCGCAAAA TGGAAGTTCT ACATCGGCCA CCCGCCGCGG TATGAAATCT TGGACGACGA CGAGTTGGTT ATATCGAGTA AATTGGTTCG CATCGCCAAA GAGGGCTTTT GCGAATCGAC GCTGGGGATC GAGCGCGAAG TTAGAGGTCC GCGAAGGCCG GCTAGCCGTC AGGGGAAGCC GGCTGGATTT GAGCGAGCGA TCAAAATCAT GTGA
|
Protein sequence | MSSGSEDSEL AFSTAEPTTF SWRAEFSRWK KRDAKVVSQT FECGDTLFRL AMYPFGSNLN SKSETPAQVS LFLDTGATKP RRIEDDMSRE WRRHAKFELQ LLHPTDASVV ESKETSHTFD RREADWGFAS FITREDVFEK GYVDAEGCVN FRVHVTPIEE HEVDRPMQSA FYSEYDSRKE TGLIGLKNQG ATCYMNSLLQ TLYHIPSFRR AVYHMPTNET EEAHTSMPLA LQSVFYCLQY AKEGDVSTED LTRSFGWDSY DSFMQHDVQE LNRVLQDKLE EAMKQTCVEG TIQKLFEGHT TNFIECINVD YKSERKEEFL DLQLDVKGCK DIYASFDRYT EIEKLDGENK YRAEGHGLQD ARKGTLFHDF PPVLQIQLKR FEYDYQRDTM VKIHDRYEFP EELDLDVGDR KYLVPESDKS VRNKYKLHSV LVHSGGINGG HYYAFVKPNL QAEDAQWFKF DDEHVTKETA EKSVVEQYGS GGAAAVDSDM DADDDSTNVR VAPNLRFQKV SSAYMLVYIR EDDMDQIMCV ANKSHLTEYL QARFAEEQKA KEKEAQEKKE AHLYTIIKVL TRQDLERQIA SEKFFDLGNF ESAQRFRLHK KSTFTKFREL VSEKLGIPAE RQRYWTYSPR RNKTSRPATA LPDHANTPPT WTVEKTRLKY TVPNSQHASS GEFRLYLEEL DDDAFANSDP ERDIWLHVKL YNPHEARLSY CGTLYASPEE TLSTYMPKIK SMAGFASTAS TLMFEEIAFV RESKIQIDQL SDKQVKTYPL SDPDDGSKTL QLGNGDILLI QPEITEDMED SLKFPNVVQY ADFRHNHQIV HFRELEAPKV DKVTLELTKM MSYDQVADVL ASAIGLDDPL RLRFTAHHVY TNGPKSASFQ FRGADTLIKM LENQQSDVLY YEVLDMPLPE LQELKTLKVF FHGLNTKLVE EFQLRLSKSA AVKDVLEEVR SRLGTRVGGR KLRLLELFYS QIYKVFEEEK DIADINDQYW TLRAEEVPDD ESEEDRLLRV YNISKDLSNP NQFYAYDEPM LLRTCEGETL GEVKARIKTR LEATDEDFAK WKFYIGHPPR YEILDDDELV ISSKLVRIAK EGFCESTLGI EREVRGPRRP ASRQGKPAGF ERAIKIM
|
| |