Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31402 |
Symbol | |
ID | 5001272 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | - |
Start bp | 821693 |
End bp | 824805 |
Gene Length | 3113 bp |
Protein Length | 900 aa |
Translation table | |
GC content | 59% |
IMG OID | 640416693 |
Product | predicted protein |
Protein accession | XP_001417611 |
Protein GI | 145346262 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0818576 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGTCCCGCG CGACCGCTCG CCGCGCGCCG CGAGCGAGCG AAACGACGCG CGAGTTTTTA CCGCGCGCGT CGACCCGCGA GCGCGACGCG ACGCGCGTCG GACGCGACCG CGAACGCGCG CGAGAGGCGA TCGGAGATTC AAACGCGATG AGCCCGCCGG GGTCGTCGGC GCGACACTCG CCCGCGGCGT CGGGGGCGTC CGCCGAGCGG TGGTTGCAGA AGGCGTTCGG GGAGGCGAGC GTGAACGTCG TCGATCGAGC GTCGGCGCGA CGGGAGGAGG TGCGCGGGAC GTCGCGCGCG CGCGCGCGAA CGGGTCGAGG GCGACGCGGG ATTTCGAAAC GGTTTTTAAA CGGTCGAATT CAGAGGATTC GGGCGGACGG GCGCGGAGAC GCGAGGTCGG AGACGCGCGC GGGTGATTCG AGGACGGAGA ACGAGACTGA CGAATGATGC GCGGTGGTTC ACACGCAGGC GCGCGAATCT TCGTACGGGG GTGCGAGGTA CGAGCGGGAG ACGGAGCGAC GCGCGGCGCC GGGGACGGCG GCGAAGCCGG CGTATTGGGC GCACGGACAA GCGGTGGATT CGAGTCAAGA CGATGGACGG GCATCTCCCA AGCGGTTCTT TAACAAGGCG AACGAGCGGT TGTACGAGGC GTATTCGCAG TTGCACACGA TGGCGCAAGA GTTTGATAAG CCGTTTGATT CGCCGGCGAT TCTCGTCGTC GGACACCAGA CGGATGGAAA GTCGGCGCTC GTGGAGGCGT TGATGGGGTT CCAGTTCAAC CACGTCGGTG GTGGAACGAA GACGAGGCGT CCAATCGCGA TCAACATGAA GTACAACCCT TCCGCCGTGG AGCCGAGGTG CTTTTTGATG AAGGACGACA ATCTTGGGCG CGAGGACGAG CTCTCGCTTC CCGAGTTGCA GGCGCACATT GAAGGCGAAA ACAGACGATT GGAAAACGAG AACGGGTTTT GGGCGAAGGA CATCGTCGTG AAGATCGAGT ATAAGTATTG CCCGAACCTC ACCATCATCG ATACCCCCGG TTTGATCTCC GCCGCGCCGG GCCGAAAGTT TTCTGGTTTG CAACAGTCTG CGCGCTTGGT GGAGGATTTA GTCAAGCAAA AGATGCACCA GCGAGACTAC ATCATCTTGT GCCTGGAAGA CAGCTCAGAT TGGTCCAACG CCACCACTCG GCGATTGGTT CTCGAAGCCG ATCCCGAGCT GCGACGTACC GTCGTGGTGA GCACCAAGTT CGACACGCGA ATCCCGCAAT TTTCTCACTC GCAAGACGTC GAAATGTTCT TGCACCCGCC GGCGCGTCTA CTCGAGCCGA CCGTCTTGGG TGGTGGTCCG TTCTTTTCCT CCGTGCCCTC TGGCCGCGTT GGACTCGCAC GAGATTCGAA GTATCGCTCG AACGATCACT ATCGGGAGGC CGTTCTCGAG CGCGAGCAAC ACGATATCGC CGAACTCGAG CGCCGCTTGG ATCGTCACCT GTCTTCGCAC GAGCGCGGCC GCATCGGCGT GAGCCAACTG CGATTTTTCC TCGAGCGATT GTTGCAGCAG CGATACTTGG AGAACGTTCC CACGATTGTT CCCGTGCTCG AACGCGAGCA TCGCATCGCC ACGAGCAAGC TCTCTGAAAC CGTGCAAGAG TTGGCGGACT TGAACCAAGA TCACCTCAAG GAGAAGGGTC GCGCCTTTTA TCAGCACTTT TTGGAAAAGA TTCCGGAGAT TGTCCGAGGT ACGCTCGCCG CGCCGCCGCG CATCTTTGGC GAGTCTCTCG CGCACGAGCA CATTCGTGGC GGCGCGTTCG TGAACGCGGA CGGTCGACCT TGCATGCCGC AGCAACCCGT GCCAAACGCC GATATGCGGC TCTTCGGTGG CGCGCAATAC CATCGAGCCC TCGAAGAGTT CCGCTTAATC GTCAACGCCG TGGAGTGCCC GCCGGTGAGC CGAGAAGATA TCGTCAACTC GTGCGGTGTT GATGAAATCC ACAACGGTGT CAATTACACG CGCACTGCGT GCGTCATTGC CATCGCTCGG GCTCGAGAAA CGTTTGAGCC CTTCGTGCAC CAACTCGGCT TCCGTCTGTC TCACATCGCG CGACGAACTT TGCCTGTGGC CATGTACCTT TTGCAAAAGG AGGGACGAAT TTTGAACGGG CACGAAGTCT TCTTGAAGAA GATTGGCGGC ACTTTCGCTC GATTCGTGGA CGACCGAGTC AAGGAATGCC AAGAAAAGTG CCACGAAGAC CTCAAGTCTA CGACTCAGTT CGTCACTTGG TCGTTGCACT CTGGAAACAA GGCTGGTCTA CGAAACGTCT TGGCGCCGCA TGAAGGCGAA CATGACGCGA AAGACGAAAA GCGCCGCTCC GGTGGCGAGC TCGTGATGTC GGATAAGGAG TTGAAGAACT GCAAGCTCAC GGATTTGGTT GAAAACACGC TCTGGAACAG AACGATGAAG TCGGTCACGG TGGATATTGT TGACATGTTG GTTCGTCAAA TCTTCTCCGG TATTCGCGCA CACATCGTTC AGTCGGTCGA GCTCAAGTTC AACTGCTTCT TCCTTATGCC TCTGCTGAAC GACTTCAATA GTTTTTTGCG CAACGAGATG GAAGACGCGT TTGAGATTAG TCTCGACTCC GTCTTCGACG TCAAGAGCGT CCGAAGCGCG CTCGAGTCTC GTCGTCACAA GCTTGAGAGC GAGCTCGAGC AAATGGAGCA CATACAGTCC AAGTTTGCAT CGATCCACTC GCAGCTCGAG GCATCGAGCG GTAACTCTCC TGCAACGCAA GCCGCCGCGA AGGCAGCCGT TGGTGGGACG ACGTTCCACA GCGCAGCTGC GCGCGAGATG GCGGCGCAGG ACGAAATCCT TCGAGCGAGC CACGATTTGA GTGAGGGCAT CGCCGAAGCC AAGGCGCAAT TCGCGCAATA TTCGCCGAAT CCTATGAGAG GCTCGAAGGC GTACGAGCCA TCTCCCAGGC GCTCGTCGGC TGGCCGAAGC CGAGACCGCG CGCCACTTTC GCCGATGTTC CACAACTAAT AGGCGTTTTA TTTATCAAGA ATGACGAGGG GCAAAACGGT TGTAAAAGAT TCGCAGTTCT TGTATGTATT CAA
|
Protein sequence | MSPPGSSARH SPAASGASAE RWLQKAFGEA SVNVVDRASA RREEARESSY GGARYERETE RRAAPGTAAK PAYWAHGQAV DSSQDDGRAS PKRFFNKANE RLYEAYSQLH TMAQEFDKPF DSPAILVVGH QTDGKSALVE ALMGFQFNHV GGGTKTRRPI AINMKYNPSA VEPRCFLMKD DNLGREDELS LPELQAHIEG ENRRLENENG FWAKDIVVKI EYKYCPNLTI IDTPGLISAA PGRKFSGLQQ SARLVEDLVK QKMHQRDYII LCLEDSSDWS NATTRRLVLE ADPELRRTVV VSTKFDTRIP QFSHSQDVEM FLHPPARLLE PTVLGGGPFF SSVPSGRVGL ARDSKYRSND HYREAVLERE QHDIAELERR LDRHLSSHER GRIGVSQLRF FLERLLQQRY LENVPTIVPV LEREHRIATS KLSETVQELA DLNQDHLKEK GRAFYQHFLE KIPEIVRGTL AAPPRIFGES LAHEHIRGGA FVNADGRPCM PQQPVPNADM RLFGGAQYHR ALEEFRLIVN AVECPPVSRE DIVNSCGVDE IHNGVNYTRT ACVIAIARAR ETFEPFVHQL GFRLSHIARR TLPVAMYLLQ KEGRILNGHE VFLKKIGGTF ARFVDDRVKE CQEKCHEDLK STTQFVTWSL HSGNKAGLRN VLAPHEGEHD AKDEKRRSGG ELVMSDKELK NCKLTDLVEN TLWNRTMKSV TVDIVDMLVR QIFSGIRAHI VQSVELKFNC FFLMPLLNDF NSFLRNEMED AFEISLDSVF DVKSVRSALE SRRHKLESEL EQMEHIQSKF ASIHSQLEAS SGNSPATQAA AKAAVGGTTF HSAAAREMAA QDEILRASHD LSEGIAEAKA QFAQYSPNPM RGSKAYEPSP RRSSAGRSRD RAPLSPMFHN
|
| |