Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_92778 |
Symbol | |
ID | 5001786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 572592 |
End bp | 577742 |
Gene Length | 5151 bp |
Protein Length | 1716 aa |
Translation table | |
GC content | 52% |
IMG OID | 640417207 |
Product | predicted protein |
Protein accession | XP_001418052 |
Protein GI | 145347178 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.217653 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGAAGG CTTCGCCGCT GTCGGCATGC ACAGCGGCAA ACTGTGTGAG TCCCGCTCAT CTGGTCGATA GCACCAGTAC GAAGTTTAGG GTGCACGCCG ATGGCCTATC ATCGACAACA TCCAAAGACA TAACGTACGT GTCCACTTCT GAAATCACCG AGACGCTGCC ACCGAGCTTG AGTGCCACGG AGTTGCCAGC GTCGGTGACT ATTCTCGGCG CTTGGTTGGC GAGCGCATCT TGCGATGGGA TCGCTCTCTC GCTCAACTCT ACTTGGGCGT CCGAGTTTGC GTGCACGCTC GACCCTGTTG GCGTTGGTTA CACTGCCGTG AGCGTGATTT CGCGCGGCCA GACGATGAGC GTTTCGTATT TGATCAAAGA GACGCCACTA TTGCTGAGCG TTTCCCCTCC TGGCGCGTCA ACTATGCCTG GTGAGCTATT CACGTTGACC GTGCAGCACT TTATCGCTGA TGATGCCGAT CAATTCCACT GTTTATTTGA CGCCACGAGC GCGGTTGCGC CACACATAAT CTCCTCTAGT TTGATCCGGT GCGAGTCAGT CGCGACGACA AAAGTGTCGA CGCGCTTGAC CATCGAGGGT GGCGATGGCG CTTACCCACT GTCTCGACAA GCTGCTCCGG TGGTCTCGAG CATCGCGCCA TCGTCGAGCG GTGATATTGG AGGGACGTTG GTTACGCTCA CAGGCACGAA CATTCCTCTC ATAGACAACT CTGCCGTGTG TTCGTTTGGT TCAATTGGTC CCATCGCTGC GCAGTACGTC AGTTCACTGG TTGTGACGTG CGTGTCCCCG GCTGGTGTTT CATCGGCGAG TTCGAGTGTA TGCGCGAGTG TTTATAGCGC ATTGTCGCCA TCGAAAAGCT GTGCGTCAAG CCCCATGACT TACGTTGCGC CAGTAGATCC GCCAGTTATT TTGGATCACG GCGTTTCTAG CAAGCAAGGC GGTTACTTCT TCTTGTGGAG AAGCTCGGCG TCCTACTTGC TAGTTCCTAC TTTGTCCTTC GTCGAATTCG GTCGAGGAAA CGCGACAATG TCGAGCACCA TAGTCAGCGT GTATATCGCG CCGCAACTAC CTGGAGGCTT CACTACAGTC TCAGCCATAG ACGAAGTTGG AGGTGTTTCT TTCGATCAAG TCATGGTGCA ACCGGTGCCG ACGATAACTG GTTTGAACCC ACGAGTCACA CCCGCTGCAG GAGGCAGCCA AGTGTGGATT TCAGGCACAG ATTTAAAGAG TGAAGTTCTT AAACTCAGCG TTGATGATGC GAACGTCGAC TACAACATTG TCTCGTCTGC TTTGATCGTC ATTCAAACGC CGAACCACGC TTCTGGAGGC TCAATTATAA AAGCTCAACT TGGCAGTCGG ACAGATTCCA GTGGCGTCGG CATCGGCCCT TTCACCTCGA CTGGTCTTGA TTATATCGAT GGTATAGATC TGACGAGCAT TAGCTCGAGC TTAGGACCCA CTACGGGTAC TATGAGCGTT TCTCTGGTTG GTGAAGGGTT CTCCGATACC TCTGAGCTCG GATGCAGATT TGGCACCTTG GGCCCCGTTA CAGCGAGCTA CGTGCACGGT CGTGAGATTC GATGCGCTGT TCCTAATCAT ATTGCTGGAC AAGTGCCAGT TGAGGTGTCT GCAAATCGAC GCGATTTTAC GTTCAACTCG ACTATACCTT CAAATGGAGC GGTGTGGGGT ACTTCAACCT ATTATTACTG GAGTGCCTCC CCGGGTGGTA CACCGACAAC ATACAGAGGC GTCACGTACA CTTATGCAAA GTCCCCATAT AACGTCGATG CAGTCATACC AGCGCACGGA CCAATGCATA TCACGACTGA ATACACAATA TTCGGCGCGG GATTGGACTC TGCCGTTACA GACTTGTGCA GCTCGTTAAA CGTCACCGCA AGACAAGTCT CATCGACCGA CGGTTCCATC ACGTGTTCCT GGGGTGCAAG CGCAGTGGCG GGTTTCGTAC AGGTTGGAGT CTATGCAATC GTATCCACAT ACACGCAATA TGATTCGACA CAAACGCAGT TTGAGTACTA TGATCCTCTA TCGGTGACTA GTGTGGATCC ACTGGTTGGA CCCGTGGATG GTGGCACGGT TCTATTCATG TCTGGCGCAA ATTTCCGCAA AGGCGACGTG AGCCGAGTCA TGTTTGGAAG TACCGCCGTC GTTTCGCACG TCGTTTCTTC TGCTCTCGGC ATTATCGAAA CGCCCACATT CGCGACGAAA GGAACAAAAG CGCTTTACGC GTCGACGCAT GAGCATTCGC CATCGACTGC CTTCACTGCG CTCGATCAGC TGTTACTTTC ATCCGTCTCG CCCACGTTTT CCCCTATCGA AGGGGGGTCG ACATTGATCG TCAGTGGTGA TAAACTTTCA GCGCCGTCCA CTATTTGGTG CCGTGCCGGC ACAATTGGTC CTTTTTATGC ACGTGCATAC GATTCCAAAG TACTCAAGTG TATCTCTCCG GCTCACAGCA AAGAGAGCGT GCATCTTCAA ATTTCCATGA ACAAGCGAGA CTGGCGTTAT GAGCTCACGC TCAGTCTTGC GAACCCCGCG TTACCATCTG GTACGTCGGT GGATTCTATT TTGACGCTTG ATTACGTTTT CCCAGCGAAA ATCAAGGATG TCCTTCCTCG CGCTGGTCAC GTGACGGATC AAGTTGCGAC CGTTGAGGCC TTTTACGAAG CGCCGTATCC AGCTCAAAAC GTCTCCTCAC GAGGTTGCCT CACGGGAGGT ACTACATTAA CGAGTACTTA CTCTGTGACG ACGAATATCT ACTCCATTCT ATGCACGTTG GAACGAGATT CGTTCATCGA AGGAATGTAT CCCATCATAA TCACAGGACC TGATGCCGCC AATACGTCGT TCTCTGGTAG TTTCCATTAT GTCACGTCGC CGACAATCGA CGCGTGGGGG CCAGAGGTGA TTCACACCGG CGGCGGCACA TTAGTCTCTG TCATGCTGGT CGATGCGATT TCGGACGGAT TAGTGTGCAT GTTCTCGCCT TGGTATCAGA ACGAGTTCAC CGCGACGTTC GCCATCGATG CACACTTTGT GAGTTCATCG TTGATTATTT GCGAGTCACC GTACGCAGAT CCCCTGATTG AAATTGGCCT GACGGCTGGC TTATTGGGTA CGACACCGGA AGATGGGCAA GCCGAGCTTA ACTCGATCAT GCGACCAGCT ATTACGAGTT TACAAGCGGA TAGCTTATTC CTCGATGGCG GTAGCGCCAT GAACATTTTC GGAACTGATT TGGGTATGCA AGTGTATGAT CTGTATGCGT CACTTGGTAC GATTTCGCCT CTCGCTTTGC GCTGGGTGAC CGCTGCACAA GTCGAAGCAA TCAGCCCCGC GACTGTCAGT GGAAATAAAA TGCTGTTTTT GTCGCACGCC TTATCGGCGA GAAGTGATCC ATACGAGGCA GAGTTGACGT TCTTCAAGCC ATTCGAACCG AGTCCGCTCA TCCCGAGTGT CGTACCGGCG AAGGACAACG TATTCGTTCG TTTGCACGCG CACATCGGTA GCCTGCCTTC GTCAGTACCG ACGTGCAATC CAGCTGCGCC GCAGTACACG CTCTGTCAAG AAGCCAATAA CATAGGCGGA ATCCTCACGC AAATTCACTC GGTCAACTTT GCGATACTTC AACTCCCAGG AGGAGCAAAT CTGACTGTGA ATGTCCCGCA GCTCGCTTAC GTGGATGGCG CATCATCCGC CACTGTTCAG CCAAACATTG CGGTGACGGG AGGTGGCACT GTCGCGGTCG TGTCAGGAGT GAACTTCGTC GAGGGCATGA CGTCGATTCG TCTCGGCGAT GTGGCACAGA CGAACCCATA CATTTCACAG AGCTTTTTGT CTTCGGCGTT GATTCGCTTC GAGGCCCCAG CGGGCGCCTC TGATGCGACT GCCAGCATCT ACACATCCAC AGGTTTCGTC GATTCCGATT CGTGGGGCTC GGCAGGAGCC GTTATGAAGT ACCGCAGTTT GCCAAGCATG ACGAATGCCG CCGCTTTGGA AACTCTGGAA TCAGGAGGCA GGCTCGTCAA GTTAACTGGA AGTGGGTTCG TGTCGAACCG CGATTTATTT TGCAAGTTTG GTGAAATTCA TGTTCGAGCT ACTTATGTGA GCGCTACGAT TTTGAATTGC GTCGCTCCTG GTCTTAAACC TCAGACTTAT TCGGTCAGAG TGTCAAACAA CATGCTGGAT TACAGCAAAT TTGTCAACTC TGCAAGTTCT GACAGCGACA CCGACGCCAC AGTCACTCCT GTACCAGACT TCACTGGTGG TATCAATTCC GCCACGAACC TCTTCGGACC CAACTCAGGA GGCACATTAA TTTCATTCTC GTTCAGTGCT TCTGTACCCT CTTCGTTAGC GTGCAAGTTT TTTTCGAGAT ATGGAAACGG TTTCATCAGC GATTCGAGCA CGGCCAAGTG CTTGACACCA TCGAGTGACG CAGGCTTTGT GCCAGTGCAA CTGTCGCCCA GTTCCGAAGC TGGGACTTCG TACACTGCAA TTGGGATTCA GTTCGAGTTC CAAGAAGCAC CAGAAATCGA CATGGTATAC CCGGAGATGG GCGTCTTCGG GGGTGGGACA GTCATCAATG TGCACGGAGA CAACTTGATT CAATCCGTTT CTGTCGCCTC ACATGGATCG CCGATGATGC CAGGAACGTC GGTGGACGGC TTATCTTGCC GCTTCGGCGG GATGTATACC GTTGGTGCCG TTCATGTGTC TTCCACGATC ATGCGATGTG AGACTCCCAC GTTCTCCATC GGTTTGATGG ACGCGCCCCT TGTGGTTGAT CTCTCCTTGA ATGCGGACGA TTGGACCGGT TCTCAAATCG TCTTTGAACC GATTGAAAAT ATGCCGCTCT CTTCGTTGTC ACCACTTGCT GGGACTCGCG CTGGTGGTAC CACACTCACT GTTGCGAGTA GTTATTTCCC TCCAGACACT CCAGTTTGGT GTAAATTCGG CACCACTGGT CCCATCCACG CGCTGTTCAA CGGCGACGGA AGCGTGCGAT GCAAATCTCC TGCAAAAGCA GAAGGAGACA TTCCAATCGC GATTTCGCGC GGTAATCCGA TCGATTTCGC ATTCGACTAC ACAAAAATAT TCAAAATGTA G
|
Protein sequence | VKKASPLSAC TAANCVSPAH LVDSTSTKFR VHADGLSSTT SKDITYVSTS EITETLPPSL SATELPASVT ILGAWLASAS CDGIALSLNS TWASEFACTL DPVGVGYTAV SVISRGQTMS VSYLIKETPL LLSVSPPGAS TMPGELFTLT VQHFIADDAD QFHCLFDATS AVAPHIISSS LIRCESVATT KVSTRLTIEG GDGAYPLSRQ AAPVVSSIAP SSSGDIGGTL VTLTGTNIPL IDNSAVCSFG SIGPIAAQYV SSLVVTCVSP AGVSSASSSV CASVYSALSP SKSCASSPMT YVAPVDPPVI LDHGVSSKQG GYFFLWRSSA SYLLVPTLSF VEFGRGNATM SSTIVSVYIA PQLPGGFTTV SAIDEVGGVS FDQVMVQPVP TITGLNPRVT PAAGGSQVWI SGTDLKSEVL KLSVDDANVD YNIVSSALIV IQTPNHASGG SIIKAQLGSR TDSSGVGIGP FTSTGLDYID GIDLTSISSS LGPTTGTMSV SLVGEGFSDT SELGCRFGTL GPVTASYVHG REIRCAVPNH IAGQVPVEVS ANRRDFTFNS TIPSNGAVWG TSTYYYWSAS PGGTPTTYRG VTYTYAKSPY NVDAVIPAHG PMHITTEYTI FGAGLDSAVT DLCSSLNVTA RQVSSTDGSI TCSWGASAVA GFVQVGVYAI VSTYTQYDST QTQFEYYDPL SVTSVDPLVG PVDGGTVLFM SGANFRKGDV SRVMFGSTAV VSHVVSSALG IIETPTFATK GTKALYASTH EHSPSTAFTA LDQLLLSSVS PTFSPIEGGS TLIVSGDKLS APSTIWCRAG TIGPFYARAY DSKVLKCISP AHSKESVHLQ ISMNKRDWRY ELTLSLANPA LPSGTSVDSI LTLDYVFPAK IKDVLPRAGH VTDQVATVEA FYEAPYPAQN VSSRGCLTGG TTLTSTYSVT TNIYSILCTL ERDSFIEGMY PIIITGPDAA NTSFSGSFHY VTSPTIDAWG PEVIHTGGGT LVSVMLVDAI SDGLVCMFSP WYQNEFTATF AIDAHFVSSS LIICESPYAD PLIEIGLTAG LLGTTPEDGQ AELNSIMRPA ITSLQADSLF LDGGSAMNIF GTDLGMQVYD LYASLGTISP LALRWVTAAQ VEAISPATVS GNKMLFLSHA LSARSDPYEA ELTFFKPFEP SPLIPSVVPA KDNVFVRLHA HIGSLPSSVP TCNPAAPQYT LCQEANNIGG ILTQIHSVNF AILQLPGGAN LTVNVPQLAY VDGASSATVQ PNIAVTGGGT VAVVSGVNFV EGMTSIRLGD VAQTNPYISQ SFLSSALIRF EAPAGASDAT ASIYTSTGFV DSDSWGSAGA VMKYRSLPSM TNAAALETLE SGGRLVKLTG SGFVSNRDLF CKFGEIHVRA TYVSATILNC VAPGLKPQTY SVRVSNNMLD YSKFVNSASS DSDTDATVTP VPDFTGGINS ATNLFGPNSG GTLISFSFSA SVPSSLACKF FSRYGNGFIS DSSTAKCLTP SSDAGFVPVQ LSPSSEAGTS YTAIGIQFEF QEAPEIDMVY PEMGVFGGGT VINVHGDNLI QSVSVASHGS PMMPGTSVDG LSCRFGGMYT VGAVHVSSTI MRCETPTFSI GLMDAPLVVD LSLNADDWTG SQIVFEPIEN MPLSSLSPLA GTRAGGTTLT VASSYFPPDT PVWCKFGTTG PIHALFNGDG SVRCKSPAKA EGDIPIAISR GNPIDFAFDY TKIFKM
|
| |