Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_17395 |
Symbol | |
ID | 5004526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 448943 |
End bp | 451909 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | |
GC content | 54% |
IMG OID | 640419947 |
Product | predicted protein |
Protein accession | XP_001420511 |
Protein GI | 145352347 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0142348 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCCCG GTCGTCACTT TGTCGACGAC CCTCGATGCG CGCGTGAGGA CATGACGAAA CCGCGCGAAA CGACGACGAC GCGCGCGCGC GCGTCGTCGT CGTCGCGACC CGCGATCGCG CTCGGTCGAC CCGTGCGCGC GCACGTCGAA CCGCCGCCGA GGACGCGCGC GGAGGCGCGA GACGGGACGC GATTTCCGAG CGAAAAACTG ACGCGAGAGC TGGTGAGACG AGGGTACGAC GAAGAAAAGC ATCGCGAGTG GCGAATCGCG CGGACGGTGA AGACGATCGA TGGGAAGACG ACGTTCGATG ACGGGACGTT TATGTCGCCG CGAGCGCTGC AAGAGATGTT GACGCGAATG GCGACGAGCG AGACGCTGAA AGCGATACGA GCGAATGCGA ACGAGGGGGC GAAGGAAACG CGCAGAGTGC CGAAACGAGG ACGGAGAGAT GATGGGAGTT ACGACGTGGA GTTACGCGGG GAGGCGTTGA AGACGTGGAG CGCGCCCGAG TTGGTGCACG CGCGCGCGGA GGAGTCGGCG GCGTTTTTTC GGGATTTGGC GAGCGACGTC GGATTGCACG CGCTCGAGCG ACGCGTGCCA CACGGCGTGC GGAAGCGAAA TTTGTTTGAA ACCTTGCATG GGTACAACGT TCCGATTCAT CGCGCGATGT GGATGGTGAA GGTGAACTAT CTATCGCAGT GTAAACGAGA TTTTGACGCG TGTCGACGCG CGTGGACGGA GGACTTACTC GCGCACATCT TCGATGTATT GCGCGACGAG GAAATGGTGG CACAAAGTGG CGAAGGTGTC GAGACGGAAT TAAAGTACGT CCTAGCGTTA GCGAACTATT CAGTCGAGGA AGACTTGGTG GATCAGCAAA AGTACTTGCA CGAGTTTCTG CGATTTATGA ATGAACAAAA GCGGCAGAGA ACTGGAAGCG CTCATGATTT TGCTCGGGCG CTGAGCCCAG CTTTACGCAC GCTCGTGCCT CAGGCTTCAA AGTCGCACGC AGATTCAGTT GGTCTGGTGG ACAGAGTCTC GTGGTGTTTA CAGGAGATTT TGAAAGAGAG GGGACAGTCG GACAAGTATC TCGTGATAAG CTTAAGTGAT GTCGTGGCCT CCGTCGCCGC CGCGAATATG GATGCGTTCG TCGCCGCACC GCGGGACGGA GGCCTGAACA TCATCACTAA TTTGCGGCGT GACTTTCAGA GAAAGAAAAT GCCGTTGTCA CAACGACTGT GCAGTGTGCT CGACGACGTC GACAATCGCG TGAAATCGCT CGCGCAAGCG GCGAGCCCAG AATTGATCAC CATAAAGGCC CGAAATCTTA TTGAGACGTT ACACAAGTTG CTGGAAAGTA ATCTAGACCA AGTAGCGTCG CGTACGATCG CGCGAAAGTA CATTGATCGA GAGGAAAGTG AAGAGCTTGC GATGAAGTCT ATGGTGAAGA CTTCGTGCGA TTGGTGCATG GATGTTTCGG AAGACGAGGC AATGAAGCGT CGCACAGCGG TACGAATCTT TTTCGTCGAA CTCGCGGAAA CTTCTTCTAG ATTAAATATA TACGTGTTTG ACTGGATCAA AGAAAGAACC ATGATAGTAG CGCGTGAAGA TTACACCAAG TGCATCTCAG AACTGCTGAT TGAGTTGCTG TGGGGCAAAG TCGTGGACCT TTCTCAGTTG TTGAATTTCA TTATGGTGGA GGGTATCGTC GAGCACAAAG ATTCAGGCGC GTCGCAGAAA TCGCTCGCGG CGTTTCGCGA ATACATGAGC CAGATCGTGC GAAACAATGA CGAGCGAGAG TCCAAACTGG ACGACTCGAT GGTGCGCACT CTGAAACAAC TCCTCGAATC GGCGAATGAA ATGTCAGACG CCACAATATC GAGGGCGCTC ACCCCGCCTC ACTCTACGGC AGATGTGTCA AAGCATTTTG AAATCTCTAC AGACGAGGAA AAGCGACTGC TTGAATTGTT TCACGCGGTC ACCTACGCTG ATTTGCTTTC CAAGCTGGAG GAATTGAAGT GTGTGCACGA TGAATTCAAA TGTTCGCGAG ATGCGCACTC AGTCTTTTCG CGAGTTGTTG TCGGTGCGCT AGTGCTAAAG CCCTCTCGAG TCAGATTACT TGCCGAACTC GTGGGTGGAA TAGGCGACGA CGTCACAACG GATGTCCTTT CGTACGTCGG TGGCAGAGTT TGCGATGATG CGAATGTTGA CGGCGTACTT CAAGGTGCTT ACTGGGATGA CTCAAACGCT GCGCGTCGAT GGGAAATCGC TCAGAGCCTT GTCGAATTTC GGCTATGTTT ACTACATAAT ATCGTCAAAC GCAAGCGTGA CGCGGCGATA GGAATCGTCA AGGGTGCGAT TCAACAGCTC AAAGAATTGG CTCTGTCGAC GCGAGACGTT TCGCACAAAC TACTCCTCAT TTGGGTGCAG ATTCTAACAC TCATCCCGTT GATTGGTTAC GTGCTCATGT CCAATGAACT CAAAGATGAT TTTTTGCAGC TCATTGTCGA AGTCCTCGAC TCATTTATCG GTAAAACGAT CGTCGAGGAG TCTGAGGAAG ATTTGGTAGC CGAATCATTT GCGGGCGAAT CGCTCGTGGA CCGTCTCATC GCTCTGTTCT CTGTCATCTG GAGGAGCGAA GTGCCACAGT GGTTGCCAGC CATCCCAAAA CTACCGTTTT GGGACGTTTC GTCGAAGATG GTTCCAATGA CGGAGTTCAT TGATTCCAAA ACCCTGGCCG GCGTCGTCTG CGTGCGCCTG AAGCGGGCCA TTGGAGGACC AGCGCGCGGC AAGCTTGGCG ATGTGAACGC GTGGAAAACT TTAGCGAGTG GAGCGTCTAC GCTCAGCGCA AGATCGAAAA TCAGCGAGAA GGCCGAATTT TGGCTACAAG GGACAGTGCG GCGCCCGGGA GGTAATCTTG CGTGGGAGAA CGTCAGTGCC GAACCAAACG CGGATCATTT GAAATGA
|
Protein sequence | MRPGRHFVDD PRCAREDMTK PRETTTTRAR ASSSSRPAIA LGRPVRAHVE PPPRTRAEAR DGTRFPSEKL TRELVRRGYD EEKHREWRIA RTVKTIDGKT TFDDGTFMSP RALQEMLTRM ATSETLKAIR ANANEGAKET RRVPKRGRRD DGSYDVELRG EALKTWSAPE LVHARAEESA AFFRDLASDV GLHALERRVP HGVRKRNLFE TLHGYNVPIH RAMWMVKVNY LSQCKRDFDA CRRAWTEDLL AHIFDVLRDE EMVAQSGEGV ETELKYVLAL ANYSVEEDLV DQQKYLHEFL RFMNEQKRQR TGSAHDFARA LSPALRTLVP QASKSHADSV GLVDRVSWCL QEILKERGQS DKYLVISLSD VVASVAAANM DAFVAAPRDG GLNIITNLRR DFQRKKMPLS QRLCSVLDDV DNRVKSLAQA ASPELITIKA RNLIETLHKL LESNLDQVAS RTIARKYIDR EESEELAMKS MVKTSCDWCM DVSEDEAMKR RTAVRIFFVE LAETSSRLNI YVFDWIKERT MIVAREDYTK CISELLIELL WGKVVDLSQL LNFIMVEGIV EHKDSGASQK SLAAFREYMS QIVRNNDERE SKLDDSMVRT LKQLLESANE MSDATISRAL TPPHSTADVS KHFEISTDEE KRLLELFHAV TYADLLSKLE ELKCVHDEFK CSRDAHSVFS RVVVGALVLK PSRVRLLAEL VGGIGDDVTT DVLSYVGGRV CDDANVDGVL QGAYWDDSNA ARRWEIAQSL VEFRLCLLHN IVKRKRDAAI GIVKGAIQQL KELALSTRDV SHKLLLIWVQ ILTLIPLIGY VLMSNELKDD FLQLIVEVLD SFIGKTIVEE SEEDLVAESF AGESLVDRLI ALFSVIWRSE VPQWLPAIPK LPFWDVSSKM VPMTEFIDSK TLAGVVCVRL KRAIGGPARG KLGDVNAWKT LASGASTLSA RSKISEKAEF WLQGTVRRPG GNLAWENVSA EPNADHLK
|
| |