Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_30233 |
Symbol | |
ID | 5000450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 751210 |
End bp | 754407 |
Gene Length | 3198 bp |
Protein Length | 949 aa |
Translation table | |
GC content | 61% |
IMG OID | 640415871 |
Product | predicted protein |
Protein accession | XP_001416482 |
Protein GI | 145343768 |
COG category | [S] Function unknown |
COG ID | [COG3781] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.204381 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGCGACTCG CGCGCATCAT GCTGCGCCTC GCGCCTACCG TTCGCGCGTC GAGCGCGTCG AGCGCTTCGG CGCGCGTTCC ACGGCGCGTT CCACCGCGCG CCGCTCGCAT CGCGCGTCAC GCGAGCCCGC GCGCGCGCGT TTTCGTTCGC CGTGTCGCGA TTCCACGAGG TGTGCAAATC ACGCCGCCGC GCGCGCTCTC GAGCGACGAC GACGACGCGC GATCGTCGAG CGAGGCGATT CGTTCCGATG CCGTCGACGA CGCGGCGCCG GCGCCGCGCG CGAGCGACGC GATGGTGGAG CCGTGCGACG GCGACGACGG CGCCGGCGTC GTCAGCGGCG CCTCCAAGCC GAACGACTCG TACGCCATCG AAAAGCGGCG CGCGAACACG ACGGCGATCG TGGGCACGGG ACCGGTGGCG CAGACGCCGG CGGTGGCGCG GGATGAGCCG ACGAAATCGA CGCCCGCGCG CGCGGAGTCG ACGACGCGGC CGGTGTTGAG CGAGGAAGAA CTGTTGCCGC CGTCGAAGCG GGAACCGAGA GGGAAAGAGT ACAAAGAAGA CGGTGGAGAT AAGGAAGGGG GGGACGCCGC AGACGGTGAT GGGAAAGGAC CGAGCGGAAG CGCGACGAAG GGAGGCGCGA CGACGAAGAC GAAACCGAAA ACTGACGGAG AGAAGGAGAC GGGGGAGAAG AAGCTGGCGA CGCCGAAGAG CGCGCCGGTG GTGAGCGGAG GCGCGATAAG GTCCGCGGCG GGAATGGCCG CAGAATCTGG TGCTGATCCC GTGTTAGCGG GCGTGGGCGG TGGCGATGAT GGCGACGACG CGCGGTCCGG CGGCGGCGGG GGAGGAGGAG GAGGAGACGA TAAGAATGAC GGAGACGACG AGGACGAAGA GGAAGATGAC GACGACGAGG TCGTGCTTCC CGATCCCTGG TACAAACTCG CGTGGGAAGA AACCGAGCGC GAGATTGTTC GCATCATGCA AAGGTTGCTC AATATTTGGA TCACGCGCGA CCCGAGCAAG ACGACGCGAC TCGTTTTGTT CTCCGTCATG GGTACCGGTT TATTCTTAGC TTCGCTGCTA CTTTACCCCG AGAATCCTCT CGAGTTTAGC GATAATGGAT TGTTTAGCTC TGTGTTCGGT CGTAACCCTC GCTGGGTGAT CAATCGTGAA CTCGGTCCTG TGCAGCCGAC GTTGTTGAGC ACAACGTGCG CGTGTGCCGT GGCTTTCGCC AAGGTTCGTC TCGGTGCGAA CATGTTCAAG CTCTCACCCT TCATGCACGG CGTCCTGGGC CTCCCCATGG GTTTCATGTT AGTCTTTAGA TGGAACAACG CGCACGAGCG CTGGTGGTAC GGCCGCACGT GTCTCGGAAA CATTTTGTTT TACTGCAAAA ATCTTGGCGG CACGTTCTGC ACGTGGGTCG CGCCGGACGA CCCGATACTC GCCGCGCGAA CGCTGGGTCT GATTGGTGCT TTGAAAGAAA CCGTGGCTGA TCGATTGAAT GGTACAGTGC TGAACGACGG AGCGATTTTG AGTCAGCTCA CCACACCGCT TGACGCGGGA GACTTGGAAG GATTATTTCT CGCCGAGAAT AAGGTGTTGT ACTGCCTCGA AAAGATTCGC GGCTGCGTCC AAGAAGCATT CAACAAAGGT TACGTCCCTC CCGCCATCGC GAGCACGATC CACAGCGAGG TGGCGATGAT CATGGATAAC TACGGGAGTT GCGAAAAAGT CGTCAACCAA CCGCCTCCGG GTTGCATCAT CACCCACTTG AAGTCTACGC TCATGGTGTA CGTGTGCTCG TTACCGATGA TTTTAGTGCA TGAAGTCGGA GTATGGGGCG TCGTCCCCGT CACCACCATT CTTTCTCTCG CCCTGTTCGG GATTGAGGCT GCGGCTGAGC AAATCGAGCA GCCGTTTGGC AACAGACCGT ACGATTTGCC CGTGCGCGCG CTGATGAATA GCAACTCGCG AGACCTCGAG CAGACGAGCG AAAAGGTGAT CGGAATGACG GGCTTCGTAA ACGGGGTGAA GATGCCGTTC ATTCCAGAAG GCTCAACGCC GTCGAAGGAG GCGATGTCGG CGAAGACGTT GCCACAGACG TTGAAGACGT CTGAACCGCC GACGCCACCA CCGCCGCCGC CGGCCGCTTT CGTCACGCCC GCGCCTTCTG CGGCTGCGCT CAAGGCGGCG GCGGCGCCGC CGATTTCGGA GTCATCCGTC GTTATTCCGA CGGCGCGCAA GGACAACACA GTCTCGTTTT CGAGCGATCG CTTAGATGCC GCTTCCGTCG CGGCGCTGAG CAAGGAGAGT CCGTTCGCGT TCACATCGAA CGAGCCTGCA ACGAGTTCTG CTGAGCCGGC GAAACAGCCG CAGCCCCCGT CACCAAAGGT AGAGACCCGA GACGGTGAAG GCGTGCCTTC GCGTCCATCG ACGCCTACGG CACCGATGAC GCCCCCTCGA GGAAAGAGCG CGTTGCAAGC TGCGATGGAT GCGGTTTCAG AGACGGCGAA TTCGCGCGCG CCGGTGAACC CTTCGTCCTA CTTGTCGCCG TTACGAACGG GTCGGATGTC CGATGCATCG AGTTTATCGT CGTCGGCGCC GGCGTCGACC ACCAAGGGTG AGCTTCCGGC TTGGAAAGCG CAGAGCCCAA TTGAAATTTA CGATCAAGTG CTCGATGAGC GACCAAGCTC GACGATTCCG GACTCGCCTC GATACCACAC GCAAAATTCA TTGCGACGAC GAGACTCCTA CGGGCAGCAA TTCTTCGATA TGTTCACTCA GCGACCGCAA GCCAACGGCG CGTCCGAGCC GAACAGTGCA AAAGCTGGTT CGAACTTACA GCGATCGAGC TCGGTGGGCC TACCGCGATC GAGTAGCGTC AGCAACTTGT CGGGCCCGTC GTATCGAACG AACGCCGAGG GTGAACGCGA CATTCGCGAC ATCCCATTCT CGCGCAATGC ATTCGGACGA TCGCCGTCGA GATCGGATCT TATCTCCGAG GAAGACGGCG GCGTGCCCAT GCGACGCGTG GGCGCGGTGA ACCGCTCGAC GTCCGTCACG GACATGAGCG CTCTAGAGGA GGCCATCAAG AGAGCGCGCG AGGCGCGCAA GCGAACGTCC GGGGGCTCGG GCTCACCGTG AAAAGAAGGC ACACGATAGA AAGACGCACA GTAGAGCCTT CAAATGTATT GTATAAAA
|
Protein sequence | MVEPCDGDDG AGVVSGASKP NDSYAIEKRR ANTTAIVGTG PVAQTPAVAR DEPTKSTPAR AESTTRPVLS EEELLPPSKR EPRGKEYKED GGDKEGGDAA DGDGKGPSGS ATKGGATTKT KPKTDGEKET GEKKLATPKS APVVSGGAIR SAAGMAAESG ADPVLAGVGG GDDGDDARSG GGGGGGGGDD KNDGDDEDEE EDDDDEVVLP DPWYKLAWEE TEREIVRIMQ RLLNIWITRD PSKTTRLVLF SVMGTGLFLA SLLLYPENPL EFSDNGLFSS VFGRNPRWVI NRELGPVQPT LLSTTCACAV AFAKVRLGAN MFKLSPFMHG VLGLPMGFML VFRWNNAHER WWYGRTCLGN ILFYCKNLGG TFCTWVAPDD PILAARTLGL IGALKETVAD RLNGTVLNDG AILSQLTTPL DAGDLEGLFL AENKVLYCLE KIRGCVQEAF NKGYVPPAIA STIHSEVAMI MDNYGSCEKV VNQPPPGCII THLKSTLMVY VCSLPMILVH EVGVWGVVPV TTILSLALFG IEAAAEQIEQ PFGNRPYDLP VRALMNSNSR DLEQTSEKVI GMTGFVNGVK MPFIPEGSTP SKEAMSAKTL PQTLKTSEPP TPPPPPPAAF VTPAPSAAAL KAAAAPPISE SSVVIPTARK DNTVSFSSDR LDAASVAALS KESPFAFTSN EPATSSAEPA KQPQPPSPKV ETRDGEGVPS RPSTPTAPMT PPRGKSALQA AMDAVSETAN SRAPVNPSSY LSPLRTGRMS DASSLSSSAP ASTTKGELPA WKAQSPIEIY DQVLDERPSS TIPDSPRYHT QNSLRRRDSY GQQFFDMFTQ RPQANGASEP NSAKAGSNLQ RSSSVGLPRS SSVSNLSGPS YRTNAEGERD IRDIPFSRNA FGRSPSRSDL ISEEDGGVPM RRVGAVNRST SVTDMSALEE AIKRAREARK RTSGGSGSP
|
| |