Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32696 |
Symbol | |
ID | 5003040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | - |
Start bp | 492247 |
End bp | 493878 |
Gene Length | 1632 bp |
Protein Length | 531 aa |
Translation table | |
GC content | 51% |
IMG OID | 640418461 |
Product | predicted protein |
Protein accession | XP_001418951 |
Protein GI | 145349045 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.144212 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0707375 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAAGT CTCTTAGCAT CCAAGTCCCC GCGCGAACGC CCCTGGGCGA ACGCTCAGGG AACGAAAACA TCAGCGCGGC GTCGAAATTC CCGTCCAGTA AGCTGAAGTT GGACGAGTTA TTTTTAAACT GGTTATCCAT GGCGGAGAGC CAAAACTTGG TGTACGAACT GTTGAAGGAT GCGAAGGCCG GGAAACCGCT GCGACAGCCG AAGAGCGGTG CGATGCACGC AAACGTCGCT GCAGCGATAG GAACGCCACC GAGGAGCCCG CAGAAAGGGT CGAGGTACGG TACGAGCGCG TTTTCGCCTA CGAGGCGACC GCTGTCGAGG CAGGCGTCTT TGTTCACTCG AGCGACGAAT GAGCCACACT CGATTCCGAC GTTTTACAAG CCAGGCGGTG AAGGCTTGAG CGAAGAGGTG AACGCAGGAA AGGTCGCGTT GGCGGAGCGC ATGTTTGAAA GACACTTGAC TGGGATGAAT TTAGAGGCAT TCGCGGGAGT GGTTCGAGAC GTCGTCGGAT TGCCGCGATA TTTCGCCAAA CGTGTGATGA AGCTCGTGGC CGGGGCGAAT GCGGATGTCG TCACGCGTGA GCAATGGTTT AGTTATTGGA ATTCTACGCT TCGTCGGCAG AAGGATGTGA GTTCAGCCAT GTTTGAAATT TTGCGACGAC CAAACGCTCG AGCGTTGGAA CACGCAGACT TTACCGAGGT GCTCACGGAG ATGACGCAAA CGCATCCGGG CTTGGATTTT TTGAAGACCA CTCGCGAATT CCAGGAACGT TACGTAGAAA CGGTGATTTA TCGTATATTT TACGAGTGCA ACACGACCTG GAACGGGCGT TTGACGCTGC GAGAGTTGAG AAAATCAGAC TTGCTTGAGC ACATGTTGCT CGCCGAAGAA GAGGAAGACA TCAACCGCGT GTTGAAGTAC TTTTCGTACG AGCATTTCTA CGTCATCTAC TGCAAGTTTT GGGAACTCGA TACCGACCAT GACTTTTTCA TCAATCGTGA AGACTTATTG CATTACGGTA ATCACGCGTT GACGTATAGA ATCGTGGCGA GGATATTTGA CCAGGCGGGG AGGCCGTTCA AATCAGACGT GCCCGGGAAG ATGAGCTACG AAGACTTTGT TTGGTTCATT TTGAGTGAAG AAAACAAGAA CCATCCGCTA GCGCTAGATT ACTGGTTCAA ATGCATCGAT ACTCATCACG ACGGCGTCAT CACGCGAGAT GAGATATACT ACTTTTATGA GGAACAGATT CAACGCATGG AATGCCTGGC GCAAGAACCC GTCCTGTTCG AGGATATTTT GTGTCAAATG ATGGATATGC TCAAGCCCGA AGTCGACGCG AGAGTGACTC TGAACGACTT ACGATCGAGC AAGATGAGTG GCAACTTCTT CAACGTTCTC TTCAACATGA ACAAGTTCAT CGCATTTGAA ACGAGGGATC CGTTCTTGAT GCGACAAGAG CGCGAAGAAC CGCACTTGAC GGAATGGGAC CGCTTCGCTC GCGGAGAGTA CCTCCGGCTG AGCATGGAGG AAGACGATGA GATGGACCAC GCGAGCGATG TGGTGTGGGA AGAATCCCCT ATATGAATAA TGATTAGAGA TACTGTACAC TGTAACGATA GC
|
Protein sequence | MSKSLSIQVP ARTPLGERSG NENISAASKF PSSKLKLDEL FLNWLSMAES QNLVYELLKD AKAGKPLRQP KSGAMHANVA AAIGTPPRSP QKGSRYGTSA FSPTRRPLSR QASLFTRATN EPHSIPTFYK PGGEGLSEEV NAGKVALAER MFERHLTGMN LEAFAGVVRD VVGLPRYFAK RVMKLVAGAN ADVVTREQWF SYWNSTLRRQ KDVSSAMFEI LRRPNARALE HADFTEVLTE MTQTHPGLDF LKTTREFQER YVETVIYRIF YECNTTWNGR LTLRELRKSD LLEHMLLAEE EEDINRVLKY FSYEHFYVIY CKFWELDTDH DFFINREDLL HYGNHALTYR IVARIFDQAG RPFKSDVPGK MSYEDFVWFI LSEENKNHPL ALDYWFKCID THHDGVITRD EIYYFYEEQI QRMECLAQEP VLFEDILCQM MDMLKPEVDA RVTLNDLRSS KMSGNFFNVL FNMNKFIAFE TRDPFLMRQE REEPHLTEWD RFARGEYLRL SMEEDDEMDH ASDVVWEESP I
|
| |