Gene Information |
Locus tag | OSTLU_30883 |
Symbol | |
ID | 5001117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 923318 |
End bp | 925897 |
Gene Length | 2580 bp |
Protein Length | 637 aa |
Translation table | |
GC content | 58% |
IMG OID | 640416538 |
Product | predicted protein |
Protein accession | XP_001416798 |
Protein GI | 145344561 |
COG category | |
COG ID | |
TIGRFAM ID | |
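The reported gene length follows from the start and end coordinates if both are read as 1-based, inclusive positions. That convention is an assumption here, though it matches the numbers in this record. A minimal sketch of the arithmetic, with `gene_length` as a hypothetical helper name:

```python
# Coordinates copied from the record above. They are assumed to be
# 1-based and inclusive, which is consistent with the reported length.
START_BP = 923318
END_BP = 925897


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end positions."""
    if end_bp < start_bp:
        raise ValueError("end position must not be smaller than start position")
    return end_bp - start_bp + 1


length = gene_length(START_BP, END_BP)
print(length)  # 2580
```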
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.305439 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGGCG GTGTTGCGGT GGCGTCGGGA GAGATCCGGC ACGGCGATAG GGTCTTCAGT GGGAACTCTG GCCCCGCGCA GCGCGAACAA CCCATAAACA AAGAACTCAT CGCGCACTTC AAAAGCAAGG GAGACAGTGT GTCCTGGGGC ATCGCTGCAA GCATAGAGAA CTACGAGGAG GATACGAAGA GGCCAGTGTT GTCGGGCGCA TGCATCGCGT CGCTCTACGA CCAAGAGACC GAGCGCGAGT TGGCAGAGGA GGTGGAGAAC AGAATTGAGC GGGTGCTAAG CGACGACGCA GCCTTGTTGA AAATACGTCG ATATTTGAAG ACCTTCCACG CGGACGAGCG CGAATGGGAT GTGAGAACGT GTGCAGCGGC GGCTCGAGGG GGCAACTTGG AGTGTTTGAA ATATCTGCAC GAGCACGGGT GCGCGTGGGA TAAGAGTGCG TGTGAAGCGG CGGCTCAAGG TGACCACTTG GAGTGTTTGA AATATCTGCA TGCGAACGGG TGCACATGCG ATGAGAGTGC GTGTTTAGCA GCGGCTGAAG GTGGTCACTT GGAGTGTTTG AAATATCTGC ACGAGCACGG GTACCCATGG AGTGAGTGGG TGAGTTTAGC AGCGGCCGAA GGCGGTCACT TGGAGTGTTT GAAATATCTG CACGAGCACG GGTGCGTATG GAATAAGAAA ACGTGTCGAG CAGCGGCTCG AGGTGGCGAC TTGGAGTGTT TGAAATATCT GCACGAGCAC GGGTGCGCGT GGGATGAGAG TGCGTGTGAA GCGGCGGCTC GATGTGGTCA TTTGGAGTGT TTGAAATATC TGCATGAGCA CGGGTGCCCG TGCGATGAGA GTGCGTGTGA AGCGGCGGCT AGCTGTGGCG ACTTGGAGTT TTTGAAAGAC CTGCACGAGC ACGGGTGCCC ATGGGATGAG AGTGCGTGTG AAGCGGCGGC TCTAGGTGGT CACTTGGAGT GTTTGAAATA TCTGCACGAG CACGGGTGCG TATGGAATAA GAAAACGTGT CGAGCAGCGG CTCGAGGTGG CGACTTGGAG TGTTTGAAAT ATCTGCACGA GCACGGGTGC GCGTGGGATA AGAGTGCGTG TTCTGCAGCG GCTGGACGCG GTCACTTGGA GTGTTTGAAA TATCTGCACG AGCACGGGTA CCCATGGAGT GAGTGGGTGA GTTTAGCAGC GGCCGAAGGC GGTCACTTGG AGTGTTTGAA ATATCTGCAC GAGCACGGGT GCGTATGGAA TAAGAAAACG TGTCGAGCAG CGGCTCGAGG TGGCGACTTG GAGTGTTTGA AATATCTGCA CGAGCACGGG TGCCCGTGCG ATGAGAGTGC GTGTGAAGCG GCGGCTAGCT GTGGCGACTT GGAGTTTTTG AAAGACCTGC ACGAGCACGG GTGCCCATGG GATGAGAGTG CGTGTGAAGC GGCGGCTCTA GGTGGTCACT TGGAGTGTTT GAAATATCTG CACGAGCACG GGTGCGTATG GAATAAGAAA ACGTGTCGAG CAGCGGCTCG AGGTGGCGAC TTGGAGTGTT TGAAATATCT GCACGAGCAC GGGTGCCCGT GCGATGAGAG TGCGTGTGAA GCGGCGGCTA GCTGTGGCGA CTTGGAGTGT TTGAAATATC TGCGCGAGCA CGGGTGCCCG TGCGATGAGA GTGCGTGTGA AGCGGCGGCT CTAGGTGGTC ACTTGGAGTT TTTGAAAGAC CTGCATGAGC ACGGGTGCGC ATGGGATAAG AGTGCGTGTT CTGCAGCGGC TCGAGGTGGT CACTCGGAGT GCTTGAAGTA CTTGCACACG 
CACGGGTGCC CCGGGAATAA GCGTGCGTCT GAAGCGGCGG CTGGAGGTGA CGTCATGGAG TGTTTGAGTT TGAAGTACTT GCATAGATCG CTCAAACGAA AGATTTACAG GTAGCGGAAA AATTACGCGA ATTGAAATCA AATACCTACG AAATCGGCGT ACGCCGGACA GAGCCTACGA CAGTGGCGAA ATAAACGCAT CGCTTCCATG GCGTCGCCCG AGGTCTCCGC GTCCACGGCG CGCTTGTGCA GGGCGTACGC CTCGGCGCCG ACGGCGTCGG CGAACGCCGG CGGCTCGCGA GCGTACATGA CGTCGGTGCG AATGATGAAT TTCACCGCGC CCTCGCCCAC GGGCGCGCCC TCGTGCGGAA TGTCTTGTCG AAACACCAGA GCAGTGCCTC GCGCGCACGG CGCGGCGTCG GCGATCCATT CATCCGGCCA ACGGTATCGA TCGTCCTCGT CGCGCACGAA TTTCGACGGC TCAAAGTCGC TCGGCGGCGC GAACATCGAC GTCTCCCCAC CTCGCGCGCA ATCGTTCAGG TACACCAAGA CGCTATAAAA CGAACGCGTA TTGAAATCTT CGATCGTCGC CCCGTCGGTG TGCGGTGAAA AGTGCCCGCG TCCAGAGTAT CGATTAAACA ACAAGTGTTC GTTCACGCCT CGCGCCACCC ATCGTCCGCG CGCCCCCGCG CCGTGCGTCT CATCCTCCTC AATCGTGATC TCCGGCGTCG CGTGCGGTTT CAATCGCGCC CACAGCGCTT CCGCGACCGT CTTCGACATC
|
Protein sequence | MAGGVAVASG EIRHGDRVFS GNSGPAQREQ PINKELIAHF KSKGDSVSWG IAASIENYEE DTKRPVLSGA CIASLYDQET ERELAEEVEN RIERVLSDDA ALLKIRRYLK TFHADEREWD VRTCAAAARG GNLECLKYLH EHGCAWDKSA CEAAAQGDHL ECLKYLHANG CTCDESACLA AAEGGHLECL KYLHEHGYPW SEWVSLAAAE GGHLECLKYL HEHGCVWNKK TCRAAARGGD LECLKYLHEH GCAWDESACE AAARCGHLEC LKYLHEHGCP CDESACEAAA SCGDLEFLKD LHEHGCPWDE SACEAAALGG HLECLKYLHE HGCVWNKKTC RAAARGGDLE CLKYLHEHGC AWDKSACSAA AGRGHLECLK YLHEHGYPWS EWVSLAAAEG GHLECLKYLH EHGCVWNKKT CRAAARGGDL ECLKYLHEHG CPCDESACEA AASCGDLEFL KDLHEHGCPW DESACEAAAL GGHLECLKYL HEHGCVWNKK TCRAAARGGD LECLKYLHEH GCPCDESACE AAASCGDLEC LKYLREHGCP CDESACEAAA LGGHLEFLKD LHEHGCAWDK SACSAAARGG HSECLKYLHT HGCPGNKRAS EAAAGGDVME CLSLKYLHRS LKRKIYR
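The start of the protein sequence can be reproduced from the start of the gene sequence by codon translation. The record leaves the translation table field blank, so the sketch below assumes the standard genetic code (NCBI table 1). `translate` and `CODON_TABLE` are illustrative names, not part of any database tool.

```python
# Standard genetic code (NCBI translation table 1), assumed because the
# record does not state a translation table. Codons are ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Characters other than A, C, G, T raise KeyError in this simple sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk over complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
prefix = "ATGGCAGGCG GTGTTGCGGT GGCGTCGGGA"
print(translate(prefix))  # MAGGVAVASG
```

The output matches the first ten residues of the protein sequence listed above.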