Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32186 |
Symbol | |
ID | 5002493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | + |
Start bp | 410155 |
End bp | 412113 |
Gene Length | 1959 bp |
Protein Length | 610 aa |
Translation table | |
GC content | 59% |
IMG OID | 640417914 |
Product | predicted protein |
Protein accession | XP_001418243 |
Protein GI | 145347583 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.420674 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCGCGCGCA CGGCGATCGA CGCGCGCGAG TCTCGACGAG ACAAGAAAGA AACCAAGTCT CGAACGCACC GCTCGTCGCG CGTCGACGCG TCGACGCGTG GGTGTCGGCG GTCGCCGCGC GATCATGTCG GACGGCGAGC GCGCCGGATT GTTGGCGGAC GCCGAGACCC CGCAGACGAC GAAACACAAC TCGCGATGGA TCGCGAAGCG ATGGCGCGTC TTCGCCGCGC TCGGCGCGGG GGCGTGCTTG CTCGCGATGG GCGTGGCCAC GCGCGCGGCG CGCTCGAACG AAGCCGCGCT CGGCGCGAAC GAAGACCAAG TGAAGCGCGA GACGATCGAC GCGCGGCTCA AGTCGGTGTT TGGGATATCG TTAGAAGATT TCGCCGCGCT CGTGAGGGAT GAGAACGATG TGCAGATTGC GGGGAAGGAG TTGTACGAGA AGAAGAAGCA GTATAAGAGG CTGCAGCAGA TGGAGCGGGC GTGGGAAAAG ACGGAGGCGG CGGCGTTGAC GAAGGCGCGG GAGGAAAGAG AGCTCGATGC GGCGTCTGCG CCGCTGTCGA CGCCACAAAG CGATTTGATC AGCGATAGTG AGCCAGTGGA GGCACTCGTA GAGAGTGAAA CGCCTGTGGA GTTGGAATCA AGCAACAAAA CCGCTGACGC CGTTGGCGAG AGGGAGGCGC CAGCAGAAGA GTCGAGCGAC GGAGCCGCGG ATTCTATCGC CGTACTGAAG CACACGCTCG AAGTGCCGGT AAAGGTGCTC GCGAGAGAGG ATAAGCGACA TGCTCGGCGG AAGTTGCGCA GGGGTGTCGA GGCGCAACTG GGCGCCATCG AAGAGTACGA CGACAACGAG GAAATCGCCG AAGAATCGTC GGCGATTTTG CCCGTGTATT TCCATACGGA GAAGAGCGGT GGAACATCAC TCGTGTTGCA CACGCTAGAG CTCGTGAATT CGGACAACGA TGACGTGCTC GGTTTGATCA ATCGTGTGCG CAGCGAAGAC GTCATGCTTG ACAAAGATCT GCGTGCGAAA CATGCGTTGT GCCCGGGGAG CGCTATGTTT TTGACGACGG TGTGGAAGGC TGGGACGGTT TGGGAACCGG GACATCCGCG TCCGCTCGAG GATTCGACGC GGGAGAATTG GGAAAAATGT CGATTGTTGT CGAGTCATAC CGGTCGAGAA CTCTTGCGCC GAACAACAGA ACTTGAAGCG GAACTCGGAC TATCACGTCC AAAGATTCTC ATGGGCATGT TCCGCGACCC CGCGGAGTAT GAACAAGCAG CATGGCGAAG CGAGCTGTTC ATGTATCACG ATCTTCGCGC CAAACTTGGC TGGGGAAAGC TCGCGCAAAC CCCACTCGGA TCCGCGCTCA CTCAGGAAGA GCTGAAGGAC TTCGGAGCCG ACAGCAAGTT CGCCAAAATC ATGCTCGAGG ATCACTGCAA AGCCGGCTTG GATAAGAACT TCCAAACGAA GAAGTTACTC GAGGATAAGT GGACGTCCAT GAAGGACAAC CACGACGCCA TCATGGCGCT TGCGAAAGAG CGAGTGATGG AGCTTGATTG GGTCGGTTTG ACGCATCGCT TCGACGAAAG CGCGTGCGTG CTCGCGTATA CTTTGCGAAG ACAGCCGCTA TCGACGAGTG ATAGCAACTA CGACAGAGGA AGTTTGTTGC CGGCGACGCT CAGGGCCAAC CACCCGCACA GCGGCGATCA TAGTGAAGGA GCCATGGATC CAGGGCTGAA AGCGAAGCTG TACGAGTGCA ACGACTTAGA TGCCGCGGTT TTCAAGCTCG CAGAGACCCG CTTTGAGGCA AAGAAAACCG AGATGATGGA AACCTTACAA CGGGCAATAT CGGCGAGCGA ACAGCTGAGG CCGATTCGCG GCGGCGGCGA GCACCAGATG TTGGATCCGA AACCGTACAT CGATTGCATG CATCAAGCTG GGATGGGAGC AGCGTGATA
|
Protein sequence | MSDGERAGLL ADAETPQTTK HNSRWIAKRW RVFAALGAGA CLLAMGVATR AARSNEAALG ANEDQVKRET IDARLKSVFG ISLEDFAALV RDENDVQIAG KELYEKKKQY KRLQQMERAW EKTEAAALTK AREERELDAA SAPLSTPQSD LISDSEPVEA LVESETPVEL ESSNKTADAV GEREAPAEES SDGAADSIAV LKHTLEVPVK VLAREDKRHA RRKLRRGVEA QLGAIEEYDD NEEIAEESSA ILPVYFHTEK SGGTSLVLHT LELVNSDNDD VLGLINRVRS EDVMLDKDLR AKHALCPGSA MFLTTVWKAG TVWEPGHPRP LEDSTRENWE KCRLLSSHTG RELLRRTTEL EAELGLSRPK ILMGMFRDPA EYEQAAWRSE LFMYHDLRAK LGWGKLAQTP LGSALTQEEL KDFGADSKFA KIMLEDHCKA GLDKNFQTKK LLEDKWTSMK DNHDAIMALA KERVMELDWV GLTHRFDESA CVLAYTLRRQ PLSTSDSNYD RGSLLPATLR ANHPHSGDHS EGAMDPGLKA KLYECNDLDA AVFKLAETRF EAKKTEMMET LQRAISASEQ LRPIRGGGEH QMLDPKPYID CMHQAGMGAA
|
| |