Gene Information |
Locus tag | OSTLU_27044 |
Symbol | |
ID | 5005016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | + |
Start bp | 113309 |
End bp | 115540 |
Gene Length | 2232 bp |
Protein Length | 743 aa |
Translation table | |
GC content | 60% |
IMG OID | 640420437 |
Product | predicted protein |
Protein accession | XP_001420908 |
Protein GI | 145353196 |
COG category | |
COG ID | |
TIGRFAM ID | |
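The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for OSTLU_27044.
length = gene_length_bp(113309, 115540)
print(length)                     # 2232
print(protein_length_aa(length))  # 743
```

Both results agree with the Gene Length (2232 bp) and Protein Length (743 aa) fields listed in the record.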
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.0342016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00159724 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGCCCGCGC GTCCGTGGCG CCCGGCCGAG CACACACGGC AGCGAGCGAT TTTTCACGAC CTCGGGAACG CGCCGTATCG CCGACGGGAG TCGTTCGACG CGGAGAAGTT GGCGGATGAG TTGGCGAATT TGCGTCGGGC GAATCGACAT CGGTCGGATG ACGACGGCGG CGACGACGGC GGCGGCGACG ATGGGCTCAA GCGCGAGCGC GCGTTGATGA CGAATGAACG CGTGCGAGCG TTGGAGACGC GCGCGAGGGA CGCGAGCGAG GTCGCGCGAA GGGAGACGGA AAAGCGAGAG ATGCAGGAGC GGGCGTTGGA GCGCGCGAAC GCGAACGCGC GAAGAGGCGA GGTGAGCGAG CGGTTGAAGC GACACGCGGA GCGCGAGGTA TTGGAGTTGA CGCGAGAGTT AGAGACGGTG CGGATGAAGG CGGACGAGGA GCGAGACGCG TTGGAGCGGT GCGTGGAAGA GATGAAGGCG ACGCACGAGT CGAGCCTGGC GAAGGCGCAG ACGGCGGAGG CACAGGCGGC GGGAGAGGCG AAGGAGTTGC TCGAGGTCAA GGCACGATTG GAGGAGGAGC GCGATGCGAC GACGAAAAAG AACGCTGCGC TCGAGGGTGA GTTGGCGAGC GTTGGGACGT GTTTGAAAGA GACTGAGGCG TCGTTGAAAG CGGTGAGCGA GGCCAAGGCG AGCGCGGACG ATGAGCTTAG CGAAGCGCGC GAGGAGATGG CGCGCTTGCG AGACGAATTG GAACGAACGG CGGAGTCAAA TAGGTCAAAC GAGCGGGAAC TACGTGAGTC GCGACGAAAG GCTGAGGCTT TCGCCGCAGC CGTGCACAGC GCGAGAGGAA ACGATAGCGC TGCGCGAGCG ATGATGCGTG ATAGCGTCCT CACGTTCGCT CGATTGGAAG ATGCGGCGAA CTCTCGGGAG ATACAAATTT TGAAGGAGCA TCACGCGGCT ATAGTCGAAC GCATGCAAGA AGAATTCGCC GTCGCTTTAG CGGATGTCAA AGCGAACACG GGCGCTGTAG CGCACAGTAC GGTGGATGAG TCGCAGCGAG TGAAGCGCAC CATGAAGGGC ATGGAAGAGG ACGCGCGTGA GGCGGTGTGC AACGCCAGAC GCCGGGAAGA AGAAGCAAAC GCCGTCGCCG CCAAGCTTCG CGAAGAATTG GTGATGACCA AAGCTAGGTT GAACTCGATG ATAGAAGCCG GGGAAGGGAA ACTGAATCAT GCGAGCATTC GAGACGCGCA GGCCTCGCGC TTGGCGGCTG CGCAAGCACA GGCGATATGT CGCGACTTGC AAGCCGAGCT CGAGGCAGAA CATGAATCAG CCGAAGCAGC ACGACACGAG GCTTCGGAAA CCGCGCACAA GTACTTCGCA CTTCGAGACA AGTACGAGCA ACGCGAAGCG CTGGTTAATC ATCAAAAAGA AACCATCTCG GCGCTCAAGC GCGATAACGA ACGAAGTGGC GGAGAGATCG ACGAACTCAA GGCGCGATGC AAAGAGCTCA AGCTGTGCTT GCGAGACAAC GAGATGACGT ATTTGAACAG GCAAGATCTC ACCGAGGAAG TCGAGCACTT GAATAGGATT ATCGCCGACG CAAATGTCCG GGATCGCGAA CGAGCCATCG AACTTCAGCG CGAACAAGAG CGTGCGGATA ATTACGCCGA GCGACTCGAG CGGTGCGAGG CCAAATTGAA CGAAGAAACG AATCGAAGGA TTCAAATCGA GCGCACGGCG TGTGATGCCG ATAGCACGAA AGTCCGAGCC GATAGCGATG CGCTGCGTGC CGAGGCGATG 
ACGGCGAGCG CCGTCCTACT CGAAGCCGAG GTTTTACGGC TCGCTCGAAT CAACGAGCAA GCCGAAACGC GCGTCGCCGA GCTTGAAGGG CGCGTCGAAC AGCTCATCGA ACAAGCCGAC GCTCTAGCTT CGTCGCAGAA TCCAAATCAA AGCATACGAT ATCTGGAAAA GCTTCGGGAC GAGCGAGAGA TGGCTGAGAA AGACGCCGAA GAGGCGGGTA AAGCCGTGCA AAACATGAAG GCGGCGCTGC AGTTCGTCGT GTGCGCGCGA CGAGAAACTC GAGACAAGGT GGTGCAGTAC GCGCGCGAAG CTCGCAAGAC TGGTCGAATC TACACCGAAG CGCCGGGCTT GCCTCAGGGC GTGAGCGCCG ATATCTGGGC TCGAGTCGTC GAAACCGTCG CTCGCCTGAA TTTAGACGCC GTCGAGTGCT GA
Protein sequence | MPARPWRPAE HTRQRAIFHD LGNAPYRRRE SFDAEKLADE LANLRRANRH RSDDDGGDDG GGDDGLKRER ALMTNERVRA LETRARDASE VARRETEKRE MQERALERAN ANARRGEVSE RLKRHAEREV LELTRELETV RMKADEERDA LERCVEEMKA THESSLAKAQ TAEAQAAGEA KELLEVKARL EEERDATTKK NAALEGELAS VGTCLKETEA SLKAVSEAKA SADDELSEAR EEMARLRDEL ERTAESNRSN ERELRESRRK AEAFAAAVHS ARGNDSAARA MMRDSVLTFA RLEDAANSRE IQILKEHHAA IVERMQEEFA VALADVKANT GAVAHSTVDE SQRVKRTMKG MEEDAREAVC NARRREEEAN AVAAKLREEL VMTKARLNSM IEAGEGKLNH ASIRDAQASR LAAAQAQAIC RDLQAELEAE HESAEAARHE ASETAHKYFA LRDKYEQREA LVNHQKETIS ALKRDNERSG GEIDELKARC KELKLCLRDN EMTYLNRQDL TEEVEHLNRI IADANVRDRE RAIELQREQE RADNYAERLE RCEAKLNEET NRRIQIERTA CDADSTKVRA DSDALRAEAM TASAVLLEAE VLRLARINEQ AETRVAELEG RVEQLIEQAD ALASSQNPNQ SIRYLEKLRD EREMAEKDAE EAGKAVQNMK AALQFVVCAR RETRDKVVQY AREARKTGRI YTEAPGLPQG VSADIWARVV ETVARLNLDA VEC