Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_88612 |
Symbol | |
ID | 5004555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 331709 |
End bp | 333613 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | |
GC content | 55% |
IMG OID | 640419976 |
Product | predicted protein |
Protein accession | XP_001420477 |
Protein GI | 145352275 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.157382 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.137437 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACAGG CGTGGAAGGA CGTTTCCGCG GATGCGCTCG ATGATGCGGC GAAAATGGAG CTTGGAAAGA AGATGGAACA CGTGCCGGAG GAGTTGCGGC GGGTCGGCGC TAAGGCTGGG GGGGAGATGT TCGTGACGTT TGGCACGGCG AGCGTGCAGG ATTTCGTCTT CAATTGGGCG GCGGCGGCGA AAAAGTTGAG TTTAGAGCCG ATATTCGTCG GCGCTTTAGA CGAAGAGATG CACACACTAT GTGTTAAGGC TGGGATACCA TCCATGCTGC TCACGGGGCG GTCAGTGTTG GATAATAGGG ATCAAGAGTT CATCACGCAA AAGAGCAAGA CGTTTAAAAA GATGGGCACG GTGAAGACGA AGTTTATTCA AGATTTGCTC GAGCTTGGGA TAGCGCCGAT TCTGAGCGAC GCGGACGTGG TTTGGATGCG CGATCCGCGC GAGCTATTCA ACAACGGCAC TTACGCATAC GCGGACGTGC TGATATCGAG CGATTGCATC GACACCGTGA ACGATCGCGC CGACAACGCC AACTGTCGCA ACGTCAACTT TAACACGGGC ATCGTGCACA TTCGGCCCAC GGAGCCGGCG AAGGCGTTCG TGGAAAAGTG GAAGCAAAAA GTAGCGACGA GCGAGATCGC GTGGATGCGC GACCAGCCGG CGCTGAATTT GCTCGTGCGC GAAGGATCTC CCGCGCTCGC GCCCGCGGTG GCGGTTCCCG ATGACAAGCG CGGATTACCG GGGTATCGCT CGATCGTCTT CGCAGCAAAC AGCACGATTC GCATGGGCGT TTTGCCCATC GCGCAATTCT CCAACGGTCA TACTTTCTTC GTGCAAGAGC ACCACTTGTA CCACCCCGAG GACGGTGAGC CGTACGCGGT GCACACGACG TACCAATACG GCGATTCTGC GCGGTACGCG TACGGCAAGC GTCAAAGACT GCGACAGCAT GGTTTATGGT ATGCAGACGA TGACACAGAT TATTGGAAGC CAAAGAAGTA CTTGACCATC TCCACCAAAG GGTCGCAGAT GAAGTTCAAT GGTTCTCGGG CCATCGGCAT GGAAAACGAT GCGTATTTAA CCGCCATCAC GCGACACTTT GAAGAGGACA GATTGCGAAG AACGACGATT CGAAACGGCT TCGCCCTGGC GAAAGCGCTC GGACGAATCT TTGTGCTACC ACCCGCGCGA TGTTACTGCG ATAAAATATG GAACACGCTC GCCGGATGTC GCGCGCTCGG CGCCGAGACG GCGCATTTGC CGTACGCGTG CCCGATGGAT CACATCTACA ACCTCGAAGG CTTGCATGAC TTGGGCGTTG ATTTTCGCGA AGCAGGGTTC CTCGAAGACA TGCGACTTAA AGGGAACGTT CGAGAAGATG TCATTCACGT TAAAATTGGC GCGAAAGACG ACAAAAACAT GGCTGATGTC GTCATCGAGC GCGGTTTCTC AGCGAGTGAC GCCGTCGAGG CGCTGGAATC GTATAACGAT CACGGCGTCA TCATGATCGA CCAACTCGAT GAAGGCTCAT TTTGCGGTTT CGATGATAAA CAAAAGGACG AGGCATTTGA TACGGCGATC AACAACGCGC TCAACCACGA CCAGTACTTC TGCTTCAACG AAGCGTACGA CAAGCAAGGT CGACCTCGCA GCGGCGGGGG AAAGGATGGA AAAGAGTACG AGCCGCGAGT TGTGGAGCGG CACTGCGGCA TCGCGGAGGG AGAGCAATCT CGTCTCGCCA CTCGAGGCGT CGTGACAGAG GTATTGAAAG ATCCAATAAC ATGCTCGTGT GAGTGGGCTT ACAAATTACC CAAGGCGCTC GCCGACACGC GGTGCGCCGC ACAATCAAGA GACGAAGACG ACAGAACTAG GACAGGACTT GGTGATATTG AATAG
|
Protein sequence | MRQAWKDVSA DALDDAAKME LGKKMEHVPE ELRRVGAKAG GEMFVTFGTA SVQDFVFNWA AAAKKLSLEP IFVGALDEEM HTLCVKAGIP SMLLTGRSVL DNRDQEFITQ KSKTFKKMGT VKTKFIQDLL ELGIAPILSD ADVVWMRDPR ELFNNGTYAY ADVLISSDCI DTVNDRADNA NCRNVNFNTG IVHIRPTEPA KAFVEKWKQK VATSEIAWMR DQPALNLLVR EGSPALAPAV AVPDDKRGLP GYRSIVFAAN STIRMGVLPI AQFSNGHTFF VQEHHLYHPE DGEPYAVHTT YQYGDSARYA YGKRQRLRQH GLWYADDDTD YWKPKKYLTI STKGSQMKFN GSRAIGMEND AYLTAITRHF EEDRLRRTTI RNGFALAKAL GRIFVLPPAR CYCDKIWNTL AGCRALGAET AHLPYACPMD HIYNLEGLHD LGVDFREAGF LEDMRLKGNV REDVIHVKIG AKDDKNMADV VIERGFSASD AVEALESYND HGVIMIDQLD EGSFCGFDDK QKDEAFDTAI NNALNHDQYF CFNEAYDKQG RPRSGGGKDG KEYEPRVVER HCGIAEGEQS RLATRGVVTE VLKDPITCSC EWAYKLPKAL ADTRCAAQSR DEDDRTRTGL GDIE
|
| |