Gene Information |
Locus tag | OSTLU_26273 |
Symbol | |
ID | 5004177 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | + |
Start bp | 526664 |
End bp | 528621 |
Gene Length | 1958 bp |
Protein Length | 601 aa |
Translation table | |
GC content | 60% |
IMG OID | 640419598 |
Product | predicted protein |
Protein accession | XP_001420028 |
Protein GI | 145351317 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00964514 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0789022 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACGA TGCAGACATC ACGGCGGTTC TTTCTCTTTT TGCTCATGTG CTGCGTCACG CGCGCGCGCG CCGATTTGAC AACTGAGAGC ACGTTTGAGC GCGTCGAGAG AGACGGAACT GAAGGGACTT TCGAACTGGT GGGAAAGTAC ACCATTAGAA ACGACGGCGC GAATGACGTG AACCTCGCCG GGCAAGCCCT GATGTTGCGA TTCCCAGGAT ATGTTCGCGT GAAGAACGAC GATGATCCAG CACAGTCGAC GTTGACGAGA GCGCTCCCGC GCGACTGGAT CTTCGAGTGT TTCTGGTCTT ACGTCGAAGG GCGATACAAC GGGACGAACG TGTGTCCCTC GATCGATTTC GTCGTTTCGT CGAATGTGGT GGAGGTACGA TTCGTGAACA CCTTGATTTT ATGTCCAGGA TGCACGCTGA GGGGGGACTC GTCGTACACG TCCTTCGTTC TTAAGCATCG TGCGTATTTC CCCATCTTTG AAAATGGCGA CATGCTCAAA TCGATGGGCA TGCGAACGTT TTTCGACGCA CCGCCACCGC CGCCGCCTCG AGGCAGGCGG GTGTGTTTCC CTCGCGAAGT TGATTTTTCA TTTAAAGTGT CAAGCTACCC GAGCCGTTTC GCGGCGGATG ACGAGTCGGC GTCCTCAGAC GACGCCGTCG TGCCGTTGGG CACGGCGGGC TTCGACGCGT TCTTACGCGT CTTCTTAGAC GTGCGCAATC GACAATCTTT CGATTACGAC ATGTCTTCAG TCGTCATCGT CGTCCCCTTT GATTGGAAAA TCTCTCCGAG CGAGAGCTCG GGCAAAATTC AGCAAACGCC GGATGATTTC TTCGCGCGAT GCCACGGGCA AGGCACGACG CTGTGCGACG TCGCTTACGT ACTCAAGGAA TCGTCGGGAT TCCAAATTCG ATTCAATCCA GGATTCACCC TCTGCCCTGG ATGCTCGCTG CGAGGCAAAG GTCCACAAGG CGCCGCATTC GAACTGTACT CGCCATTTCT TTTTCCGCTC GACATAGAAA GCGTTCGAGG CGCGTCGGCG TTTTGCGACG ACGCCCAACT GTAAAATCCA ACTGTACAAC GCCCGCGCGT CCGCACATTC CAGCGTCCAT GATCGACGTC GACGACCCCG TCGTGCGCAA ACACGTCGAA GAGAAAGCGC GCCTGCGCGA GCGCGCCGCC CGCGCGTCGT CGCGCGCGTC GTCCTCGCGC AGCGCCGGCG GCGCTTCGGG ACAGCGCGAG GTTCAACGCT GGATAAAACT CGACTTGGGG CGCCCCGTCC CGGCGACGCA CGACCCGTGG GCCGAGGGCG ACGCGTTCGA ACTGACGTTT TCGTGCGAAT CGGGCGGCGC GACGTCGGTC AGCGTGGGCG CGCTGAAAGC GGCGAACGGT GGTTGCTGGA GCGCGTTCGG GACGAGTTGG CACTGCGTCA CCGGGTGGAG TCGACTGGGC TTGGATTTTC GAGGCGTGGC GCTGGAAGTC GCGTTGGAGT GCGCGTTCGG CGAACGCGAC GTCGGTGGTT GGCGGTGTTT GTACCAGACG AGTGCGGATG GGTACACGGT TGGCGTGGAC GCGCGCGACT GCGAAGGCGC GTTTCTCGCC GTTACGGACG CCGACGGCGC GATGCTTTCG CGCGACCACG GTGGACCCAG GCTCGTGTTT CCCGCGTTGT ACGGCTGGAA GAGCGCGAAG TATTTATCTC GAATCGAGCT CAAGACGTCG TACGAGAATG GGTTTTGGGA AAACTTGGGG TGTCACGCTC GCGGTCGATG GCGCTACGAT GAGCGATGGA 
AGCCGGGGAC GTCCGCGCGA GTGTGGAACG TATTGGCGTG GATCACGGAT CGATACTACA TCGGTGGTGA ACGGGTGTGG ATATGGGTCA TGGTGTACGG CGGGCGGGCG TTGGGGATGG TCGCGTCTCT GTTCGCCAAG AAGAATAGGG TAGACTAA
|
Protein sequence | MATMQTSRRF FLFLLMCCVT RARADLTTES TFERVERDGT EGTFELVGKY TIRNDGANDV NLAGQALMLR FPGYVRVKND DDPAQSTLTR ALPRDWIFEC FWSYVEGRYN GTNVCPSIDF VVSSNVVEVR FVNTLILCPG CTLRGDSSYT SFVLKHRAYF PIFENGDMLK SMGMRTFFDA PPPPPPRGRR VCFPREVDFS FKVSSYPSRF AADDESASSD DAVVPLGTAG FDAFLRVFLD VRNRQSFDYD MSSVVIVVPF DWKISPSESS GKIQQTPDDF FARCHGQGTT LCDVAYVLKE SSGFQIRFNP GFTLCPGCSL RGKGPQGAAF ELYSPFLFPL DIESVRGASA FCDDAQLAGG ASGQREVQRW IKLDLGRPVP ATHDPWAEGD AFELTFSCES GGATSVSVGA LKAANGGCWS AFGTSWHCVT GWSRLGLDFR GVALEVALEC AFGERDVGGW RCLYQTSADG YTVGVDARDC EGAFLAVTDA DGAMLSRDHG GPRLVFPALY GWKSAKYLSR IELKTSYENG FWENLGCHAR GRWRYDERWK PGTSARVWNV LAWITDRYYI GGERVWIWVM VYGGRALGMV ASLFAKKNRV D
|
| |