Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_94483 |
Symbol | |
ID | 5002556 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 261452 |
End bp | 263344 |
Gene Length | 1893 bp |
Protein Length | 571 aa |
Translation table | |
GC content | 51% |
IMG OID | 640417977 |
Product | predicted protein |
Protein accession | XP_001418424 |
Protein GI | 145347954 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.13386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTACCGC TCGTCCTCGC CGCGGCGGCG GTGTACAGGC ACGTGAGCGA AGAAGAGTGG GGTGCATCCA TTGCAAAGAC GTTTTATTGG TTGAATGACG TTCCTGGAGC GGACAGTACG GCAGAAGAGA ACTGGAAAAG TGTTGTCGTC GCACAACTGA TTGTGTTTTG TGGGATGTTC ACCTTTGCAA TTTTAATCGG CGTTGTGTCT GACGAAATCG CGAGCAAGGT TGACGAGGTA AAGACGGGGA ACAGTAAGGT GTTCGAACAG AATCATACCG TAATATTGAA CTGGAATGAG CAGTTGATTC CGTTACTGAA ACAAGTCGCC GTGGCAAAGA GCGAAGGGAT TGGTTTCGAA AGGCCGGTGG TTTTACTGGC TAACAGGGAC AAGGAGGAGA TGGACGCCAC CATTGAAGAT GAACTCCAGG ATAGTCCTCC GCTGACCGTC GTAACTCGCT CGGGTCAAGC TCATAATGCT GAAGACTTGG ATCGAGTGAA CGCTTGGGCG GCCGAGCGGG TCGTGGTGTT GCACGATGAC GGCGAGAACG AGGACACTGT CGAGAGCCAA AAGGCGGCGG CGGTGTTGAA TTTGCGTTCT GGCGGCGGCA TCACAGGTGG TCGAGGGCCA AACGTCATCG TTCAATCTCC CACTCGCAGA TCTGAAATCG ATGATGGTGT CGCTCTCGCG GTCGATTTGA CTGAGAAAGA AGGGAATAAA CATGGCGAAT TCTGCGTCGT CAACGGCACT GGAGAGTTGT CGAAACTCAA AGCTTACTCC ATCATGCAAC CAGGCGGAAG TAAATTATTT GAAGATCTCA TGCTTCAATC CAACGATAGT TCTGAGTTTT ACACGTATTC GCACCCAAGT CTAGCGGGTA AGACATTCCA GGAAGCCTGG AGAATGTTCA ACACGACTAC ACTGGTCGGA ATTACCAACG CAGAAGGCAT GATCCTTGGT CCGAGCGAGA CTGACGTCAT TGGACCAAGC GGGGCAGTCA CCGTCGTGGC AGACAACAAG TCCACGATCG AAGCAGATAT TGCGAAGCGG AAAGGGTCCA ATAAAAACGA GAATATTCCT CCTCCGGGAT CACAGCACTT GACAATGCTT AGATGCCCGG TCCGGATGCC GGCTCCCCGC AAAGTTGTCA TGCTCGGTTG GAACGAGGAG AGTTCTAGTG TGCTCGAAGA CATGCTAGTC TTAGCTCCAC CTGGCTCGAG TATAACCCTC ATCAACAATT ATGAGCTCGA TAAAAGTATT TTAAAAGGGA ACACGAATTG CAACGTGAAA CACGTGGCCA TGGACGCCCA GAAGAGAGCG ACGTTGGAAC AAAATCGTGT GCACGAAGCG GGAGCGGTTT TGATCATGCC CCCAACAGAT AGCGACGACG CAACGCAAGA CAGTCACGCA CTGTCATCAA TCATGCAAGT CGCGTATTTA TCAAAAAGAG CTGACACGGG TCACGCTCCT CATATCGTCT CTGAGTTGTC GAGTGAAGTG GCAAAGAGGG TGGCTGAGGA TATGTATGCT GGCATCGGTA CGGTTGACGT GATTTTACAC GACAATCTCA TAGGCGGCGC GCTTCTTCAA GTTTCTGCCA ACACCAAACT CGCTGGTTTG TTTGACTATC TGCTCGAAAA GCAGGGCAAA GAGCTGTACA TGCGCATGTA CAATGAATTT GTCACTGAAA ATGACGCCGA AGTGTACTGG GGAACGATTT GCGAGCGCGC TCGCGAACGG GACGAAATCG CGTTGGGCAT CATGCGCGCT GATGGTGAAC TCGCCATTTC TCCGCGAAAG GACAAACGAG TCAGACTGAA TCCGGGTGAT CAAGTCGTCG TGCTCGCGGA GGATTGGTGG ACGCCTACGA GCGTCAAAGC CAAGCAAAGT TAG
|
Protein sequence | MVPLVLAAAA VYRHVSEEEW GASIAKTFYW LNDVPGADST AEENWKSVVV AQLIVFCGMF TFAILIGVVS DEIASKVDEV KTGNSKVFEQ NHTVILNWNE QLIPLLKQVA VAKSEGIGFE RPVVLLANRD KEEMDATIED ELQDSPPLTV VTRSGQAHNA EDLDRVNAWA AERVVVLHDD GENEDTVESQ KAAAVLNLRS GGGITGGSKL FEDLMLQSND SSEFYTYSHP SLAGKTFQEA WRMFNTTTLV GITNAEGMIL GPSETDVIGP SGAVTVVADN KSTIEADIAK RKGSNKNENI PPPGSQHLTM LRCPVRMPAP RKVVMLGWNE ESSSVLEDML VLAPPGSSIT LINNYELDKS ILKGNTNCNV KHVAMDAQKR ATLEQNRVHE AGAVLIMPPT DSDDATQDSH ALSSIMQVAY LSKRADTGHA PHIVSELSSE VAKRVAEDMY AGIGTVDVIL HDNLIGGALL QVSANTKLAG LFDYLLEKQG KELYMRMYNE FVTENDAEVY WGTICERARE RDEIALGIMR ADGELAISPR KDKRVRLNPG DQVVVLAEDW WTPTSVKAKQ S
|
| |