Gene Information |
Locus tag | OSTLU_92478 |
Symbol | |
ID | 5000945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 691814 |
End bp | 694924 |
Gene Length | 3111 bp |
Protein Length | 1036 aa |
Translation table | |
GC content | 49% |
IMG OID | 640416366 |
Product | predicted protein |
Protein accession | XP_001417023 |
Protein GI | 145345023 |
COG category | |
COG ID | |
TIGRFAM ID | |
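The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes a terminal stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 691814
end_bp = 694924
gene_length_bp = 3111
protein_length_aa = 1036


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_residues(cds_length: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coding_residues(gene_length_bp) == protein_length_aa
```

Strand orientation does not affect either check, since the span length is the same on both strands.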
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGTC GTTCTGAAGA CCCGGCGTGG GTACCACCGC CGCGAATCGT CATGGTGAAG GCTTTCCCGG AGGAGGACGT CGTGCTCGTG AAGCCCGAGC TCTTCGCGGA GTGCTCGCGA AGACGCTACC TCGCCCTGAG CGATGGTGTA AAGTCCCGCT TCGCCCAAGC GCACGCGAGC TCGCGAACGA AGTATCGTAA GCTCTGGCGG CGGTTGTTGG TTCACATGCA AACGTCGCTG GCGATGGCGA AGGCGCTGAC AAAGTTGGAG TTGTTGCGAA CGAATACGGC GCGCGGACAC AGATTGGACG TAGCTGGATT GGCTCTCATT GCGATGTACG ATAATTACAC GAAGAAGATT GATCAGATGT TGAGCAAGCA GATGCAGGAG TCGCTACAAA AGGCGATGGA TTCGAAAATG GAGAAAGATA AAGTGATTGC ACAATCGCTG CTCGAACTCG AAACGAACGA TGAGAAGAGT GGCGAAGATT TCAATCCGAT GGCTATCATG CGAGCTTTGC GTAAGCATAG CGCGCAGAAA CCAACAAAAA AACAGTTTCA AGGCTCTGAA GAAGAGATTG GCGTTGAGCA CACGCGTTTG TTGTTCCTCG TGTCGGTGTA CACGGCGGAC GCGAACAGAG ATGCGTTTCC AGGCTTGGCC GAAGACAGCG AGCTTTGGGT ACGAAAGACC CAGCTCTTGG TTTTGATTTA TGAATGTATT CGTGCTGGTG CGCTGAATTA CGATTACGCA CCTTTGGCCG AGACCATGGG ATCTAAGCGC GTTTGGTTGA ATATTTCTCA AGAAGGCGTC GACGACCTCG ACGATATGTG TCAAGTTGGA TTCTTATCGA GCATGAAGAT GAGTTCGACG AAGTATAGCA CCTCTACGGC TTACCGATTG ACAAAAGAGG GATATTTACA CCTCAAAACG CACCTCCGTC GTCGTGATAG AGCCGCCATC GAAGAAGTCG TGTACTCCGA TAAATTGCAG CCTTCACCAA GAAACCTGTT TGTCGCAAAA TGGGACGCCA AAGCGGATAC TTTTTATCTC CAAAGCGCTT CAGGTCACAC GAAATCAAGC GATGTCACCG ATATCGAAGA GGTTTCGTAC GTTTCATCTC CGTTCGTTCC GAAGTCGATG CGAAAGTGGG GGCGTGAGTG CACTAGTAAC AAACACAAGA CCGCCGCACT CGTCAAAGCA ACGGGCACCA TCAGAGACGA GCTGGATGAG CAGTTGTCAT TCGATAGATT GCGATTGATG GTTGGTGAAT GGATTCCTAT GGGCGCGAAT CAGGTGTTGA GTCTGAATGA CAAGCTAGGT TCGACGGACC GCGTTGCTGG CGGCTACTTT ACGAGCGAAA TGGACAAAGA TCCCAACAAC CCATGCTTCC AAGGCAAAGT CGATGGTTTG ACGCGGGTCA ACGTGCTAGA TTTCGAAGAA ACGTCGTACG TGAACTTCGA AGCCGAGGTA CAGTACGAAG AAGAACCTGG AATTGTGCAA ATTGAAAACT TCGGCATCCA CGTGAGTGAA GAAGGCTTCA TGCTTTACGG CTTGACGCTC GACGGTATGA TGAAAGTGAC CGATGGCAAC AACTTTTCGC TCGATCACCT CGCACGTTTG CTTCGTGATA TCGGCACGGA TTCAAGCGAA GTCATTGGTA ATTTACTCAC CGACCATCAG CGACACTTGC TGGATTTAGT GCACATGGGC GATGCGATGA ATCGCGAAAA GTTCAACGTG TTCTTCACGA GTAGAATCAA CAAGAGAGGT CAAGAAGAGA TGCCCATGGC GCATGAGCTG 
CTTGATATGG AAGACATGGA AAATGAGATT CGGCAGATCA TCGGTGAAGT AGAGTGTGGT TTTCAACTAT CGCGAGATGA TGAATTGATC ATCATCGGCT CAACCGGTAT GATTCTCTGC TCAAAGAATA CCGAAAAATT TGAACCCTTG GTATTACAAT ACATGTCCAT GATGTCACGA AATATGTTCA TTCAGGCGCT CTATAGGCAA ACATTTGTCA CCGTGGACAC GCTCGGAGAA ATCGATCATC TCATACGCAA TCATGACGCC GATCCAAACA ATATCTTCAA AATTCGCGAA CTCATGTCAA ATGTATCGGC AGATATCATT CTCATGCGGG AAATTCATAG CTATCTCCTT GAATCTCTCA CTGAGACGGC ACCAATGGCA ATAAACGATC AAGTATTGAA GCGCTTGTCT AAAATACTGC AGCTCGATGA TACAAACTTT CGCCTTGAGC GACGCATTCG AGATATTCGC AAAAGCTTAG ACGGCGCGAG CGGTGAGTTG CAAGCGTTGA AAAGTGCCGC CGATGTCATT CAAGAAAACA AAGAGTTCAA GGTGAACGAG GCGGTGTCAA ACAACACGCA GAACCTTGAA GAAGTTTTCC GCGCAAATGA GCGCGCATCG ACGTCCCTTG AGATCATGCA AGTAGTGTTG GCTGGATCTC TCGCTTTTGC AATTCTTGAT CGATTGCACG GTTTGTATCT CGGCGTTGCG GCTGACATCG ACTGGTCGGT CAAGGCTTTC GATTGGTACG TACAGACTCC AATGGTCATG TTCATCTTAA ATATGCTCTG GTGGTTCGCA TTGGGTGCGT CATTCAATCG ATTGATCAAG TATGCCGGAT CAAAATCAGC GGGAATCTTG TCCATTCGGT ACACGATGAA CTGTCGATTT AATCAGAAAG CCATGACTGC GTTCTTGCAT GTGGCGAACC CAGAAATGGA GGATGGTGAA GCTGATGCGA GGACAAATCT GAAAAAATTC ACTTGGGACG AGACAGATGA GATCCGATGG AAAGGATGCC CTCCAAAGAT TGAAATGATC GTCGACATGA AGAATGGTTT CTTGCTCAGC GTTTTCATTC AAATCGCCAC CAGGCGAAGT AAATGTACGC AGTCTGACGC AAAGAGACAT TTTTTTGCTC GACTCCGAGA ATTAGGTCTC ATTTCTGGTC CCGTGCCTGG GTTAGAGACG GCAAAGGATG CCGAATACGT CTACCGTAAG CCATTTCTCT CACGTGGAGC GAAATTCAAA CTATGGTTGA AGAAGACACG TGAGAATGTT CACTACTTCT TCACGTTCTA G
|
Protein sequence | MARRSEDPAW VPPPRIVMVK AFPEEDVVLV KPELFAECSR RRYLALSDGV KSRFAQAHAS SRTKYRKLWR RLLVHMQTSL AMAKALTKLE LLRTNTARGH RLDVAGLALI AMYDNYTKKI DQMLSKQMQE SLQKAMDSKM EKDKVIAQSL LELETNDEKS GEDFNPMAIM RALRKHSAQK PTKKQFQGSE EEIGVEHTRL LFLVSVYTAD ANRDAFPGLA EDSELWVRKT QLLVLIYECI RAGALNYDYA PLAETMGSKR VWLNISQEGV DDLDDMCQVG FLSSMKMSST KYSTSTAYRL TKEGYLHLKT HLRRRDRAAI EEVVYSDKLQ PSPRNLFVAK WDAKADTFYL QSASGHTKSS DVTDIEEVSY VSSPFVPKSM RKWGRECTSN KHKTAALVKA TGTIRDELDE QLSFDRLRLM VGEWIPMGAN QVLSLNDKLG STDRVAGGYF TSEMDKDPNN PCFQGKVDGL TRVNVLDFEE TSYVNFEAEV QYEEEPGIVQ IENFGIHVSE EGFMLYGLTL DGMMKVTDGN NFSLDHLARL LRDIGTDSSE VIGNLLTDHQ RHLLDLVHMG DAMNREKFNV FFTSRINKRG QEEMPMAHEL LDMEDMENEI RQIIGEVECG FQLSRDDELI IIGSTGMILC SKNTEKFEPL VLQYMSMMSR NMFIQALYRQ TFVTVDTLGE IDHLIRNHDA DPNNIFKIRE LMSNVSADII LMREIHSYLL ESLTETAPMA INDQVLKRLS KILQLDDTNF RLERRIRDIR KSLDGASGEL QALKSAADVI QENKEFKVNE AVSNNTQNLE EVFRANERAS TSLEIMQVVL AGSLAFAILD RLHGLYLGVA ADIDWSVKAF DWYVQTPMVM FILNMLWWFA LGASFNRLIK YAGSKSAGIL SIRYTMNCRF NQKAMTAFLH VANPEMEDGE ADARTNLKKF TWDETDEIRW KGCPPKIEMI VDMKNGFLLS VFIQIATRRS KCTQSDAKRH FFARLRELGL ISGPVPGLET AKDAEYVYRK PFLSRGAKFK LWLKKTRENV HYFFTF
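The relationship between the two sequence fields can be illustrated with a short translation sketch. The Translation table field in this record is blank, so the standard genetic code is an assumption here, as is the reading that the gene sequence is already shown in coding orientation (it begins with ATG even though the gene lies on the minus strand). The helper names are hypothetical.

```python
# Standard genetic code (an assumption: the record's Translation table field is blank).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGGCGCGTC GTTCTGAAGA CCCGGCGTGG"
print(translate(first_blocks))  # MARRSEDPAW
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the Protein Length and GC content fields.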