Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_18189 |
Symbol | |
ID | 5005420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | - |
Start bp | 601765 |
End bp | 605049 |
Gene Length | 3285 bp |
Protein Length | 1094 aa |
Translation table | |
GC content | 63% |
IMG OID | 640420841 |
Product | predicted protein |
Protein accession | XP_001421514 |
Protein GI | 145354485 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000423092 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0423026 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCGA GCGAGCGCGA GGAGTGTCGC GCGTTCGCGC TGCGATGGAC GCTGGACGCG GAGGGCGCGG CGAGGGCGGT GAGGAATCAG ATGACGGCGT GCTGCGCGAC GCTGGTGAAG CGCGCGGCGG TGGACGCGGA CGACGCGACG AAGATGGCGA CGCTGGGGGC GTGCGAACGC GAGGTGCGGG CGAAGATGGA ATCGTCGAAG GGGGAATCGG ACGCGGAGAC GATGGGACTG GAGGTGTTCG CGGCGATCGT GAGCGAGTTC GCGCCGGGGA CGGCGAGCGA ACTGGGGACG ACGTGGGAGC GGCACGAAGG ATGTCGAGCG AGCGCGGAGA AACATTTTTT GAAACCGTTC TTCGCGCACG GGTGCGAAGC GGCGAGACGG TGCGTGGAGA CGGGGAGGGT GAGCGATGGG AGCGATCGGG GGGCGTGCGC GGCGGCGCTG CGGTTGATGA ACGCGGTGTT GAGTTGGGAT TTTAATAGAG ACGTGAGTTA CGGGTTTCGG GGACGGGCGT TTCCGAGCTC GGAGAGCGCG GCGAACGCGT TCGTGAAGCT CACGCCGGGA ATGGAGTGGA GGGATGTGTT GTTGAATCCA GGCGCGCTGG ATTGGTTGTT CGACTTGCAC GCTGGGGCGG AGAGTGCGGT GTTGGCGGGC GGAGGCGTCG AGGCGAAGCG CGTCGCCGCG GCGAGTCGCA AGACGCTGAG CGCGCTCTGC ACGCTCAGTG GATGCGTGTT TCCCCCGCGG GATGCGGACG ATAGCCTGAG ACAGGGCCAT TTCGTGCGAT GCGCGCGCGC GATCGCCAAG TATCTCTTAC CGGCTGCGAC GAGCGTGCGC GCGGCGTTGG AGGGGCACGG AGAAGACGCC CTCATCGACG GCTGCCGCTC CATGTCCGCG TTGGCGCTCG TGCACGACGC CAACGATTTC GCGAGTTTAT CGCTCGGTCC GGAGTTGAAC GAGCGTACCG CGCTAGACTT ACTCGGCGAG CTCACGTTAG AGTGCTTGAA TCAAGACGCG CTGTCGGTGC AGTGCGAGGG CACGGTGACG GATGATTGTT TAAAGATGCT CCTCGAGGCG TGGGCGTCGT TGGTGAACAA GGGCATGAGC GCGCCGGGAG GCGTGGAAAC GGCGGTTCCG AGCGCGGTTT TAGAAGGAGC GGCAAATATT TCGCACGCTT ACGTCGTCGC TGGGTTGAAA GCCGCGCGCG AGGGCGCGCA CGAAGAGGAC GATGGGCACG AAGAGGAGGG CCAAGCCGGG GCGGCGGCGC TCGACGCGAG ACTCGAACTC GCCGCCCAAG TCTTGCGCGC GCATCCCACG ACGACTTTAC CGACGCTGCA GCACGTTTTA GTTGAAAAGC GAAACTCGCT TCCCGCGTGC ATGGCGAGCG GTCAAGATCC GAGTGAGTTG TTGGAGGAAT TATGGTGGCT GACGCGCCTG GCTGCGCACG TGCTCGCGGA TGACGGCGAC GGCGAAACGC CGATTCCACC AGACTCCCTC GCTGCGGCGT CCGCAGCCAC GGCACCAGGC GCACCAAACT GCGTAGTGGA ACTCGCGCGA GCGTTGATCG ATTTCGGTTG CTTGGCGCTC GACGCCAACG CGCGCGGCGC GCTGTCGCCT CGCCTGCTCG AAACCGTCGT GTGGGCGCTC GCGCGTTGGG CGGATACGTA TTTGATTCCA GAGGATTCCG GCGGTAGCTT ACACGCTGCG GTGTACGCCG CCGCTGGAGG CGTGCGACGA GGTGCTGACA TCGCGAACAA GCTCGCTGAG AACGGTGGGG GAATGTTCAG CGAAAGAAAC GGTGGCGTGG AGGCGCTCGA CGCGCTCGTG CAAATCGGCG TCAAGGCGCT CAGCGATTGG CCGGGGGAGA CGAGTTTACA AAAAACAATC GGATTCGTGT TGTTTCCCGT GCTCACGCGA CGAAAGACGC TGTTGAAGCA CTTAGTAAAC ATGCCTTCGT GGGATGCGTT GCGTCAGGCG TGCGCTGGGG CGCATCATGA GCGCGGCGTC GTCGCGTTCC CGCCCGAAGT CCATCGCGGT TTGAGCGAGT GCGTCGGGCG CGTCGCCGCG AGCGTGATTG ATCCCGCGCA GTGCGAGGCG TATGTGAACG CCCTCATCAC GCCCCCGGGC GAAGTCATCG CCGCGGTAAG CGTCGATCGT GAGGGTTTAC ATCACCCCGA AGGCGAGGCT CGAGCGTGCG CCGCGCTCGA GGCGTTGCGA GGCGTCGTGC GATCGACGAA TGGAAAGAGC CAACCGGCGG TTTTCAACTT TTTCGTCGCA GCCGTCGATC ACTTGCTAAA TTTGCAAACG CTCGCGAAAG ATTTAGGACG AGTGATGAAG CTATTGCTGC GCCTGACCGA GGAGTTCGTC GAGGCCAACT CGCCGTATCT CAACGCCCAA CAAGTGGATT GGGTGTGTCG GTATTGCCTG CGCGTGGTGG AGACGTACGC GAAATCCGGC CGCGGCGCCG TCAAGTCGGA AGCCGGCGCG CTATTGAGTC AAGAGGCGGT AAAAGAGGCG TATAAAGAAG TTCGCGCGCT ATTGCGCATG CTCACGCACA TGTCGAGCGG AAACTTACAC GACGCCATCA TCGAGAGCGC GCCGCCCGAC CAGGCGGCGG CGCTCGCGGA ACAAATCGAC ATCGCTCGCG TCGTCTTCGC CGGTTTGAAC GCCGTCATCC CGCTCATCAC GGATGAATTG CTCAAGTTCC CCAAGCTTTG CAGACAATAT TTCGAGCTTT TGGCGTACAT GCTCGAGGCC TACCCCAAAA AAGTCGCGCA GCTAGCGCCC GACTTATTCG GCACCCTCAT GTCGACGCTC GAATTCGGCT TGAAGCACGC CGACGAGACG GTGAGTAAGG AGAGCATGAC TGCGCTCGGC GCCCTGGCCA CGTTCCAATG CAACAGTGCG AAGACACAAA CCATCGGTTT AGGCGCACAC ATGGCCCCTA ACGCCGAGGG CGTGTCCATC CTCGCCCATC TCATGCGCCT CTTGTTCCAC CGTCTCGTCT ACGAGGAAGC CGTCTTCAAT CTCGTCGACG AAGCCGCCGA CGCACTCCTC CCCATCATCC TCCACGAGCG CCCGGCGTTC CAAAATCTCG CCTCTGCCTT CATCTCCGCC GTCGCCGACG AGCCACGAAG CGTCGATTTA CTCCAAAACG CCTTCGTCGC CCTCACGAGC GCCAACGGCC TCGCCGAGGG CGTCGACCGC GTCAACAAGC GTCGCTTCCG TCGCAACCTC GCCGATTTCC TCACCGTCGC TCGCGGCGTC TTGCGCACGC GTTAG
|
Protein sequence | MRASEREECR AFALRWTLDA EGAARAVRNQ MTACCATLVK RAAVDADDAT KMATLGACER EVRAKMESSK GESDAETMGL EVFAAIVSEF APGTASELGT TWERHEGCRA SAEKHFLKPF FAHGCEAARR CVETGRVSDG SDRGACAAAL RLMNAVLSWD FNRDVSYGFR GRAFPSSESA ANAFVKLTPG MEWRDVLLNP GALDWLFDLH AGAESAVLAG GGVEAKRVAA ASRKTLSALC TLSGCVFPPR DADDSLRQGH FVRCARAIAK YLLPAATSVR AALEGHGEDA LIDGCRSMSA LALVHDANDF ASLSLGPELN ERTALDLLGE LTLECLNQDA LSVQCEGTVT DDCLKMLLEA WASLVNKGMS APGGVETAVP SAVLEGAANI SHAYVVAGLK AAREGAHEED DGHEEEGQAG AAALDARLEL AAQVLRAHPT TTLPTLQHVL VEKRNSLPAC MASGQDPSEL LEELWWLTRL AAHVLADDGD GETPIPPDSL AAASAATAPG APNCVVELAR ALIDFGCLAL DANARGALSP RLLETVVWAL ARWADTYLIP EDSGGSLHAA VYAAAGGVRR GADIANKLAE NGGGMFSERN GGVEALDALV QIGVKALSDW PGETSLQKTI GFVLFPVLTR RKTLLKHLVN MPSWDALRQA CAGAHHERGV VAFPPEVHRG LSECVGRVAA SVIDPAQCEA YVNALITPPG EVIAAVSVDR EGLHHPEGEA RACAALEALR GVVRSTNGKS QPAVFNFFVA AVDHLLNLQT LAKDLGRVMK LLLRLTEEFV EANSPYLNAQ QVDWVCRYCL RVVETYAKSG RGAVKSEAGA LLSQEAVKEA YKEVRALLRM LTHMSSGNLH DAIIESAPPD QAAALAEQID IARVVFAGLN AVIPLITDEL LKFPKLCRQY FELLAYMLEA YPKKVAQLAP DLFGTLMSTL EFGLKHADET VSKESMTALG ALATFQCNSA KTQTIGLGAH MAPNAEGVSI LAHLMRLLFH RLVYEEAVFN LVDEAADALL PIILHERPAF QNLASAFISA VADEPRSVDL LQNAFVALTS ANGLAEGVDR VNKRRFRRNL ADFLTVARGV LRTR
|
| |