Gene Information |
Locus tag | OSTLU_25878 |
Symbol | |
ID | 5006790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 389014 |
End bp | 392128 |
Gene Length | 3115 bp |
Protein Length | 1005 aa |
Translation table | |
GC content | 56% |
IMG OID | 640422211 |
Product | predicted protein |
Protein accession | XP_001422576 |
Protein GI | 145356724 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.266953 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GACGCGCGCG CGACGAGCGA CGCGCCGCGC GACGAGCGAC GACGCGCCGG TCGAGCGTCG AGCTCGACGC GCGCCCGGAT GAGCGACAGC GAGAGCGATT ACAGCGACGA CGACGCGCAC ATCGGCGGCG GCGATCGGGA TGCGGAGGAA ACGTACGACG GCGCGGGAGG CGAGGCGACG CTGAACGTGG ACGTCGGATC GTCGCTCACG AGCGCGGATT GGGCGGACAT CGATAACTCG GACCTCGTGG CCAAGGTGCG CGGGAGCGCG GGGTTTCGAA TCATGCGAGA GATTCAGCCG TGGGACGGGG CGCTGGACGC GCCGCCGGAT TTGATGAAAC GACACGTGTT TGAATACTTT TACACGCAGC GCAAGGACAG TGACTTTTAT AAAAAGTTGG CCAAGGATTG CTTCCCGAGC GAGTACGAAA GAGCGGGGGC GGATAAATCG GAGAAGAAGC GGTTGGCGAA CCTCTGGGCG GGGGCGGTTT TGGACGAGTT GAAGGCGATG CGAGACGGGG CGAAGCACAC CTTGCGATGG ATCAACGCCA TCAAGGAAGC GCAGGCGGGT GATTTGTTAT TTTTAAAGCG TGAGGCTCTG ACTCACGGGC CCGGGAGCGT GCAGGAGATT GCCAGTCAAA TTGTTTGGGA CAACGTGAGT AATCGCATCA AGGATCAAAC CTTTGCGGCG ATGCACGCGT GGACGAACAA ATCGGGTGCT CAATCCGGTG GCGTCGCGCG CATGTACAAG CATCAGCACG AACTCATCGA TCAACTGGAC AAGCACTATG CCGCCATCGG GAAAATGCAA GAGGCGAAGA AGCCGTTACC CAAGCCGTTG CACGTCATCT TATCGACGCC GACGGGGTCG GGCAAGACGT TCACCGCCAT TCTCGTTCAC CTGCGTCTTT TGAAGGTCAA GTACCCGGAC GCTATCTTGC TCTACTCAGT GCCGACGAAA CAAGTTTTGA AGCGCGTTGG ACAAGAGTGT GAAGCGCACG CGGTGCCGTA CTGGACGGCG GCGCGCGACA ACACGGGCGA TGGAACGCTT CACCAAGTGC GTCGTCCGTA TTCCATTCGT GATAAGGCTC GAAACAAGAA AGCACTTCGT GCGGCGCGTG CTATCGGTGA AAAGGTCAGC GCTGGTTCCG GTACGATTCA ACAACAGCTC GAATACGCCG CAGATGTGGG TTACAAGCTC AAGGATCACG GCGCTGGTAA ACCGGATATC ATCATCGCGG ATCTCAATAC CACGGCGGCG TTGCTCAAAG CCTGCAAGGA AGAAAGCTCC AGCTCGTTCT ACCACGAGTC TAAGATTCTT TTGTACTTTG ACGAGCCGAA TATGGGTATT CACCTCGATC CGAATGTTTT GGGTGTGGTT TCGTCTATCC AGGCCAACAT GCCGCTCACC GCCGTACTTG CTTCGGCGAC CCTAGGCGCG TGGGAAGGTT TGGAGCCTTG GTGGCGCGGC CCGACCGATG CGAACCAAAT TACTATTAGT TTGGAGCCGT ACGAGTTGCC AATGGCGAAG CTCGCCGTTT TCAACGAAGG CACGAGCGAG TTTTCTCCCA TGAGTCCGCT CAACTTGATT GAAAATTATG CCGAATACCA ACGAGTGATG GAGGATTACC GTTTGCCCAC GCTTCTGTTG CGTCACTTGA CAGCTCGACA TGGTAATGAT TTATTGCAGA TTCAACCGCC CGGTGGACCT TGGGATAAGG TGCAGGGTGA CGTGAAATCG TTGCGTTTGG CAATCGAACC GACGTTTACG AGCCTGTCGC AAAAAGAATT TGAACGACTG 
CAAGGTCGAT GGAAGATGGG CGAGGACGCT CCTACCAAGG TCGATGGCAT CCGTGGTGCA CTCTCAAAAG AGGGCGTCAC TATGGTCGGT TGCTTGGATC CGCGCAAGGT GGCGTTCGAG CTCGCCGGCT TTGCTGACCA GGAGGCTTGG ATCCAAGACG TTCATAAATT GAACAACAAG CTCAAGGAAG CGGAAAGAAT GGTAAAAGAG AACGCCAAGG CTGAAAAACG CAAAAAGAAG GATGACGAAG ACGAGGCTAA GGATGGTGGC GACGAAGGCG GCGTCGGAGT CGTGACTCTT CGACCGATGT TAAAGATTAG TCTCGCCGAA GCTCTCGAGG CTGACATCAA CACCTTGGTC ATGCTTTCTA AGGGTATCGC TTATGCGTGC GGGTCAGGCA CAGAGCCTAT GGTGAAACGT CTTTATAACC AAGCGTTGCT CACCGTTCCC GATTCTCTTC GAGGACGATC TCCGCCGCTC AACGTGCTGG TTGTCGACTA CTCGTCCATT TACGGAACGG ATTGTCCCGC GGTTGATACT TTGTTGTTGC AAGAGGATTT GGGTCGACTC TTAGCTTGGG AAGATCTTCA GCAGTTCCTT GGTCGTCTTC GACGCGATGG AACAGCCGTT TTCTACTCCA AGAAAACTCT TCGCCGGGCC GCTCTCGGCG CCGCGGTCGA AGAGGAAGAG ACGACGGCGC TTATTGAATT CCAAAAGCTT GTTGAAAACT CTGTGCTGGA TCTCGAAAAG GCTCAAAAAC GCGACACCGA CAGCGTCACC GCCCTCGTGA CCAAGCTCTC TGCGTCTTCT GGTCGCAGTG CGGGAGAAGT CGCTTCGTAC GTGTTGGCGT CTGTTATATC GTTTGCGCTA TCTGCTCCGT CTCATCTCGA TGGTGCGGGC GTGTACCCGG CGACCATCCC GGAGAGCGAC AAAGAGCTCT TAGCCGCGAT CACAAAGCGC GTGGAGGGTT ACTCGGACGC ATTGGAAAAT GTGTTGAAAA AGATGTCGGA GCAAGTTCGT GCCGTGAGCG CGATCGAAGC CCTCTCGCTC TCTGCGAACC CGTTCGCCAG CCGTACCGGC GGCGCTCGCG TGCTCGGCAT TGCTGCGCAA GTGTTGAAAA TGCTGTACGA TGCTGATATC CTCTCCGAAG AAGCACTCTT TGCTTGGGCT AATGCGAAGC GCAAAGAACT TCTCGCTGAG TCAGACGGTG ACGCTCGCTT CTTTGGCAAA GCAAAGCCAT TCATTCAGTG GTTATCCGAA GCAAGCGATG AAGACAGCGA TGAAGAAGAA GAATAGGTCA CACACACATC ACATT
|
Protein sequence | MSDSESDYSD DDAHIGGGDR DAEETYDGAG GEATLNVDVG SSLTSADWAD IDNSDLVAKV RGSAGFRIMR EIQPWDGALD APPDLMKRHV FEYFYTQRKD SDFYKKLAKD CFPSEYERAG ADKSEKKRLA NLWAGAVLDE LKAMRDGAKH TLRWINAIKE AQAGDLLFLK REALTHGPGS VQEIASQIVW DNVSNRIKDQ TFAAMHAWTN KSGAQSGGVA RMYKHQHELI DQLDKHYAAI GKMQEAKKPL PKPLHVILST PTGSGKTFTA ILVHLRLLKV KYPDAILLYS VPTKQVLKRV GQECEAHAVP YWTAARDNTG DGTLHQVRRP YSIRDKARNK KALRAARAIG EKVSAGSGTI QQQLEYAADV GYKLKDHGAG KPDIIIADLN TTAALLKACK EESSSSFYHE SKILLYFDEP NMGIHLDPNV LGVVSSIQAN MPLTAVLASA TLGAWEGLEP WWRGPTDANQ ITISLEPYEL PMAKLAVFNE GTSEFSPMSP LNLIENYAEY QRVMEDYRLP TLLLRHLTAR HGNDLLQIQP PGGPWDKVQG DVKSLRLAIE PTFTSLSQKE FERLQGRWKM GEDAPTKVDG IRGALSKEGV TMVGCLDPRK VAFELAGFAD QEAWIQDVHK LNNKLKEAER MVKENAKAEK RKKKDDEDEA KDGGDEGGVG VVTLRPMLKI SLAEALEADI NTLVMLSKGI AYACGSGTEP MVKRLYNQAL LTVPDSLRGR SPPLNVLVVD YSSIYGTDCP AVDTLLLQED LGRLLAWEDL QQFLGRLRRD GTAVFYSKKT LRRAALGAAV EEEETTALIE FQKLVENSVL DLEKAQKRDT DSVTALVTKL SASSGRSAGE VASYVLASVI SFALSAPSHL DGAGVYPATI PESDKELLAA ITKRVEGYSD ALENVLKKMS EQVRAVSAIE ALSLSANPFA SRTGGARVLG IAAQVLKMLY DADILSEEAL FAWANAKRKE LLAESDGDAR FFGKAKPFIQ WLSEASDEDS DEEEE
|
| |
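The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes that Start bp and End bp are inclusive 1-based coordinates and that the unspliced coding region is one codon per residue plus a single stop codon. Under those assumptions the listed gene interval is longer than the coding region alone, which suggests the gene sequence includes some flanking untranslated bases.

```python
# Values copied from the Gene Information table in this record.
start_bp = 389014
end_bp = 392128
gene_length_bp = 3115
protein_length_aa = 1005

# Assumption: coordinates are inclusive and 1-based, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon (gene is not spliced).
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene interval that the coding region does not account for.
flanking_bp = gene_length_bp - coding_bp

print("span matches Gene Length:", span_bp == gene_length_bp)
print("coding bp:", coding_bp)
print("non-coding bp in interval:", flanking_bp)
```

If the first check printed False, the coordinates would most likely be using a different convention (for example, a half-open interval) than the one assumed here.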