Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25714 |
Symbol | |
ID | 5006146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009371 |
Strand | - |
Start bp | 58241 |
End bp | 60207 |
Gene Length | 1967 bp |
Protein Length | 646 aa |
Translation table | |
GC content | 63% |
IMG OID | 640421567 |
Product | predicted protein |
Protein accession | XP_001422191 |
Protein GI | 145355914 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.0474103 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.760083 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAGCG CGGCGATGGC GTCGCTGGGG ATGATGGCGA TGATGGCGGC GATGGGGGGG CGGGAGACGA CGGCGAACGC GAACGCGAAC GCGAACGCGG CGCTGGGGAA GCGAAGCGTG CGACGGGTCG AGGGAGCGGC GTACGAGGGA ACGGTGTTGG ATTTGGATGA GAAAACTCGC GATGATCTGA CGCGGCGGTA CCGAGCGAAG GAGACGGTGG CGCGGTTGGG CGGGGCGAGA CGCGACGGGG CGCTGACGGA GCTCGGGGCG TCGAATCCGC TGTTGGAGGG CGAAGCGTGG AACACGGCGA GCGAATCCGA GCGATCGAGG AGGGTGTCGA GAACGCACGA CGTGGACAGC GGTGCGCTCG GCGACGTGCC GCAGTTTTTC AAGGATGAGG CGGAGAATTT TAAGTCTTCG TCGATCGGTT TCTCGAGCGG CGCGGCGGCG AGCGGCGCGG CGGCGAGCGG CGCGGCGGCC GTCGGCTCAG GCATCGTCTC GACGCCCATC GCGGGTTCAG GCGTCGACGG TGCCGCGGCG TTGATCGCCG AAGGCGCGAC GTGCGACGCG GCCATGTGTC CAGATAAGTT CCAGCACGTC GATCCGGCGT GCGCGAACGG GGGCATCGGT TGCGTGGGCG ACAGCGGGTG CCGGTTCTGC CGCGTGCTCG GGATGGAAAA ACAAGGCGGT ATGAACGATA TTCCGGACTC TATGGGTCTT TGTGACCCGT GCGTGTGCGA GCATTATGGT TTCAGCACTG GATGCGCGGG GATGCCGACG CCGGAAGGTT TGGAATCCAC GCCGACGACG GCGCCGGTGC AAGCCACGCC AACGACGGCG CCGGTGCAAG CCACGCCGAC GCTCGAGCAA TCCTCAACCG TGGTGCAAGC GGCGCCGGTG CAAGCCGCGC AAGCGTATGA TGAATTCTCA GACATACCGC AAGCGGCGCC TGCGCAACAA TTCTCGAGCG AGCCGCAGGT GGCGGAGGTG ACAGACGAAG GACTGGCGTC GAGCTTCTTG ACGGCGTCGT CCACTGCTTC CGCCGCTTTA GGTCTCACGT GCAACTATAA CACGTGCGAT AGCAAGATTT TGTACTTGAT GTACGATAAG CGATGCCTGA CCGAGGGCGG CCAAGGTTGC TACGCTGACA GCGCTTGTCG ATTTTGCAAG ACGGATCACC AAGATCCATC CATCAGCTAC GAAGACGGAT GGGAAACGTG CACTCAGTGC GTGTGCGACC ATTACGGTGT CACCGGGTGC GAACAAGGCG GTTTACCCGA AGTTCCGATG AACCCGGTAG AAGCGTCGTC CACGGCACCG GTGATGCCGC CCGTGGAAAC GCAGCCGACG TTGGAGTCCA CGGCGCCGGT GGTGCAGGCG ACGGACGAAT ACCAAGATCG CGAATACGAC AACGATCATT ACGGCGAGCA AGTGGAGCAA CCGGCGCAAG TGGAGCAACC GGTGTTGATG GAACAGCCGG TGCTAGTGGA ACAACCGCCG CAACCGCCGA AGGCGAACGT CGATCCTAAC CGGTTTGCCG CCGCCGGCAT GGGCGCCATG GAAGAATCTG GCCCGGTCGA GGCGATCCCG GCCGATCAAC GCCTCACGGC GAACGGTATC GAGTGGACTC AAACCAAGTT TGGATCGTTG CAAACGTGCG AATCCCGATG CGCCGAGGTT CAGAAGACGT GCGCCGAACA CGTGTGGCCG AGCACTATGA ATGATTTCCG CGACGTCATC TCCAGGACGA CGAGCGCAGA CGGCTCTGGA AACATCGTGG CGTGCGACGA AATCTTGCTC AACGACGAGA CACAGCACTG CGGTGGCGTT TCCGCGCTTT CTGGACGGTG CTTCTTGAGC CCGTCTCACG GCCATCTCCC GGCGTCGTGC ACGTACGCCG CGGCGCACGC CGACTGCACA AACATTTGTC CGTGCGTCTG ACGCGTTTAG CGCGCCCCGA ACCGAGC
|
Protein sequence | MASAAMASLG MMAMMAAMGG RETTANANAN ANAALGKRSV RRVEGAAYEG TVLDLDEKTR DDLTRRYRAK ETVARLGGAR RDGALTELGA SNPLLEGEAW NTASESERSR RVSRTHDVDS GALGDVPQFF KDEAENFKSS SIGFSSGAAA SGAAASGAAA VGSGIVSTPI AGSGVDGAAA LIAEGATCDA AMCPDKFQHV DPACANGGIG CVGDSGCRFC RVLGMEKQGG MNDIPDSMGL CDPCVCEHYG FSTGCAGMPT PEGLESTPTT APVQATPTTA PVQATPTLEQ SSTVVQAAPV QAAQAYDEFS DIPQAAPAQQ FSSEPQVAEV TDEGLASSFL TASSTASAAL GLTCNYNTCD SKILYLMYDK RCLTEGGQGC YADSACRFCK TDHQDPSISY EDGWETCTQC VCDHYGVTGC EQGGLPEVPM NPVEASSTAP VMPPVETQPT LESTAPVVQA TDEYQDREYD NDHYGEQVEQ PAQVEQPVLM EQPVLVEQPP QPPKANVDPN RFAAAGMGAM EESGPVEAIP ADQRLTANGI EWTQTKFGSL QTCESRCAEV QKTCAEHVWP STMNDFRDVI SRTTSADGSG NIVACDEILL NDETQHCGGV SALSGRCFLS PSHGHLPASC TYAAAHADCT NICPCV
|
| |