Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_24368 |
Symbol | |
ID | 5001608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 201149 |
End bp | 204245 |
Gene Length | 3097 bp |
Protein Length | 1025 aa |
Translation table | |
GC content | 64% |
IMG OID | 640417029 |
Product | predicted protein |
Protein accession | XP_001417174 |
Protein GI | 145345344 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0213658 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCCGT CGTCGAACGA CGGTCGCGCG AGCGACGAAA CGCGCGAAGC GCCGTCGTCG AACGCGAAAA CTATGTGGCG CAAGGTGGCG ACGCCGGTGC GGAGCGCGAC GAAGATAGCG TCGGCGCTGG TGGGTGGATT CTTTTGGCAG GGCGAGGACG AGGGCGACGG CTTCGAGGGC GGGGCGAAGG AGAATCGAAC GATGGACGTC GGACGAGCGA TGGACGAGGA ACAGAGCGTG AGAACGCGTG GAAACGGTGC GGGAGGAGCG AGAGCGGGTA CGACGACGAC GACGGGGCGG ATCGGGGAGG TCGGCGGCGC GACGAGGACG ACGCGGTTCG CGAACGAGGC GGCGGCGGCG GAGACGGCGA GCGATTCGCC GACGCCGTTC ACGGAGATCA GGGAGTTCTT GCGCCGAGGA CAGATGGATA GAGAGCGAAT GGCGCGATCG AGCGCTGGCG GGGCGTATGG CGAGATTAGT GCGGCGGTGA AGCCGCCGTC GCCGTTTGAG CGCGCGCCGG CGTTTGACGC ACACGCGACT CCGGCTCGGG AGCGTCCGAT GGTGGCGCAG ATGGCCACGC CCGGAACGAC GGGTGTGGGC GCGAGTTCTC CGGGATACGC GTCCGTGCGT CGATCGTACG GCGCGTCGGG GACGCCATCG CTTCTCGGTA CGCGCGGTCG CACGGCGATA AAACCGACGA TTCTGGAACC GCCAATGGAG CAGACGCCAA CGCCGCCTCG CGCGAGCTTG AAGCGTGATC GTGAGCTGAC TATCATGGAT CTTGAGCGTG AGCGAGCGCG CGTGACGACG CCTCGACGAT TGGAGTCCGC CGCGCACAGC CCTGGTCCGG GAGCGCTGGC GCGAGAGTCT GCGAGTCCGG CGTACCACAC TCCCTCCGCC AAGCCAGTGA CGAGCGCGGT GACGACGGAT ACAGCGAAAC GTATATTGCA AACTTTGGAT CGCTTGGCAG GTGCGCGAAC GGCGACACCT ATGGGTGTCG CAGCGGCGAG ACCTTCGTTG TTTGCGACTA GACCCACGCA ACCTGTGCAA GCGCCGTCGC CCGTCGGCGC CGCGTTTGAC AAGCAAGTCA CGCCGACATC GGCTACCTTT GTTGCGACTC CCAAATCGGG CGTTTCTTTT GCGACTTCGC CGATGCCCTT CATCAGTGCA TCCTCGTCTG GTCGCGCGCC GACGACGCCG TACCCGACGA GTGCCGAGCC TGAGATTGAG ACGCCAAACT TCAAGTTCGG CGAAGATAGC ACCTTGCTCG CCACCGGTAA GAAACCAAAA GCGCCGTCCG GGCCTGTGGC TTCTGTTCCG ATAAAGTTCA CGTTCGGAGA AGGTCAAACA ACTCCGTTGT TTACGAAGAA ATCCGTCGCC GCGGCGCCAG TGAAAACAAC GACGACGGTG CCACCGCCGC CGGCGCAACC GACGGCGGTC ACGCAAAAGT CGGAGGAAGC GCCCGCTGCG AGTAAACCTG TCATGAATCT CTGGAGTGCG GATTTCTTGG CGAAGAATCA AGAGCATCAA AAGAAAGTGC AAGCGGCGAT CGAAGAAGAA GAGAAAGCTG CGAGCAAGCC GCCTGCACCG TCGCCGTTTG CGCCGTCTGC CCCTGCCGGG GACAAGCCGG CGTTCTCGTT CGGTATTCAC GCTGCGAGCA GCGCGCCCGC CCCTAGCACT GATGCGTCCG GGTTTACTTT CGGTTCCACC GCGCCCGCGG CGCCGACCTC GGCGCCGGCC TCGGCGCCAG CGTTTTCTTT TGGATCCACG GCACCGGTGA AACCAGCGGA AACCACCGAA GTGAAGAAAC CGGCGGGAGA GTCTGGTTTA CTCGCGTTTT TGGGCGCGTC TTCGAGCGAA GCTAAAACAG CAAGTGAACC TGCGACTGCG CCCACGTTCA CATTTGGCGC GCCACCCAAG TCGCCTGATA ATGTGCCCGT GAAAGTACCC GATCCTGCGC CCGCGTTTTC CTTCGGCGGC GAAACCAAAG CGCCAGAAAG TTCGAAACCG GCGAGCGCAC CGTTTACGTT CGGCACGGCG ACACCCGCCG CAGCCGCGGC GCCGGCGCTG TTTACGTTCG GGGCGAAACC GGTAGAGAAG GCGGCCGAAC CAGAGAAGGA AAAATCACAG CTTTCGGCGT CCGCGCCCGC TTTTTCGTTT GCCGCGGGGT CGAAGAAGGA AGATGAACCG AAAGCGGCGG AGACTGCCAA GCCGTTCACC TTTGGCGCCG CACCGGCGCT GGACGCAACG AAGCCCGCGG GGGACGCGTT ATTTAGCTTT GGGGGTGCTT CGGCGGAGAA GAAAGAAGAA CAGCCGAAAC CAGCGAGCGG CGGATTTAGC TTTGGGGCCA CGGCGGCGCC CGCGTTCGGA GCTGCATCAT CGGCGGTAAA GGAGGACGCC AAGCCCGTGA GCGCGCCGTT TGTGTTTGGC GGTGCGTCAA CCACGCCCGC TTCGAGCGGC GGATTCACCT TTGGTGCCTC TGCGCCGCCA GCCGCACCGG CGAGCGGTGG TGGCTTCACG TTTGGTGCCG CTTCGAGCAC TACCGCGGCT TCGACACCAT TCGGCGGCGG TTTAAGCGTG AACACTGGTA AACCCGCGGC GAGCGTCTTC GGTCAACCGC CGAAACCGGA CGAGCAACCA AATACCCCTG AACCGATGAG TCCGTTCGAA TCTAGCAAGT CGCCTCTAGG TGCCGCATCA ACTGGTAGTT CCCTGTTCGG TGGTGCGTCC ACGAATAGTT TTGCCGCCCC TCCGGCGAGC AGCGCGCCGT CGACGAATCC GTTTGGCGGC GGCGGCGCGA ATCCGTTCGC CGGCGCGTCG GCAGGCGTGA CGAACCCATT CGGTGGTGCA AGCTCAACGC CCGCGTTCGG CGGCGCTGCT GCCCCATTCG GTGGTGCGTC GGCGAGTGCC AACCCTTTCG GCGGCGTTGC CGCGCCCGCA CCCGCGGTTC CAGCGCCGAA TCCGTTCGGC GCCAACCCCG CGCCCGCTGG CGGCTTCTCC CTGGGCGCCG GCGACGAGTC CTCGCAGGGC GGAAGAAAGC CACCGCGCAA GTTCAAGCGC CCGCCGCCGC GCCGATAATC CCAACCAGCA AAATCAC
|
Protein sequence | MTPSSNDGRA SDETREAPSS NAKTMWRKVA TPVRSATKIA SALVGGFFWQ GEDEGDGFEG GAKENRTMDV GRAMDEEQSV RTRGNGAGGA RAGTTTTTGR IGEVGGATRT TRFANEAAAA ETASDSPTPF TEIREFLRRG QMDRERMARS SAGGAYGEIS AAVKPPSPFE RAPAFDAHAT PARERPMVAQ MATPGTTGVG ASSPGYASVR RSYGASGTPS LLGTRGRTAI KPTILEPPME QTPTPPRASL KRDRELTIMD LERERARVTT PRRLESAAHS PGPGALARES ASPAYHTPSA KPVTSAVTTD TAKRILQTLD RLAGARTATP MGVAAARPSL FATRPTQPVQ APSPVGAAFD KQVTPTSATF VATPKSGVSF ATSPMPFISA SSSGRAPTTP YPTSAEPEIE TPNFKFGEDS TLLATGKKPK APSGPVASVP IKFTFGEGQT TPLFTKKSVA AAPVKTTTTV PPPPAQPTAV TQKSEEAPAA SKPVMNLWSA DFLAKNQEHQ KKVQAAIEEE EKAASKPPAP SPFAPSAPAG DKPAFSFGIH AASSAPAPST DASGFTFGST APAAPTSAPA SAPAFSFGST APVKPAETTE VKKPAGESGL LAFLGASSSE AKTASEPATA PTFTFGAPPK SPDNVPVKVP DPAPAFSFGG ETKAPESSKP ASAPFTFGTA TPAAAAAPAL FTFGAKPVEK AAEPEKEKSQ LSASAPAFSF AAGSKKEDEP KAAETAKPFT FGAAPALDAT KPAGDALFSF GGASAEKKEE QPKPASGGFS FGATAAPAFG AASSAVKEDA KPVSAPFVFG GASTTPASSG GFTFGASAPP AAPASGGGFT FGAASSTTAA STPFGGGLSV NTGKPAASVF GQPPKPDEQP NTPEPMSPFE SSKSPLGAAS TGSSLFGGAS TNSFAAPPAS SAPSTNPFGG GGANPFAGAS AGVTNPFGGA SSTPAFGGAA APFGGASASA NPFGGVAAPA PAVPAPNPFG ANPAPAGGFS LGAGDESSQG GRKPPRKFKR PPPRR
|
| |