Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_19135 |
Symbol | |
ID | 5006671 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 510672 |
End bp | 513769 |
Gene Length | 3098 bp |
Protein Length | 980 aa |
Translation table | |
GC content | 58% |
IMG OID | 640422092 |
Product | predicted protein |
Protein accession | XP_001422613 |
Protein GI | 145356799 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.439366 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTGGC AGTGGCACGG CGGTTCGTTA GGTCGCGATG GAGCGCTGTA CGCGGTGCCG TGCAACGCGT CGTCCGTCCT GCGGGTGTGC ACGAAGACGG AGGAGGTGAG TTTCATCGGC GGCGACGTGA TTTCGCCGAT GAAGAATAAG TGGTACGGCG GCATTCAAGC GCCGGATGGT TCGATTTACG GCGTGCCGTA TTGTTCGGAT AAAATCATAC ACATCGTTCC AGAGACACAA TCGGTGGAGA TGCTCGAACT ACAAGGAGCG ACTTTGGAGC CGAATAGCTA CGCTTGGCAC GGCGGAATCT TAGCGCCAAA TGGCTGCATT TATTGCTTTC CCAGTCACGC TCGTCGCGCG ATGAAGATTG ACTGCGCCAC GCGCACGTGC ACGCTCATCG GCGACGATCT CGGAGACAAG CGGTACAAAT TTGGCGGCGG ATGCGTCGGC CCCGACGGCG ACTCGGTGTA CGCGTTTCCG AGCGATTACA AGGCGGTGCT GAAAATCGAC ACGAGAACCG ACCAGACTTC TTTAGTGGGC GAAGGACTAC CCGGAATGCT CCCAGACTTG CTTAACAAGT GGCAAAACGG TGTTCTCGCG GGCGACGGTT ACATATACGG AATCCCGTGC GACGCACCGT CTGTGATCCA AATCGACCCC GGGACGGATT CGGTGCACTT CCTCGGCAAC CTGGGCGACT TACCCGACAA ATACCAAGGC GGTTTCTTAA ATCGCGACGA CGGCGTCGTG TACTGCATCC CCGAAAACGC TGAAAACGTC ATGCGCATCT GCCCCGTCGG CAGCGACGCG CACCCACCGC CCCGCAGCGG CTTCGATCAC GACGCTCGTC CCGAAAGTCG TCGCGCGGCG GCGTTGTAAC TAAAGTGTGG TTTCTGAGCC ACTTCCCGTT ATCCCTTCGC GCGTCTCCGC AGACGCGCGC GCGCGACCGC ACGTGCGCGC CGGCGCGTCG ACGCCGATCA CGCGCGCGCA CCATGCTCCC GTCCTGGCTC GCCGGGAGCT CGAGCGATGA GTACAGCGGC GACGAAAGCG AAGGCCCGCG ATCGCCGTCG AGGTTGAAAC AGATCGGTTC GACGGTGAAA CACGAGGCGA AGGACCGCCT GACGAAGCTG GCGCGAAGCG GAACCGCGAG CACGGCGGTG GATGGACTCA CGGATTTCAT CGCGGACCCG AAATCGCACG CGGCGGCGAC GAAGATTGGA AATTTGAAGG AAAAGTCCAA GACGCTGTGG ACGAACGCGA TGCGGGACGA CGAGGCGAAG ATTGAAAGGA CGAAAAACGC GACGAAAAAG GCGCTGGAGA TCACGGAAAA GTTATGCGTG AACGCGGATC CCGATGGTGA AGTCGCGCAC ACGTTTGCGG ACGCGGCGAG ACGGATGAAG GATATCGTGG ATACGGGGAA AAACCGCGAA GAGGTGCTCG AGCTGAGCAA GACGCTGGGG AAGGATGTGT GGGAACGGTT GAGCACGCGA GCGAAAGGAA ACGAACACTT CACCGGTATG CGGGGAACCG TGGAAAGGAT CGTGAACAGG GTGAAGGGGC TGATGCAAAA GTTGCAGGAT GAAAAGATGA AGCGAGACGA GGAGGCGGCG ATTTCCGCCG TGTTAAACAG CGAGGCGGCG GTCGACGAGA TTGATTCGCG GCTTAAGACG TTGGCCGAGG AGAGTAAAGA TGTGTGGAGT GATTTAAAGG CGGATACGCA ACTTAGAGCG TTGATTAAGG AGGAAGTCGT GCCGGGCTTC GAGCGTTTGA TTCGCGGAGC GGTGCAGGTT TCGTGCGAGC TCATGTCCAA ACTCGAACTC CCGCGCGTGG ACGGGGTGTA CGATTCCCCG ATCGGATCGG TGTGTTATCA CGTAGATAAC GTGCACTTCA CCGAGTTCCA CGTGTCCAAA GAGTCGTTAC GGGTGATCAA TCACATGGAT GAGGACGAAC TCGGCGCTGG CTTGAGCACG ACGGTGGAGG TCAGAGACAT AAACACCGTC ATGCAAAATG TGGAGTTCGC GTATTGCGAA TTTCCGCGAA ACTGGGGGGT TGTGGACGGT GAAGGATTAT GTACAGTCAC AGTGGACGGC GCGAGCGTGG GAATTTCGTA CGAAATCATC ATCAACACGA ATCAGTTGAT GAAACTCGTG AACCAAGGAG TCGAGTTGGC CAAGGACGAC GGGAAAATTG CGGAGATGAG AGAGAAGATT GAAGCCAAGA TGAAGGAACG TAAAGAGGCG AAAGAGCGAG AGGCGGCGCG GGCGCAGGCG GCGAAACCGG CGGCGACAAC ACCCGTTAAA ATCGAGAAGT GCGCGTCAAT CGATTCGGAC CGCGAAGAGT TCGGTTCACC CACGGGAGAG GTTGATGCGC TAGGTTCCGC GCTCGATCGC GCGTTTGGTG GTGGTATGTT CAGCGACGCG CACGACGATG TAGACTCGCC GCCGTTATCG CCGACGTCGC CGACGTTTCA CGACGCCACG GACGACGCTC GTATCTTGGG GGCCAAGGAA AAGCTGATTC GCAATCGCAA GGTGGTGAGT GAAATCTTAG GTGACGACTT TTTGGGCGAA GAACCCGTGT TAGAGCTTCG CGTACACACG ACGCACATCT CAGTCGGCGA GCTGGACGTG CAAATTAGCG GCACGTCCGC GGCGTGGTTG TACAACATGA TTGCTCTCGT CCTGACGCAA CAGCTTCGCG GAACGATTGA GGAGAAAATC AACAACATAA CGGTCAAACA GCTCGCGCGA GTGAGCGGTG CCGTTTCCGC GTACAGCGCT GGTCTAATTG AGGTTTCCAT CTACCAAGAC GACGAATCGG ACGACGACAT CGGCTCGATG TTATCCGGCG TCACGGGCTC GCTTCGCGAG AATCCGGGCG TCTGGGGCGA AGACTGGAGA TGTTCGCACT GCCCAGGTGA AGCCCCCGAA CACGTCGAAG CGCGACGCAA GTTTGGTTCT CGATCGCATT CAAAAGCCTC TTTCGAGGGT TTGGACGAAA TCGTCGAAGA AGCTGACGCG ATCGAGCGCG CGGAGGCCGC GCGCCGGTCG CTCGAAGACG AGGACGTCGA CTGGCACGAC ATCGCGAGTC CGATGTGA
|
Protein sequence | MKWQWHGGSL GRDGALYAVP CNASSVLRVC TKTEEVSFIG GDVISPMKNK WYGGIQAPDG SIYGVPYCSD KIIHIVPETQ SVEMLELQGA TLEPNSYAWH GGILAPNGCI YCFPSHARRA MKIDCATRTC TLIGDDLGDK RYKFGGGCVG PDGDSVYAFP SDYKAVLKID TRTDQTSLVG EGLPGMLPDL LNKWQNGVLA GDGYIYGIPC DAPSVIQIDP GTDSVHFLGN LGDLPDKYQG GFLNRDDGVV YCIPENAENT RARDRTCAPA RRRRSRARTM LPSWLAGSSS DEYSGDESEG PRSPSRLKQI GSTVKHEAKD RLTKLARSGT ASTAVDGLTD FIADPKSHAA ATKIGNLKEK SKTLWTNAMR DDEAKIERTK NATKKALEIT EKLCVNADPD GEVAHTFADA ARRMKDIVDT GKNREEVLEL SKTLGKDVWE RLSTRAKGNE HFTGMRGTVE RIVNRVKGLM QKLQDEKMKR DEEAAISAVL NSEAAVDEID SRLKTLAEES KDVWSDLKAD TQLRALIKEE VVPGFERLIR GAVQVSCELM SKLELPRVDG VYDSPIGSVC YHVDNVHFTE FHVSKESLRV INHMDEDELG AGLSTTVEVR DINTVMQNVE FAYCEFPRNW GVVDGEGLCT VTVDGASVGI SYEIIINTNQ LMKLVNQGVE LAKDDGKIAE MREKIEAKMK ERKEAKEREA ARAQAAKPAA TTPVKIEKCA SIDSDREEFG SPTGEVDALG SALDRAFGGG MFSDAHDDVD SPPLSPTSPT FHDATDDARI LGAKEKLIRN RKVVSEILGD DFLGEEPVLE LRVHTTHISV GELDVQISGT SAAWLYNMIA LVLTQQLRGT IEEKINNITV KQLARVSGAV SAYSAGLIEV SIYQDDESDD DIGSMLSGVT GSLRENPGVW GEDWRCSHCP GEAPEHVEAR RKFGSRSHSK ASFEGLDEIV EEADAIERAE AARRSLEDED VDWHDIASPM
|
| |