Gene Information |
Locus tag | OSTLU_17468 |
Symbol | |
ID | 5004505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | + |
Start bp | 589550 |
End bp | 592843 |
Gene Length | 3294 bp |
Protein Length | 1097 aa |
Translation table | |
GC content | 59% |
IMG OID | 640419926 |
Product | predicted protein |
Protein accession | XP_001420389 |
Protein GI | 145352085 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
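The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends with a stop codon; the record states neither, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(589550, 592843)
print(length)                  # 3294, matching "Gene Length"
print(protein_length(length))  # 1097, matching "Protein Length"
```

Both values agree with the record, which is consistent with the gene being unspliced.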
Plasmid Coverage information |
Num covering plasmid clones | 74 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGAAC GTGCGGTGCG GGCGACGCCG CGCGCGGGGC CGTCGACGTC GACGCGGCGG CTGAAATCGT CGACGCCGTC ACGCGCGTCC ATCGTGCGAC GCGCGCGAGG GAAAGGCGAC AAGGACGATG CGTCGACGCC GACGTCCTCG ACGCCCGACC CAGAAGAGTT GAAACTGGCG TTGGCGCGCG CGAGGACGCG CTTGGAGCGG TCGCGAAAGG GTCTGCCGGC GTTGGTCGAA GAGAACGAGA GCGAGGGTTT CACGTCCGAT GAAGTCGTCG AGGAGTCGTC GACGGCGGAA AACGGCGACT TGCGAAGAAT CGTGTCGGAC TGGTTTGGCG CGGTCGGAGG CGTCTTTGAC TCGCTGAACT CCGACGACGA GGCGATGGAC GCGAGAGACG AGAATCCCGC CATCGCGGCG CAAGATGATG CGTCGTCGTC GCGTGATGAC GACGAAGACT CTTCGGCGCC CGTCGAGGCG GGCGGATGGT TTCGATGGAA GAGTCCATGG CGGGTGAGTG GAGTGGGCGA GTCCATCGCT ACGGCGACGT CGTCGTGGAT GCCGAACAAG AGTGTAGACG GTGATGGAAG CTTCGTAGTG AACACCGACG CAATCGGAGT CTCAACAGCG AGCGATGGAG CGTCATCGTC TTCAAGCGAT CGAGGAGTCG AGCCGACGAA CTTGCTCGAG CGCGTGCCGT TGCCGCCGTG GATTCGGCAA CGCGTGGACG GTTCGAGTGA TGGTGATTCA TCCTCAGACG ACGAGGAAGA TGATGAAATG GCGTACGACA CGACGATTTC GAACGAGAAC GTCGGTAGTG CGATTCGTGA TGCGAGCGAG ACGGCGGTGA GCGTGACGAC AGCGGCGGTG GAAAAGTTGA ATTCTGCGCT TCTCGGTATT GAAACCGTGG AGAAGAGCGC GGTGAAGGAT GGAAGGTTTG AAAACAACGC TTCTGCACTC AAAGCGTTGC AAGAGTCGCT ACGAGATGCA CGTGCTTCTG CAAAAGAAGC GCGCGAGGCG AGCCAAAGTC TCGAAGAAGC CGTGCGCGAA GTGAGTCGCG CGCAAGAAGC GCTTCGAGCA GAGGGCGATT TGAAAAAAGA AGAAGCGATC AAGGCTCTTG AGGCTGCCAA GGCGGCTGTC ATCCGACAAG CGACGAGCTC GAACGCGGCC GTCGCAGATG CGCTCCGGCA GGTGTCATAC GTCGCCACTC GAGTCGAGAA ACTGGGCTCA TGGACGCTCG AAGACGACGG CGAGGCGTCG TCACGCGCAC TCGACACTAG CAAGGCGCAA AAACGCGGGG TTCGTCGAGG CCTGATGGGC GCGATCGCTA GCGCGCGTGA GAGTGCTCAA GACGCGCTCG TAGCGTCAGA ACTCACGCAG ACGGTCTCAA GGCTGGCTTC ATTGGCCACG CAGCGCGGAG ACGGCAAATC GAAGAAGTCG GAAGAGCTGA GCGAAATGGT CAAGCCCGCG CTGCAACGAG TCGTCGCGAA CTCGCTCGTA CCTGTCGAGA GCGACATTGA GATTGGCAGT GCGAGATTGG CGTGTTCGCT CGCGGCTTGG GTGTACTACT TACCGCAAAT GCAACACGCG TTGCCTCGCA ACGGTTTGAA GTTGATCACG TCATCGTTGG ACGTCGAAGA AATCATTCCG TCCACTGAAT TTCGCGCGGG CAAATCGCTC TTGGAGGAAT CTGCGTGGCG TGCTGAACAG GCGGTTGAGG AAGCCGTAGT CGCGGCTCGA GTGGCTGCGA GTAAAGACGC CAAGCTGGAA ATCGCCGCAT CGAAAGCTGC AGCCGAGTCT 
GCCGAGCTCG CGGCGCAGGC TCTGAAGCTC GCGGCTGAAT TACGCTCTCA GGACGCCCAG GACGAATCGA CTAGACTCAA AGCCAGAAAG ACGGCGCGAG CGGCGAAAGC GCTCGCCGTC GAGGCGCAGA AGAAGTTGGA CGAAGCGAGT GCGATTGCCG AACGGAACAA GCGCAAGTTG AAAAAGTGGG CCATGCAGCG CGAGCTCGCG GATCTTCAGT CGCAAATGCA ACAAACGGTG ATCGAACAAC AGCGGAAAGT CGAAGCCGAG CGCGTGAGAC AAGAGCGAGC AGAAAATGCA TCCCTCCCTG TGAACTTTTG CGTCGCCGCA CAAGACGACA CCGCGACGCT GTGGGTCGTC GTCGAGGGCT CTACGAACTT TGCGAGCTGG CAAGCAAACT TGACCTTTCA ACCCGTGACG TTTGAAGATC CCGCTCTCGG TGTGGAAGTT CATAGAGGCG CTTACACGGC GGCGAAGACG ATGTACCGTC GAATCGAGAA AGCAGTCAAA GAGCACGTCG CAAAGCACGG CGCGCGAGCG CGAGTGCGAA TCACCGGACA TTCGATCGGT GGATCGATCG CGATGATCAT CGCGATGATG CTCCTCGTGC GAAACGGCGC ACCTCGCTAC GCCATAGCCG ACGTCTGGGC GTTCGGCGCG CCGTACGTCA TGACTGGCGG CGAAGCGCTC ATGACTCGAC TCGGATTACC TCGTTCGTTT ATTCGAATGA TCATGATGGG CGACGATGTC GTGCCGCGCT CATTCTCGTG CTATTATCCG CAGTGGGCGC GCCGAGTGTT AGACAACGCG CCTGGGCCGT TCAACGTCAA CACTTCTACG GCGAATTTTT TGGACGAGCA GATGTTTTAT ACGCCGATGG GTGACTTGTA CGTGCTGCAG GCGAACAACG GTTCTGAACA CCCGCTTCTG CCGCCGGGAC CCGGTCTTTA CATCCTCGAC GGCGACGGCG TGTACGAGAT GCTAGCGACG CGCGCGCGAT TGGGCGAAAA CGAAGACGGC GATGATGAGT CGTGGTTGAA TCGTCGCCCT TCGAGCAAGC ATTGGGACGA GGCGGCGGCG TACGACGTGG ACGGAGACTT CTCTTCGTCC GAGGATGAGA AAGCGAACAA ACTTCGAATG CATCAGCTCG CGTGTTTGAC GCAATCAGAC GCCGCCCTCA CGGCTTCGCT CATCATCAGT CACATCAAGC AAGACGAATT GCTCCCGAGC GAAGGCGGCA TTGCGGAGGT GTTGAATCAG CGAGGGCGAG ACGCCGCACA AAGAGTGTTG TTCAATAGCC CGCATCCTCT GAGTATTCTC AGCAAACCCG ACGCGTACGG CGACGCGGGC ATCATAAGTC GCCATCACAA TCCGTTTCAA TACGCGAAGA GTCTTTCTCT GTCGCGAAAA CGGAAGCCTG GCGTCGCCGA TCTGATTTCT AGCCTACCGA AAAAAAGCAT CGACGGCGCG CCAGCGAGTG GCGGTCCGAG ATGA
|
Protein sequence | MRERAVRATP RAGPSTSTRR LKSSTPSRAS IVRRARGKGD KDDASTPTSS TPDPEELKLA LARARTRLER SRKGLPALVE ENESEGFTSD EVVEESSTAE NGDLRRIVSD WFGAVGGVFD SLNSDDEAMD ARDENPAIAA QDDASSSRDD DEDSSAPVEA GGWFRWKSPW RVSGVGESIA TATSSWMPNK SVDGDGSFVV NTDAIGVSTA SDGASSSSSD RGVEPTNLLE RVPLPPWIRQ RVDGSSDGDS SSDDEEDDEM AYDTTISNEN VGSAIRDASE TAVSVTTAAV EKLNSALLGI ETVEKSAVKD GRFENNASAL KALQESLRDA RASAKEAREA SQSLEEAVRE VSRAQEALRA EGDLKKEEAI KALEAAKAAV IRQATSSNAA VADALRQVSY VATRVEKLGS WTLEDDGEAS SRALDTSKAQ KRGVRRGLMG AIASARESAQ DALVASELTQ TVSRLASLAT QRGDGKSKKS EELSEMVKPA LQRVVANSLV PVESDIEIGS ARLACSLAAW VYYLPQMQHA LPRNGLKLIT SSLDVEEIIP STEFRAGKSL LEESAWRAEQ AVEEAVVAAR VAASKDAKLE IAASKAAAES AELAAQALKL AAELRSQDAQ DESTRLKARK TARAAKALAV EAQKKLDEAS AIAERNKRKL KKWAMQRELA DLQSQMQQTV IEQQRKVEAE RVRQERAENA SLPVNFCVAA QDDTATLWVV VEGSTNFASW QANLTFQPVT FEDPALGVEV HRGAYTAAKT MYRRIEKAVK EHVAKHGARA RVRITGHSIG GSIAMIIAMM LLVRNGAPRY AIADVWAFGA PYVMTGGEAL MTRLGLPRSF IRMIMMGDDV VPRSFSCYYP QWARRVLDNA PGPFNVNTST ANFLDEQMFY TPMGDLYVLQ ANNGSEHPLL PPGPGLYILD GDGVYEMLAT RARLGENEDG DDESWLNRRP SSKHWDEAAA YDVDGDFSSS EDEKANKLRM HQLACLTQSD AALTASLIIS HIKQDELLPS EGGIAEVLNQ RGRDAAQRVL FNSPHPLSIL SKPDAYGDAG IISRHHNPFQ YAKSLSLSRK RKPGVADLIS SLPKKSIDGA PASGGPR
|
| |