Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_16936 |
Symbol | |
ID | 5004245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 778 |
End bp | 3918 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | |
GC content | 56% |
IMG OID | 640419666 |
Product | predicted protein |
Protein accession | XP_001420051 |
Protein GI | 145351365 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGAAG ACATCCTGCG CGCGCACATC GCCTCGTTTT TTCAAAGTCA GGATGCGAGC GAACGTGCGA ACGCGGAAGC GGCGCTTTCG AGCTTCGGCA AGAGCGATGG ATCTTGGAGC GTGCTATTGC GCGTGTTAGA GCGAGACGAT GCGACGGCGG TCGAAACACT GTTCTGCGCG CGCACCCTGC ACGTGCTTTT GCGTCGCTGC GTCGCAAAGG AGGAGCGGAC GCAGGCGTCG CACGCGGCGT TCACGGAACG CGATTGGATC GATCTTCGCT CGCGCGTGTT GAAGCTGACG ATGCTCTTTG CCGTGAATTC CTCGTCGTTC GCGCACGACG AGTCGAACGC GTCGCGTGCG GTCGACTTGA GAAGCACGCT GACGCAACTC GCGCTAGCCA CGTCGGCGTT GGCGTGTAAA ATGCCGACAT GGGATCCCAC AGCGGTCGTG CGAGACGTCA TCAAGGTGTT TCAGGAAGAC GCTCGCGTGT CGAATGAAGC CAAGTTGTTG TGTTTGTGCA CATTTTTGGC GTTCGTGCCG CAGGAAGCGA GTTCCCGAGA GTTGTCCATA CATCCGGCGC GCCGCGAGCA AGTATTGACT GGTTTGCGTA GCACCGCGAA CGACGTCATG GACTTGCTCC AGCAGCTCGC GACGTCAGCC AGTGGCGACA CGCTGTTACA CAAGTACATA CTCGATGCTC TCGCGGCGTG GGCGGACATC GCAAACGTCA CACCGAGATT TCCTCGCGTC ATACTTGAAG GCGCGCTACA CATCGTGTGC TCGGAAGATC ACCACGCAAA CATCAAACAA AGCGCCGCGA GCGCGGCGTG TGCGTCACTG GTGCAGTGCG TTTGGACGAG TGACACTGAG CTTCGTGCGT TGCTTGCGAC GAGTCTAGCA AAGTTGCGCG CTGAAGTCGT CAAAGCTGAG AGATCGGAGG AGAGTCGCGC GCTGATCGTG AACGTACTTT CGAGCGTAGC GATGAAGGCT TTGCGAGACC AGAAAGACGC GACCAAAAGT CCATTTGCGA CAGGACCAGA TGCGGCTGGC GATCGCACGT ATGTCAAGTA CGCCGAATTC AAAAGTTTGC AAAGACAACA AAAGAAGACG CAGCGATCGG AGCAGAAGCA GAAGACAAAT ATCGCCGTGG ATATCGACAC AGAAGTGTTG CTTTTCGCAC TCGATGGCTT ATCCGAGGCG CTTTCTGTCG GTGCCTCCAT GGCGTCGGCG CTGGAACCTT GGGGTAAGCT GGCGAAATCA TTCACGCCGG ATTCGTTTGT GGAGTTGCTT CGCCCGGTGG CGGAGCGATG CGTTCACGCC GCAGTGCTGT ACGTTCAACT CTTGCCCAAG CACGATCTGG ACGACGACCA AGTGAAGGAA GAAATTTCTG ATTGTCTTCG CGACGTCATT TCAGCCGTGC CGATTGAAGA AATACTCGGC GACTTCAATC AGCGCCTCTG CGCGGAGATG TCCGCCGCAC AGAGCGGTGG ATGGAGAACG CTAAACGCGC GTCTGTACGT ATTGCTCTCA CTAGCGAAAT CCTTCCGAGC TGAAGCTAAT CAGTCGTCTT TTGCGATCTT GATTGAAAAT TTGTGCACTT TATCGACGAG TGAAGTTGTT CCGAAAGCGA CTTTGGAATC CACTTGTTGG GTTCTGGCGG GTGTCGCCAA GTGCATTTCG CAGCTTGAAG ACAACATTCT TCTTGGCGTT TCGCACGCGC TGATTCGTTC GATGAGCCAT TCAGAATTTG TTGTCGCACG AGGTGCCGCT GTGGCGATGA TGAAGCTCTC TGAATTTGCA GCTTCGCGAC TTGGCGCCAC GGACGTACCG TCTCTTTTAG CCGAGCTTCA CGTTCGCGGT GGGCCGACGC CGTCGCCGAC TTTGCGTCTG GGTCAAGAAC ACGAATCAAC CGTTTTGCTT CGCGCTCTTA CGTTCTATGT GAAGTGCGAG TGTCGAGAGC AGACAGAAAG TGCTTGCGCG TCGCTCGCCG AGCCCGTCAT CGAGGCGATG AATGTTTCCC TTCACCGCGG CAGCTCGGAA GAATATGTTC GTCGTTTGGT TGATTTGGAT ATCGTGCTTC GAGCGATGAA GAGCGCGTAT GAACACATTC AATCGCCCAG TGAGGTGCTC GCCGGGTTGG CAACGCGCGC CGCGATTGCG GTTGAGCAGA CGAGCTTGCG AATTGTCGAT CATCGAATGG TCGAGGAGCG CTTTAAAGTC GCATGGGCGA TGAAGGCTCT CGTGGAGCTG GCGCGCTTCG TCGATGGGCT CTTGGGAGCT GTCGTTCGAA TTTCAGTCGA GGCGTACATG CGAGCACCGG GTCTCGGCGC GTGTTACCTG GATGCGTTGT CTGTTATGCT GGAGTTCTAT GGCGATAGCC GATGCGGAAT TGAGATTGGA GGGACGAAAT TTCAATCCGT CGGTCACGTT GTCGTCGAGC TCTTGGCGAC CGTCTTACCT GCATCTCTCG AAGATTGCGA AGGGTGGACG AGCGCGTTCA CGCTCGCGCG CGCGACACTG CGCACTGCGT GTGTCGCAAT AGTTCCGCAT CTTCGTATGA TGGTCGAAGT CAGTCAGGCG TCGCTGCGCG GAGTCTCCGA CGAACCCGCG GCGGCGGCTT TGTTATTCGC GACCGACTTG CTTCGAGCGC CGGTGATGTT GAGTGCCGAG TTGGCGTCCA CTGCGAAGAA TGCTTCTGCG AGCGCCATGC ACGCCGGTCT CGGGCGTCTA GCGGCGGAGA TCTCGGGTCA AAGTAACAAA AAGCGAGGGA CGTGCGAGTG GAATCGACGT TCAGCCAAGC CGGTCGCGGA CGCCGTCGCC GCGGCGATGG AATCTCACAG CGTCGTAATC GTGCGCAAAA TCCTCGAAGC AGCAAACGGC GAGATGCCAC CGAGTATGAT ATCCGATATT TCTGCGGCTT TGCACCTCGT TTGGAGCACG TATGGCACCG AGAGATTTCA AGGCGTCATG TTGGCAGCGC TCGGCGGCGA AGACGACGCC TTCCCGAAGC CCAAAACAAA GCTATCCGAC AAGCGCGAGT GGGTCGCGTT TTTGACGAAC GAGACCTGCG CAAACGATTG TCGAGTATTT AAACGATTCT TAAAATCGTT CTTGGGAGGG AAAAAAGTAG GCAAAAACTA A
|
Protein sequence | MDEDILRAHI ASFFQSQDAS ERANAEAALS SFGKSDGSWS VLLRVLERDD ATAVETLFCA RTLHVLLRRC VAKEERTQAS HAAFTERDWI DLRSRVLKLT MLFAVNSSSF AHDESNASRA VDLRSTLTQL ALATSALACK MPTWDPTAVV RDVIKVFQED ARVSNEAKLL CLCTFLAFVP QEASSRELSI HPARREQVLT GLRSTANDVM DLLQQLATSA SGDTLLHKYI LDALAAWADI ANVTPRFPRV ILEGALHIVC SEDHHANIKQ SAASAACASL VQCVWTSDTE LRALLATSLA KLRAEVVKAE RSEESRALIV NVLSSVAMKA LRDQKDATKS PFATGPDAAG DRTYVKYAEF KSLQRQQKKT QRSEQKQKTN IAVDIDTEVL LFALDGLSEA LSVGASMASA LEPWGKLAKS FTPDSFVELL RPVAERCVHA AVLYVQLLPK HDLDDDQVKE EISDCLRDVI SAVPIEEILG DFNQRLCAEM SAAQSGGWRT LNARLYVLLS LAKSFRAEAN QSSFAILIEN LCTLSTSEVV PKATLESTCW VLAGVAKCIS QLEDNILLGV SHALIRSMSH SEFVVARGAA VAMMKLSEFA ASRLGATDVP SLLAELHVRG GPTPSPTLRL GQEHESTVLL RALTFYVKCE CREQTESACA SLAEPVIEAM NVSLHRGSSE EYVRRLVDLD IVLRAMKSAY EHIQSPSEVL AGLATRAAIA VEQTSLRIVD HRMVEERFKV AWAMKALVEL ARFVDGLLGA VVRISVEAYM RAPGLGACYL DALSVMLEFY GDSRCGIEIG GTKFQSVGHV VVELLATVLP ASLEDCEGWT SAFTLARATL RTACVAIVPH LRMMVEVSQA SLRGVSDEPA AAALLFATDL LRAPVMLSAE LASTAKNASA SAMHAGLGRL AAEISGQSNK KRGTCEWNRR SAKPVADAVA AAMESHSVVI VRKILEAANG EMPPSMISDI SAALHLVWST YGTERFQGVM LAALGGEDDA FPKPKTKLSD KREWVAFLTN ETCANDCRVF KRFLKSFLGG KKVGKN
|
| |