Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_16531 |
Symbol | |
ID | 5003165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 520172 |
End bp | 522238 |
Gene Length | 2067 bp |
Protein Length | 688 aa |
Translation table | |
GC content | 62% |
IMG OID | 640418586 |
Product | predicted protein |
Protein accession | XP_001419408 |
Protein GI | 145349990 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00789069 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.505528 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTCGA TGTTCGACGA CGCGCCGGCG ACGGCGCGCG GCGGCGGCGC GCGACGACGC GACGGCGACG ACGCGACGGC GAGCGACGAC GCGACGGCGA GCGACGGCGA AGGCGACGCG CTGAAAATCA ACGAAAAGTA CGCCGCGCGG TTCGAACACA ACGAACGAAG GAAGGAGACG CACCGATTGC AGGCGAAGCT GGAGCGGGAG TACGCGCGGG GCGCGCTGCG CGCGCGCGGC GACGGGAAGA GCGAGAGCGA GACGAGCGGG GAATCGAGCG ATGGGACGTC GAGCGATGAG GAAGAGACGC TGGAGCGAGC GCTGGACGGG GCGTTCGCGG AGGCGCTGAC GAAGATTCGG AGGAAGGATC CGAGCATTTA CGACGCGGAG ACGAAGCTGT TCGACGAGGC GAGCGAGGAC GAGGACGACG AGGACGGGGG GGGGAAGAAG GAGGCGACGT CGACGAAGAG GACGAAGAAG AAGACGAGGG CGACGTTGCG CGAAGTCGTG GCGACGCAGT TGCTGGAGGG TGGGGCCACC GCGCTCGAGG AGGCGGAGGC GGAGGCGGAG GCGGCGCGCG CGAACGACGA ACCGTCGTAC GTCGAGGAGC AAGCGGCGCT GAAGCGCGCG TTCAAGGACG CGGCGGCGGG CGGTGACGAC GACGAAGACG ATGAAGACGA GAGTGGTCTC GTGGTGAAAC GGCGCGCGGC GGCGATGACG GCGGCGACGG AAAAGTTGTC TGAATACTTT GACGCGAGAC GCGGCGACGC GAACGATTTG AGCGCCGAGG ATAAATTCTT GCGAGACTAC TTGCTGGAGA AACAGTGGAT GAAGGAAGAC TCGAAGAAAG ATTCCGCGGC GGTGCGGTTT CAAACATTAG GGCCGCCATC GAGCGACGAG GACGACGACG CGGGCGCCGA CGACGAGTCG AGCGATTCCG AACTCCTCGA TCGCGCCGAG GCGTTCGAGC ACAAGTACAA CTTTCGATTC GAAGAGCCGG GCGCGGATCG GCTCGTGTCG CACTCGCGTC ACATCGAAGG CTTAGTTCGT CGCGAGGATT CGAAGCGCAA GGACAAGCGA AAGAAAGTGC GAGAGCGCAA AGAATCTGAG CGAGCCAAGC TCCTCGCCGA GGTGCGTCGT CTGAAGAATC TCAAGCGCGA AGAAATCGCA AACAAGATGC GTCAAATCTC CGCCGTCGGT GGGTTGAAGG GCGGCGGCGC GAAGGTGGCG GATTTAACTG AAGAATTCGA CCCCGAAGCG CACGACCGAG CGATGCGAGA GATGTACGGT GACGAGTATT ACGACGCCGC GGGAGAGGAC GGCGAAGACG AAGTCTTCGG CGAGCTGGAA AAGCCCGAGT TCGGGGACTT GGAGGAGGAG ATGAAGGAAC TTTTGAAGGG TGCAGGGAAA CCTGACGATG ACGATTTAGA TGATGACGAT CATTTCGACG ATGACGACGA ACCGGCGCCG GACGAAGAAG AAGAAGAAAA CAAATTTAGC AAGCGCGCCG CGAAGAAGTG GCGCAAGGAG CTCGAGGCGA AGATGGACGA GTACTACAAG CTGAACGCCG AAGATTTCAT CGGTGAAGCG CCGACGCGCT TCCCGTACAA GGAGGTGGCG CCGAAGATGT TCGGTCTGAC CACGCGCGAC GTGCTTCTGA TGGAAGACAA GCACCTGAAC CAAATCGTCG GCTTGAAGAA GCTCGCGCCG TATCGCGACG ACGCAAACGA CGCCGCCGTG GACGCCAACC AACGCGCGAG AGCGCGTCGC ATGGCGAAAG AGTTCTTAGA GAAAGCCAAA GACAAGAAGA ATCGTTCGTC GCGACGCAGA AAAGACAAAA CCAAGCGCGA GGACGACGAC GAAGCTTCGG ACGGTTCTGA CGACGACGCC AAGGCGCGCG CGCGGTCGTA CGCCGACTCG GCGTTCGGCA AAAAGCGTAA ATCCGAAGCG CCGCTTCGAA ACACCACCGC GGATGACGCA TCCACCGGCG TGGGTAAGAA CGCTCGGAAA AACGCGAAGA AGCGCGCGAA GCGCAAAGCC AAAGAAATAG CGAGCGCCGC CGTGTAA
|
Protein sequence | MRSMFDDAPA TARGGGARRR DGDDATASDD ATASDGEGDA LKINEKYAAR FEHNERRKET HRLQAKLERE YARGALRARG DGKSESETSG ESSDGTSSDE EETLERALDG AFAEALTKIR RKDPSIYDAE TKLFDEASED EDDEDGGGKK EATSTKRTKK KTRATLREVV ATQLLEGGAT ALEEAEAEAE AARANDEPSY VEEQAALKRA FKDAAAGGDD DEDDEDESGL VVKRRAAAMT AATEKLSEYF DARRGDANDL SAEDKFLRDY LLEKQWMKED SKKDSAAVRF QTLGPPSSDE DDDAGADDES SDSELLDRAE AFEHKYNFRF EEPGADRLVS HSRHIEGLVR REDSKRKDKR KKVRERKESE RAKLLAEVRR LKNLKREEIA NKMRQISAVG GLKGGGAKVA DLTEEFDPEA HDRAMREMYG DEYYDAAGED GEDEVFGELE KPEFGDLEEE MKELLKGAGK PDDDDLDDDD HFDDDDEPAP DEEEEENKFS KRAAKKWRKE LEAKMDEYYK LNAEDFIGEA PTRFPYKEVA PKMFGLTTRD VLLMEDKHLN QIVGLKKLAP YRDDANDAAV DANQRARARR MAKEFLEKAK DKKNRSSRRR KDKTKREDDD EASDGSDDDA KARARSYADS AFGKKRKSEA PLRNTTADDA STGVGKNARK NAKKRAKRKA KEIASAAV
|
| |