Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_35547 |
Symbol | |
ID | 5002916 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | - |
Start bp | 57349 |
End bp | 60393 |
Gene Length | 3045 bp |
Protein Length | 974 aa |
Translation table | |
GC content | 58% |
IMG OID | 640418337 |
Product | predicted protein |
Protein accession | XP_001418820 |
Protein GI | 145348777 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0480] Translation elongation factors (GTPases) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.531902 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCGA ACCTGTACGA TGAGTTCGGA AACTACGTCG GCCCCGAGAT TGGATCGTCG GACGACGACG ACGACGTCGC GCGTCGCGAC GAGAGTCGGG ACGATCGCGC GGCGGCGGCG ACGAACGCCG CGCGCGAGGA CGCCCACGAC GCGTCCGACG CGAGCGAGGA CGACGCGATG GACGATGATG ACTTGAACGC GTCCGCGATC ACGCTCGCGG AGGATAAAAA GTACTACCCG ACCGCCGAGG AGGTGTACGG GGAGGGCACG GAGACGCTCG TGGAGAACGA AGACGCGCAG GCGCTCGAGA CGCCCATCGT GGCGCCGGTA AAGGTGAAGA ACATCGAAGC CGCGGGACAG AGATACGTGG GAGGAAGCGT GGGGGCGCAA GTGCAGTCGC AAGACGCGGA GGTCGCCATG AAGGTGAGCG AGGAGTTCCT GGTGGGCTTG CACGGGGAGT CGCGAAGAAT GGGACGCAAC GTGTGCGTGG CGGGACATTT GCATCACGGG AAAACGACGC TGTTTGACAT GCTGCTCGAG TGTTCGCACG ACGTGAATTA TGACTGGCTG GCGAACGATA AGCAGCTGAG GTACACGGAT ACGAGGATGG ATGAGCAGGC GAGAGGGATT TCGCTCAAGT CGACGCCGAT GACGCTGCCG TTGCAGACGA GTCGAGGGAA GACGATGGTG TTTAACGTCA TGGATACGCC AGGACACGTG AATTTTAGCG ATGAAGTGAC GGCATCGATG CGGTTGGCGG ATGGAGTGTT GCTTGTCGTC GATGCGTGCG AAGGTGTTAT GACTTCGACG ACGCGACAAA TCAAGCACGC GGCGCGAGAC GGTTTGCCCG TGTGCGTCTT CATATCGAAG ATTGATCGCT TGATTGTCGA GCTCAAGCTT CCGCCCGCGG ATGCCTATCA CAAGCTCCGT CACACGATCG AAGAAATCAA CGGCTTGATC GAGACGTTTT ACACACCAGA CGCCGACGGA GCTCTGCCGA CGGTGAGTCC GGAAAACGGC AAGGTTTGCT TTGGGAGCGC TTTGTACGGA TTTAGTTTCA CCTTAGAATC GTTCGCCAAA CTCTACGTCG ATGTGAACGG CGTGTTAGTC GATCATAAGG AGTTCGCAAA GAGGATGTGG GGCGATGTGT ATTATCACGG CGATACGCGG ATGTTCAAGA AGAAGCCACC GCCGGGCGGC GGCGAGCGCA CCTTCGTGCA GTTCATCCTC GAACCGTTGT ACAAAGTTTT CAGCCAAGTT GTCGGTGAAA CTATTGACAG CGTGTCTGAT AAGTTGAAGG AATTTGGCAT CAAACTGAAG CCAAAGGAGA CGAAAGCAAA CACGAAGCCG CTGCTCAAGA TGACGTGCCA GAAGATTTTC GGCGCCGCGT CTGGGCTGGC GGATATGCTC GCCGCACACA TCCCGACGGC GGAAGAAGGC GCGGCTATGA AGATTGAGCG CGCTTACAGC GGCCCGGTAA AGAACGGTGG CAAACTCGTC GATGCAATGC GGGCGTGTGA CCCGGACGCA CCAGCCGTGG TGATGGTTTC CAAACTCATC CCAAAAAGCG ATTGCTCGGC GTTCGACGCG CTCGGTCGCG TGATGTGCGG GACGCTTAGG AAGAACGATC GCGTTCGTGT TCTCGGCGAA AACTTCTCCC CAGATGACGA AGAAGACAGC GTGGTGAAGA ATGTGACGAA TATGTGGATA TATGAAGCGC GTTATCGTAT TCCCATCAAG GAAGCGCGCG CGGGCGCTTG GGTCCTCATC GAAGGCATAG ACCAAAGCAT CACCACAACT GCCACGTTGG TACCGGAGAA GATGCCGAAA GGGTACGACG ACGACTTGTA CGCTTTCAAA CCGTTAGAGT TCGATAACAA ATCGGTGATG AAGATCGCTG CGGAGCCTCT CAACCCGAGC GACTTGCCGA AAATGGTTGA AGGTTTACGC AAGATTACCA AATCCTACCC CGCCTGCGTG ACCAAGGTTG AGGAAAGCGG CGAACACACA ATCATGGGCA CGGGGGAGTT GTTTTTAGAT AGCGTAATGA AGGATCTCCG AGAAATGTAC AGCGAAATCG AGGTCAAGGT GAGTGATCCT GTGGTTTGCT TTAACGAAAC CGTCGTGGAG ACGAGCTCGC TCAAGTGCTA CGCGGAGACG CCGAACAAAA AGAACAAACT CACGATGATC GCCGAGCCCC TCGACAAGGG ACTTGCGCGA GATATCGAGA CCGGCAAAGT CAATTTGAGT GCGCCGAAGA AACAGGTGAG CGATTTCTTC AAGTCAGAGT ACGAGTGGGA CGCGCTCGCT GCGAAGAGTG TCTGGGCGTT TGGCCCCGAC GCGGCCGGCC CCAACGCGTT GTTGGACGAC ACTTTGCCGT CCGAAGTCGA CAAAGGATTG CTTGCGGCGA TTCGTGACTC CGTTGTACAA GGATTCCAGT GGGGCACGCG CGAGGGTCCG CTGTGCGATG AGCCTATCCG CGAGGTCAAG TTCAAAATTT TAGACGCCGT CGTCGCCGAC GCCCCGCTTC AGCGAGGGGG CGGGCAAATC ATCCCCACCG CGCGTCGCTG CGCGTACAGC GCCTTCCTCA TGGCCACTCC GCGATTGATG GAGCCCATTT ACGAAGTTGA GATCCAATCT CCCGCCGATT GCATGAGCGC CATCTACACC GTGCTCAGTA AACGTCGCGG ACACGTCGTC AGCGATGCCC CAAAACCTGG CACCCCCGTG TACACCGTCA AGGCGCTCAT CCCCGCCATC GAAACCTTTG GTTTCGAGAC CGACCTGCGC TACCACACCC AAGGCCAAGC CTTCGGTCAG TCCTACTTCG ACCACTGGGC CGTCGTTCCG GGCGATCCTC TCGACAAGAC CGTCGTGTTG CGTCCCCTCG AACCCGCCCC CGTCCCTCAC CTCGCTCGCG AGTTCATGGT CAAGACGCGT CGTCGCAAGG GCATGTCCGA AGACGTCACC GTGAGTAAAT TCTTCGACGA CGATTTGCTC ATCGAGCTCG CGCAGGCCGA CACCGAGCTC GCGGGTCTAT TTTAG
|
Protein sequence | MDANLYDEFG NYVGPEIGSS DDDDDDDAMD DDDLNASAIT LAEDKKYYPT AEEVYGEGTE TLVENEDAQA LETPIVAPVK RYVGGSVGAQ VQSQDAEVAM KVSEEFLVGL HGESRRMGRN VCVAGHLHHG KTTLFDMLLE CSHDVNYDWL ANDKQLRYTD TRMDEQARGI SLKSTPMTLP LQTSRGKTMV FNVMDTPGHV NFSDEVTASM RLADGVLLVV DACEGVMTST TRQIKHAARD GLPVCVFISK IDRLIVELKL PPADAYHKLR HTIEEINGLI ETFYTPDADG ALPTVSPENG KVCFGSALYG FSFTLESFAK LYVDVNGVLV DHKEFAKRMW GDVYYHGDTR MFKKKPPPGG GERTFVQFIL EPLYKVFSQV VGETIDSVSD KLKEFGIKLK PKETKANTKP LLKMTCQKIF GAASGLADML AAHIPTAEEG AAMKIERAYS GPVKNGGKLV DAMRACDPDA PAVVMVSKLI PKSDCSAFDA LGRVMCGTLR KNDRVRVLGE NFSPDDEEDS VVKNVTNMWI YEARYRIPIK EARAGAWVLI EGIDQSITTT ATLVPEKMPK GYDDDLYAFK PLEFDNKSVM KIAAEPLNPS DLPKMVEGLR KITKSYPACV TKVEESGEHT IMGTGELFLD SVMKDLREMY SEIEVKVSDP VVCFNETVVE TSSLKCYAET PNKKNKLTMI AEPLDKGLAR DIETGKVNLS APKKQVSDFF KSEYEWDALA AKSVWAFGPD AAGPNALLDD TLPSEVDKGL LAAIRDSVVQ GFQWGTREGP LCDEPIREVK FKILDAVVAD APLQRGGGQI IPTARRCAYS AFLMATPRLM EPIYEVEIQS PADCMSAIYT VLSKRRGHVV SDAPKPGTPV YTVKALIPAI ETFGFETDLR YHTQGQAFGQ SYFDHWAVVP GDPLDKTVVL RPLEPAPVPH LAREFMVKTR RRKGMSEDVT VSKFFDDDLL IELAQADTEL AGLF
|
| |