Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_50302 |
Symbol | |
ID | 5003300 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 67458 |
End bp | 69399 |
Gene Length | 1942 bp |
Protein Length | 471 aa |
Translation table | |
GC content | 57% |
IMG OID | 640418721 |
Product | predicted protein |
Protein accession | XP_001419274 |
Protein GI | 145349716 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1498] Protein implicated in ribosomal biogenesis, Nop56p homolog |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.444988 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGGCGATCAT GGTGCGACGC GACGCGACGC GAGACGCGAG ACGCGATGCG CGACGCGGCG GTGGGAGACG CGGCGCGCGA GACGGCGCGC GAGACGGCGA CGCGCGAGAC GGCGGTGAAA CGCGCGCGAG GACGATCGCG GCGTAAACCT CCGCGGAACG AAGACGCGAC GGGAGCGCGC GATGCGAGCG CGCGCGACGC GCGCGGGACG ATCGGAGACG CGCGGAGACT GACGGAGACG CGCGACGCGC GATCGGTGAC GCGAACAGGC GCAGTTCGTG CTGTTCGAGT CCTCGTCGGG GTACGGGTTG TTCGAGACGC TGGATTTGGA CGTCGTGGGA CAGGCGCTGG AGAAGGTGCA GGAGACGACG CAGAGCGCGG ATAAGTTCGG GAAGGTTGTG AAATTGCACG GGTTTAAACC GTTTACGTCG GCGGCGAACG CGCTGGAGCA GATTAATTGC GTGTCCGAGG GCGTGGCCTC GGAGGATTTG CAAAACTTTT TAGAGCAAAA CTTGCCAAAG TTGAAGGATT CTAAAAAGGC CAAGTTTCAG CTCGGCGTCG CGGACTCCAA GCTCGGGAAC TCGATCGTGG AGACGACGAA GATTCCGTGC GTGTGCAACG ATCACGTCGG TGAGATTATT CGGGGCATTC GCGCGTATTT CACAAAGTTT GTCAAGGGTT TCAAGGGTGG CGACTACGAA AAGGCGCAAC TCGGCTTGGC GCACTCGTAC TCTCGCGCCA AGGTCAAGTT CAACGTGAAC CGATCGGATA ACATGATCAT CCAGGCCATC GCGCTGATCG ACACGCTCGA TAAGGACATC AACACCTTCA TCATGCGCGT GCGCGAATGG TACGGTTGGC ACTTTCCCGA GCTCGTCAAA GTCGTCAACG ACAACTACAT GTACGCGAGA CTGGCGCTCG TGATCAAGGA CAAGGCGACG CTCACCGATG AGGCGATGCC GGCGTTGAAG GAAATCACCG GGGACGAGGA CAAGGCTAAA GAAGTGATCG AGGCAGCCAA AGCGTCGATG GGTCAAGACA TCTCACCGGT GGACATGATC AACATTGAGT CCTTCGCCAA GCGCGTCATT TCCCTCGCCG AGTACCGAAC CAGTCTTCAC AACTACTTGA ACAACAAGAT GAGTGTGGTA AGTATTTCGA AGCGCGCGAC GAACGCTCGT ACCGATCAAT CGCCAATGAT TCTCTCGCAG TTCGTGGGGT TGATGATTAT GATACACCAA TCACAGACCT GATCTGAATT ACCTTGTTGA AACAATTCAT GCACCACTCT GACTCGCGTC TGTGGGGAAC AACGTCGACG CGGAAATTTT TCATCAAAAC AACCCGAACG AAGACACTGA CCATTTTTAT CGTTTTATCG TACTGGCAGG TGGCCCCGAA CCTGGGCGCG TTGATCGGTG ACATTATCGC CGCGCGTTTG ATTTCGCACG CCGGTTCGCT CACGAACTTA GCCAAGTACC CGGCGTCGAC GGTGCAAATT CTCGGCGCCG AAAAAGCGCT CTTCCGCGCG CTGAAGACCA AGGGGAACAC GCCCAAGTAC GGTTTGATTT TCCACAGCAC GTTCATCGGC AAGGCGAACG CGCGAAACAA GGGGAGAATT TCTCGATACT TAGCCAACAA GTGTAGCATC GCAAGCAGAA TTGACTGCTT TAGCGACTTC CAAACCACCT TGTTCGGCGA AAAGTTGAAG GATCAAGTCG AAGAGCGATT GGCATTTTAC GACAAGGGCA CGGCACCGCG TAAGAATATC GCCATGATGC AAGAGGTGAT CGCCGAGATC GGCCCGCAGG CGGGCGGCGG CGGCAAGCGA AAGGCGACCG ACTCTGCGAC GGCGACTCCG AAGAAGTCCA AGAAGGAAAA GAAGGACAAG AAAGAAAAGT CCGCCAAGAA GGCGAAGGCC TAGAAGGCGA ATAATCGATC CG
|
Protein sequence | MAQFVLFESS SGYGLFETLD LDVVGQALEK VQETTQSADK FGKVVKLHGF KPFTSAANAL EQINCVSEGV ASEDLQNFLE QNLPKLKDSK KAKFQLGVAD SKLGNSIVET TKIPCVCNDH VGEIIRGIRA YFTKFVKGFK GGDYEKAQLG LAHSYSRAKV KFNVNRSDNM IIQAIALIDT LDKDINTFIM RVREWYGWHF PELVKVVNDN YMYARLALVI KDKATLTDEA MPALKEITGD EDKAKEVIEA AKASMGQDIS PVDMINIESF AKRVISLAEY RTSLHNYLNN KMSVVAPNLG ALIGDIIAAR LISHAGSLTN LAKYPASTVQ ILGAEKALFR ALKTKGNTPK YGLIFHSTFI GKANARNKGR ISRYLANKCS IASRIDCFSD FQTTLFGEKL KDQVEERLAF YDKGTAPRKN IAMMQEVIAE IGPQAGGGGK RKATDSATAT PKKSKKEKKD KKEKSAKKAK A
|
| |