Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31612 |
Symbol | |
ID | 5001696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 228498 |
End bp | 230386 |
Gene Length | 1889 bp |
Protein Length | 571 aa |
Translation table | |
GC content | 58% |
IMG OID | 640417117 |
Product | predicted protein |
Protein accession | XP_001417957 |
Protein GI | 145346978 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG5384] U3 small nucleolar ribonucleoprotein component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.776736 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00187106 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GAAGACGAGG GCGAAAGCGA CGAAGGCGGC GGCGAAGACG AAAGCGAAGA CGACGACGAA GGACGGGACG ATCGACGCGC GAAGAGCGGT AAGAAATCGA GCGGAAGTGA TCGCACCGCG CAGATCTTTC AAAAGCCTGG AATGTTTAGT TTGGACGAGA TGGAGAGCTT CATGGATCAA GGAGACGTCG AGGAGGAGAA GCGCCGTCGC GCGAACGAAG AAGTAGACGG CGGCTCGGCG GACGAAGAGG GCGAGTCAGA CGACGATATG CTCGACATAT ACGGAGACAT GAGCGACAGC GACGAAGATG GGGACGAAAA CGAGAACTCT GACGAGGAGG ATGACCTTGA TGACGCGTTG GCGTACACGG CGAAACTCGC CGGCGTGAGC GCGAGCGCGC GCAAGAAAAA ATCGACGAGG AAAGCCACCA AGGGCAAGAA GGCGCAAGAT TTAATGTTTG AAGACTTTTT CGGCAAGCGC CAGGGGCAGC CTCTCGGTGG GCGCAAGGGT GGCAAGCTCG GCGCTTCGAC AGACGCTGAA CTGAACGAGT TGAGCGACGA GGAGGAGGAC ATGTTCAACG AACTTGAGGA CGGCGAAGAC GACGACGAAG AGGATGACGA TTTGGATGGG GAGTTGGAGA CCGGTATCGC CGGTAGTCGT GGAATCGAAA ACCATGACGA TGACGCGGAC GAGTACGACG ACGAGGAAGA AGAGGACGAA GAAGACGAAC TCGTCGCTGG CGACGAAGAA GAGCTCGATG AGTTGGATAG AAAGCTGGAC GCGGATTTGG ACGCCGAGCT TGCTCGTGCG GAGGCGGAGG GCGAAGACGG CGATAGCGAC GCCATCGACG ACGACGAGAA GGAAGTGCCG CGGTCGGGTC CGAAGTCGGC GTTCCAGCGC CAACAAGAAG CGCTCGGGCG CCAAATCGAA AAGCTCGAAG CCGCGGCCAT CGGCGAAAAG TCTTGGCTTC TCAAGGGTGA AGCCGCAGCA AAAGAGCGGC CGATGAATAG CGCGTTGGAA ACTGATCTCG AGTTCGAGCA CGTCATGGCA CCTGCGCCGG TGATCAGCGC AGAGATAACC CAGAAGCTCG AGGAAATCAT CAAGCAGCGA ATCATCGAAG GCCGATTCGA CGACGTCGAG CGTGTCGAAC CCGTGGAGGA GCGCGAACGC AAGGAGCTCC CACAACTCGA CGACACCAAG TCTACCAAGG GTTTAGGTGA TATTTACGCC GATGAGTACA TGCGTCAAAA GGCTGGCGTC GCGCTCGGCG AGAAGGAAGA CCCCATGGTT GCCGAGGTGA AGAAGCTCTG GGCCACGCTT TCGTATCGCT TGGACGTGCT CTTGCAAACG GGCGAGGTGG AGGATCCGAA GGATCTCGAA AAGAAGATCG ATCGGGAACT CGCCGCGCGT GCGAAGGGGA ATATTGTGCC GTTGACGTTC GACGAGTCCA AACGTCTGGC GCCCGAGGAA GTCTTCGCGG GAGGCGAAGG CAAGGGCGGA CAGCGCGGCT CCGCCGCTGG CGCCGTCAAG GCGGACGACG AACTCACCAA GGAGGAACGC AAGGCTGGTC GTGCCAAGCG AAAACGGAAG TCCAAAGCCG CGCAAGAAGA GAAAGATCGC GTCAAGGCCA AACGAGACCG CGCACGCGAG GCCCAGCACA AGGCAGAAGA AGATGCTGGT TTCACGCGCA AGGCGCCGAA AGTTGCGATG CTCGCCGTCG GTTCGGCGGC TGGGAAATCC AAATCCGATT TCTCCAAATC GAGCAAGGTT TTCGGCATGC TCCAAGATGC CAAAGACGCA GACGCCGCGC GCGGCGGTGT GGCGAAGAAG AGCAAGTCAG ACTCCGCGAA GAACAAGCCA TCTCTCAAAC TTTGAGTATA GTATTGTAT
|
Protein sequence | MESFMDQGDV EEEKRRRANE EVDGGSADEE GESDDDMLDI YGDMSDSDED GDENENSDEE DDLDDALAYT AKLAGVSASA RKKKSTRKAT KGKKAQDLMF EDFFGKRQGQ PLGGRKGGKL GASTDAELNE LSDEEEDMFN ELEDGEDDDE EDDDLDGELE TGIAGSRGIE NHDDDADEYD DEEEEDEEDE LVAGDEEELD ELDRKLDADL DAELARAEAE GEDGDSDAID DDEKEVPRSG PKSAFQRQQE ALGRQIEKLE AAAIGEKSWL LKGEAAAKER PMNSALETDL EFEHVMAPAP VISAEITQKL EEIIKQRIIE GRFDDVERVE PVEERERKEL PQLDDTKSTK GLGDIYADEY MRQKAGVALG EKEDPMVAEV KKLWATLSYR LDVLLQTGEV EDPKDLEKKI DRELAARAKG NIVPLTFDES KRLAPEEVFA GGEGKGGQRG SAAGAVKADD ELTKEERKAG RAKRKRKSKA AQEEKDRVKA KRDRAREAQH KAEEDAGFTR KAPKVAMLAV GSAAGKSKSD FSKSSKVFGM LQDAKDADAA RGGVAKKSKS DSAKNKPSLK L
|
| |