Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_17948 |
Symbol | SDG3506a |
ID | 5005051 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | - |
Start bp | 516947 |
End bp | 518632 |
Gene Length | 1686 bp |
Protein Length | 503 aa |
Translation table | |
GC content | 58% |
IMG OID | 640420472 |
Product | predicted protein |
Protein accession | XP_001421172 |
Protein GI | 145353759 |
COG category | [R] General function prediction only |
COG ID | [COG2940] Proteins containing SET domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGA CAAAGTACGA ACGCGAAGTG GTTTGGGTCA AGGTGCGACG ACAGCGCGCG CTCGCACGAC CGCGCGCGAG TGAAAACGGT GATTTCCACG AGTTCAAAAA TCAACGACGG GCGAGACCGA CGTTGAATCG GTGCTGAATC GATGCGACGC GTAGAGGCGA AAGACTGACC GAGGGACGAC GACCGACTCT CGACGCGTCG GCGCAGTACG GGAAAGGGGA GCGGTTTTGG CCCGCGCACG TGGTGGGACG AGACGCGGCG CGCGCGCGCG TGCCGGGCGC GGCGGCGGTC TTTGAAAAGC ATCCAGGCGC GCGCGCGTTT CAGTACTTTG GCTACGGCGG GTATTACCAG CTCGAGAAAC CGAACGCGCC GGCTGCGATA GCGTGGGCGG AAGGGTTAGC GCGACGGTTG GATGAAAAAG GGGTGGCCAA GGCGCACAAG ACGGCTGTGA CAAACGCGCG GGCGTACGTA GAGCGAGGCG ATTTGACGAG CGAAGTCGCG GCGGGCGCGC TTTGGTGGAG CATGCCGCTT CCAGCACCGC GCGAAGAGCC GGCGAAGAGA ACTCACGATT CGGCGGGGAC GTTGTCTGAG AAGAATAAGA AGCCGCGACG CGGCGACGCG GCCAAAGATA CATTCGACGA GCGTGAGGAA GTCAAAGACG CCGGCGCCGA CGAGTTTGAG TTTCCTCGCA TGTCCAAAAC GTTCGTCGGC GGCGAACGCA AGGAATACAC GCCCACGTTG AGCGTGCTTT CGCTCGGAAA ACCACCTCCT TTTGAGCGCA TCCATCGCAG CGTTTTCGTC AGCAGACCGC CGCCGGTGAA ACTGCACAAG TCTGAAACCG CGGTGTGCGA CTGCCATCCG CCGCCGTCGC GCGGCGACAG CGAGACGATT CGCGACGGAT GCGGGCAAGA GTGCTTGAAT AGAAAATTGC GATTTAGTTG CGACAGCCGA ACGTGTCCGT GTGGGGACGC GTGCAGTAAT CGCCCGTTGA GTCAGTTACC GGCGCCAAAG ACGAAGATTA TTCGCACAGA AAACAGAGGT TGGGGATTGA CTTTGCAAGA GCCCGTGCGC GCGGGAACCT TCATCGTTGA GTACGCGGGT GAGATTTTAG ACGAGCACGA ATGCGCCGAA CGGCTTTGGT ACGACAAGCA GTCGGGGGAA GAGAACTTTT ACTTGATGGA AATATCCGCA AACTACGTCA TCGACGCCAA GTTTAAGGGC TCGATCGCGA GATTTATCAA TAGCAGCTGT CACCCAAACT GCGAAACGCA GCGGTGGGTC GACGCTTCGA CGAACGAGAC GAGAGTCGGT ATCTTTGCCA CCGAAGACAT CGCGAGTGGG ACCGAGCTGA CGTACGATTA CAACTTTGCG CACTTTGGCG ATGAAAAGGG GACGTCGTTC GTGTGCATGT GTGGGCATCC CAAGTGTCGA GGCACGCTCG ACGCGGCGAA GACGTCGAAA AAGAATTTGC ATCGCCGACT TCGCGTGGAA ATCATGGTGA ATGGGAAAGT CGTTAAGTCG CGCAAGAAGA AACAAAAAGT CAAGGCCACT GTGGTGGACT ATGATGCCGC GAAGAATAGA TACAAAGTAC AAGTCGAAGG TGACGAGAAG GAAACCTTCG CGTGGGTGCG TCTCGATGGC GAAGGCGCAG CGAAACACTC GTGGCTGAGC AAATAG
|
Protein sequence | MTTTKYEREV VWVKYGKGER FWPAHVVGRD AARARVPGAA AVFEKHPGAR AFQYFGYGGY YQLEKPNAPA AIAWAEGLAR RLDEKGVAKA HKTAVTNARA YVERGDLTSE VAAGALWWSM PLPAPREEPA KRTHDSAGTL SEKNKKPRRG DAAKDTFDER EEVKDAGADE FEFPRMSKTF VGGERKEYTP TLSVLSLGKP PPFERIHRSV FVSRPPPVKL HKSETAVCDC HPPPSRGDSE TIRDGCGQEC LNRKLRFSCD SRTCPCGDAC SNRPLSQLPA PKTKIIRTEN RGWGLTLQEP VRAGTFIVEY AGEILDEHEC AERLWYDKQS GEENFYLMEI SANYVIDAKF KGSIARFINS SCHPNCETQR WVDASTNETR VGIFATEDIA SGTELTYDYN FAHFGDEKGT SFVCMCGHPK CRGTLDAAKT SKKNLHRRLR VEIMVNGKVV KSRKKKQKVK ATVVDYDAAK NRYKVQVEGD EKETFAWVRL DGEGAAKHSW LSK
|
| |