Gene Information |
Locus tag | OSTLU_34722 |
Symbol | SDG3518 |
ID | 5003816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 460781 |
End bp | 462355 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | |
GC content | 65% |
IMG OID | 640419237 |
Product | predicted protein |
Protein accession | XP_001419603 |
Protein GI | 145350419 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0720158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0311419 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGCG CGACGGCGCG CGCGTCGACG CGACGCGGCG CGCGCGAGGG ACGCGCGAGG GACGGGGAGG GAAACCTCGC GCGCGAAGAT TATCGGCGTC GTCGGGGGCG CGGACGTCGG CGTCGCGCGG GAACGATCGG CGGCGCGGCG TCGACGGAGG AGACGACGTC GACGCGCGGG CGCGCGGAGA GCGCGCGATA CGACGACGCG GAGATACCGC GAGGCGTGGG GAGCGCGACG AAGGCGGAGC TCGCGCGGTG GCTGGAGGGG CGACGCCTGC CGGGGCAAAA GATGGCGCTG GAGGTGAACC TGGCGGAGGG ACGAGGGTTG GTGGCGACGG AGGAGATCAA GCGGGGAGAG GCGTTGCTCG GGGTGCCCAG GACGACGCTG ATCACGGTGG AGCGAGCGAT CGCGGAGGCG AAGTTGGGGC CGAAGCACGC CGAGCTGCAG GAGTGGAGCG TGCTGGCGAC GTTTTTGGCG CAACAGGCGC TGGCGTTGGA GAGCGGGACC GCGGGGACGT TCGGGGAGTA CATCAGGGCG CTGCCTCGAC GCACGGGGAG CGTGTTAGAT TGGCCGGAAG ATGAGGTGGA TAAGCTTTTG AAGGGGTCGC CGTCGCGCTT GGCGGCGGCG GAGCGACAGG ATAGCGTCAA CGCGGCGATT GATGAGATTC GCTCGTACTT TCCCGAAATC ACGGTCGGAG CGCTTCGATG GGCGTTCGAT ATTCTTTTCA GTCGTTTGAT TCGTTTGGAC GCCATGGGGG GCGAGCTCGC GCTCGTGCCG TGGGCGGATA TGTTGAACCA CAAGCCGGGG TGCGCGGCGT TCATCGACTT GAACGGCGAC GCCGTCAACC TCACCACCGA TCGATCGTAC GTCAAGGGCG AGCAAGTGTG GGCGTCCTAC GGTCAACGAC CATCGAGTGA GCTTTTGATC TCGTACGGTT TCGCCCCAGA GGTTGGTGAA AACCCAGACG ACGAATACGC CCTCACGCTC GGCGTCGACG TCAACGACCC GCTCGCCGAC GCCAAGGCGC AAGTGCTCAG AGATATGGGC CTGAGTCCGG TGGAAACGTT CCCGCTTCGC TTGAACGGGT ACCCGCGTCA GTTGTTGCAG TACGCGTCCT TCATTCTTTG TAACCCGGAG AAGCCGAGCG AGCTTAAGGG TTTGGCGCAG TCGGCGTTCA CGGGGAGCGC GAACATCGGG CAGTCGATTT TCGATTCGGT GCGCGGCTTG ACTAATGGCA AGGCGCGCGG GAAGCAAGGG GTGATTTTGG GTGGCGTCGC GGGTGAGATA GCGGTTCGAG AAATGCTCGC GGACTTGTGC GCCGAGGCGT TGAGCGCGTA CCCGAACACG CTCGAAAAAG ACAAGGGTCT GGCTCAAGGG CGCATGCCCG ATTTCCCCGG CGCCGACGCG TGGACGGGCG TGGCGCCGGA CGCGATCCGG GCGACGCAGC GTTCGGTCTC CGCGGCGCGT GTGCGCGTGT CTGAGCGACG CATCTTGGCC AAGACCGATA GTGAAGTGCG TTTGCAGTTG CGCAAATTGA AACAAAAGTC GTTGATGGAC GACTTCAAGC AGTAG
|
Protein sequence | MARATARAST RRGAREGRAR DGEGNLARED YRRRRGRGRR RRAGTIGGAA STEETTSTRG RAESARYDDA EIPRGVGSAT KAELARWLEG RRLPGQKMAL EVNLAEGRGL VATEEIKRGE ALLGVPRTTL ITVERAIAEA KLGPKHAELQ EWSVLATFLA QQALALESGT AGTFGEYIRA LPRRTGSVLD WPEDEVDKLL KGSPSRLAAA ERQDSVNAAI DEIRSYFPEI TVGALRWAFD ILFSRLIRLD AMGGELALVP WADMLNHKPG CAAFIDLNGD AVNLTTDRSY VKGEQVWASY GQRPSSELLI SYGFAPEVGE NPDDEYALTL GVDVNDPLAD AKAQVLRDMG LSPVETFPLR LNGYPRQLLQ YASFILCNPE KPSELKGLAQ SAFTGSANIG QSIFDSVRGL TNGKARGKQG VILGGVAGEI AVREMLADLC AEALSAYPNT LEKDKGLAQG RMPDFPGADA WTGVAPDAIR ATQRSVSAAR VRVSERRILA KTDSEVRLQL RKLKQKSLMD DFKQ
|
| |
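The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the spaces between 10-base blocks."""
    bases = sequence.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values copied from the Gene Information fields of the record.
length_bp = gene_length(460781, 462355)
print(length_bp)                  # 1575, matches "Gene Length"
print(protein_length(length_bp))  # 524, matches "Protein Length"

# Only the first 10-base block of the gene sequence is used here;
# pass the full sequence to compare against the reported GC content.
print(gc_fraction("ATGGCGCGCG"))  # 0.8
```

Under these assumptions the reported Gene Length (1575 bp) and Protein Length (524 aa) are consistent with the Start bp and End bp values.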