Gene Information |
Locus tag | OSTLU_35087 |
Symbol | |
ID | 5003718 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 32131 |
End bp | 33708 |
Gene Length | 1578 bp |
Protein Length | 471 aa |
Translation table | |
GC content | 61% |
IMG OID | 640419139 |
Product | predicted protein |
Protein accession | XP_001419682 |
Protein GI | 145350584 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
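The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it agrees with the reported 1578 bp) and that the coding sequence ends with one stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bp(protein_aa: int) -> int:
    """Bases needed to encode a protein plus one stop codon (assumed)."""
    return 3 * (protein_aa + 1)


length = gene_length(32131, 33708)   # Start bp / End bp from the record
coding = coding_bp(471)              # Protein Length from the record

print(length)            # 1578
print(coding)            # 1416
print(length - coding)   # 162
```

Under these assumptions the gene span is 162 bp longer than the protein requires. That is consistent with the record marking the gene as spliced, but the record itself does not list intron coordinates.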
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTTAAGA CGAGCGCGAA ACAGCGACGG CGCGTTCGCG GCGTCGTGCG CGGCGCGGGG CGCGGGAGAG TGATTGGATT TTATCACCCG AGCGGTGGGG ACGGTGGCGG CGGCGAACGC GTGCTGTTCG CCGCGATCGC GGCGGCGCAG CGAGGTGATT TCGATGGCGA TCGGCGCGGC GCGCGAGGCG ACGACGCCGA CGCCGACGCG GCGGAGAAGG AAGGCGAGGC GGCGATGATG TCGCGCGCGG ACGAGCGCCG GGACGCGCGG TCGGACGTGA GATCGAACAG TGCGCGCGAG ACGCGCGAAT GGAGAGCGAC GGAGGGAAGG GATGTCACGT GTTGCGTGTA CGCGGGCGCG ACGTCGGCGC GAGGCGACGA ACCCGTCGAG GACGGCGACG AACTCATCGC GCGCGCGCGC GAACGGTTCG GGATCGAGTT GCGAGCGCCC ATAAACGTTA TCAGACTCAC GCGTGAGCGA TGGGCGCGCG CGGAGACGTA CAAACGGTGC ACGATATTGG GGCAGTTTAT CGGCGGCGCG TGGCTCGGGC TCGAGGCGCT GTGGACGTTC GCGCCGGATG TGTTCGTGGA TACCGTCGGA CACGCGGCGA CGTATCCGAT CGCGCGATAT CTGTTTGGGT GCCAAACAGT GGCGTACGTG CACTATCCGA CGGTGTCGAG GGACATGATC GCGCGCGTGG AGAGCGGAAG ACTGATGTAC AACAACAGCC GCCTGTTCGC GTCGTCCAAG TTTTTGAGCG GCCTCAAGGT ACTTTATTAC CGCGCCTTTG CCGTCGTGTA CGGGTGGTGC GGACGATCGT GTAAGTGCGT GATGGTGAAC TCTTCGTGGA CTAAATCGCA CATCGACGCA TTGTGGCGGG TTGATTCACG GGTGGTGTAT CCCCCGTGCA ACGTCGAAGA CTTGTCCAAA CTCCCGTTGA CGCGCCCGAG GTTGAACGCG CGCGGGACGC CGGTGAAAAA GGACAAGTCG TCGATACGCG TTGTTAGCGT GGGGCAGTTT CGACCGGAAA AAGCGCACGT GGTTCAAATC GCCGCCTGGA AGGCTTTGAA AAAGTTCAAG ACATTATCGA GCAAGATTGA AAACGCCATT TTGGTCTTCG TCGGTGGATG CCGTGACGAA GCCGACCGCG AGCGCTTGGC AGATTTGCAA CAAAGTGTCA AAGATCTGGA GCTCCAGGAT AGCGTTCAGT TCCACGTCGA CGTGTCGTAC GACGAAGTCA AGCGCGAGCT GTCGCGCGCG TCCATCGGCC TTCACTCCAT GATTGACGAG CATTTCGGTA TTTGTGTCGT CGAGTACATG GCGGCAGGCG CGGTGCCCGT CGCTCACGCA TCTGGCGGAC CTTTTCTCGA TATCATACGC GACCAACACG ACGGCCCGAC AGGTTTCACT GCGGATAGCG TGGCGACCTT CGCCGAAACG CTCGAGCACT TGTTGCTCAT GCGCCGAACC GAGCGGGAGG AAATTTCAGC GCGCGCGCGC GCGCGTAGTG ACATTTTCAG CGAAACAGAA TTCAACTCAA ACTTCATCGA CAGTCTCGTC AACTCTGGCG TTCTCTAG
Protein sequence | MFKTSAKQRR RVRGVVRGAG RGRVIGFYHP SGGDGGGGER VLFAAIAAAQ RATEGRDVTC CVYAGATSAR GDEPVEDGDE LIARARERFG IELRAPINVI RLTRERWARA ETYKRCTILG QFIGGAWLGL EALWTFAPDV FVDTVGHAAT YPIARYLFGC QTVAYVHYPT VSRDMIARVE SGRLMYNNSR LFASSKFLSG LKVLYYRAFA VVYGWCGRSC KCVMVNSSWT KSHIDALWRV DSRVVYPPCN VEDLSKLPLT RPRLNARGTP VKKDKSSIRV VSVGQFRPEK AHVVQIAAWK ALKKFKTLSS KIENAILVFV GGCRDEADRE RLADLQQSVK DLELQDSVQF HVDVSYDEVK RELSRASIGL HSMIDEHFGI CVVEYMAAGA VPVAHASGGP FLDIIRDQHD GPTGFTADSV ATFAETLEHL LLMRRTEREE ISARARARSD IFSETEFNSN FIDSLVNSGV L
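The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a blocked string could be joined and its GC fraction computed is given below. The helper names are hypothetical, the example string is only the first three blocks of the gene sequence, and the calculation is a plain (G + C) / length ratio, which may not be exactly how the database derives its "GC content" field.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a contiguous DNA string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence, used purely as an illustration.
example = "ATGTTTAAGA CGAGCGCGAA ACAGCGACGG"
seq = join_blocks(example)

print(len(seq))                    # 30
print(f"{gc_fraction(seq):.1%}")   # GC fraction of the 30-base excerpt
```

Passing the full "Gene sequence" value through the same two functions gives a figure that can be compared with the GC content reported in the record.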