Gene Information |
Locus tag | OSTLU_31680 |
Symbol | |
ID | 5001837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 348263 |
End bp | 349365 |
Gene Length | 1103 bp |
Protein Length | 268 aa |
Translation table | |
GC content | 68% |
IMG OID | 640417258 |
Product | predicted protein |
Protein accession | XP_001417993 |
Protein GI | 145347053 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | |
|
|
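The Gene Length field agrees with the listed coordinates if Start bp and End bp are read as 1-based and inclusive. A minimal Python sketch of that arithmetic follows. The function name `span_length` is hypothetical, and the inclusive-coordinate convention is an assumption inferred from the numbers, not something the record states.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of the record.
gene_length = span_length(348263, 349365)
print(gene_length)  # 1103

# A 268 aa protein needs 268 codons plus one stop codon.
coding_nt = 268 * 3 + 3
print(coding_nt)  # 807
```

The 807 nt needed to encode the 268 aa protein is fewer than the 1103 bp span, so the listed gene coordinates appear to include some flanking sequence outside the coding region.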
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0246535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.737397 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCGACGCCGC GCCGGCGGAC GCGCGACGCG ACGCGACGCG ACGCGACGCG AGATGACGCG CTTCGTCGCG CTCGGTCGCG GGCTCGGCGC GTTCGATCGC GGCGACGACG CGACGCGCGA CGCGCGACGA CGACGACGGC GACGCGCGCG CGAGGACGCG ACGGCGACGG CGCGGGCGGT GCGCGACGGC GCGCGACGCG CGGCGCCGAT GGTCGGGAAG GCGGCGAACG CGGTCGCGCT CGGCGCGGCG CTCGCGGGCC CGGGCGAGCG CGCGGACGGG GCGCGAGGAG GGGGGCGACG ACGCGAGGGC GGACGCCGCG CGGCGGCGCG CGGCGTGTTC GACGATCCCG ATCGCGCGTG CACGAACGCC ATCATGATCG GCACGGCGAG CGCGTTCGCG TTACAGATTT TGAGCGGACA GGCGATCACC GCGCTCGGGG CGAAGGTGAA CGAACGGATC GCCGCGGGGC AGCTGTGGCG GTTGGCGACG CCGATTTTTT TGCACGGCGG GCTGCCGCAC TTGATGGTGA ATATGTACTC GTTGAACAGC ATCGGACCGC TCATGGAGGC GACGTTCGGG CGCGAACAGT TTTTAGCGGT GTATTTCGGC GCGGGCGTGG CTGGAAATTA CGCGAGTTAT CGGTTTTGCG CGTCGAATAG CGTCGGCGCG AGCGGCGCCG TCTTCGGCTT GGCCGGCGCG TTGGCGGTGT ACTTGCAGCG CCACAAGCGA TATTTAGGCG AGCGCGCGGA CATGCAGCTG CAACAACTCG GCACGGCGTT GGCGGTGAAC ATGGGTTTCG GTCTCACGAG TAGACGAATA GACAATTGGG GGCACGCCGG CGGACTCGTC GGCGGCGCCG CGTTGGCCTT CTTAACCGGA CCTAATCTCG TCATGGAGAC CGACGGTGGC TACGGTCTGC GACGCAAACT CGTGAACAAA CCCAAGTTAC AATCCACGAT CCGCGCCATC AAGGATTTCT GGGACGAAGA CGACGAAGAC GAGGACGAGC GATGACTTCC AGGCTGCCCA GTCCCGAGCA ACAAACGTAG CGCAAGTAGA ACGACAATCG CGTCGAAATC GCACCGTTAG CGCGTGAGCG CGT
|
Protein sequence | MVGKAANAVA LGAALAGPGE RADGARGGGR RREGGRRAAA RGVFDDPDRA CTNAIMIGTA SAFALQILSG QAITALGAKV NERIAAGQLW RLATPIFLHG GLPHLMVNMY SLNSIGPLME ATFGREQFLA VYFGAGVAGN YASYRFCASN SVGASGAVFG LAGALAVYLQ RHKRYLGERA DMQLQQLGTA LAVNMGFGLT SRRIDNWGHA GGLVGGAALA FLTGPNLVME TDGGYGLRRK LVNKPKLQST IRAIKDFWDE DDEDEDER
|
| |
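The sequences above are displayed in space-separated groups of ten characters, so the spacing has to be removed before lengths or base composition can be compared with the Gene Length, Protein Length, and GC content fields. The following is a minimal sketch under that assumption. The helper names `normalize` and `gc_fraction` are hypothetical and are not part of any database tooling.

```python
def normalize(seq: str) -> str:
    """Remove display spacing and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among all bases of a nucleotide sequence."""
    clean = normalize(seq)
    if not clean:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in clean) / len(clean)


# First three 10-base groups of the gene sequence, as displayed in the record.
head = "CCGACGCCGC GCCGGCGGAC GCGCGACGCG"
print(normalize(head))       # CCGACGCCGCGCCGGCGGACGCGCGACGCG
print(len(normalize(head)))  # 30
print(gc_fraction(head))
```

Passing the full Gene sequence or Protein sequence field to `normalize` and taking `len` of the result gives a value that can be checked against the Gene Length and Protein Length fields, and `gc_fraction` on the full gene sequence can be compared with the GC content field.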