Gene Information |
Locus tag | OSTLU_32784 |
Symbol | |
ID | 5002838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 633362 |
End bp | 634417 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | |
GC content | 62% |
IMG OID | 640418259 |
Product | predicted protein |
Protein accession | XP_001418758 |
Protein GI | 145348648 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | |
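
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 633362
end_bp = 634417

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS ("Is gene spliced | No"), so every
# codon except the final stop codon contributes one amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1056 351
```

Under these assumptions the computed values agree with the reported Gene Length (1056 bp) and Protein Length (351 aa).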
| |
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0879664 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCGG TCGCGCGAAG CGCGCAGGGC ACGCCGCTCG ACGGCGCGCC GCTCGCGACG ACGGCCGTCG TGATCGCGGT CGCGATATCT TTCGCGCTCG CGCTCGCCGC GAACGGCGAT TTCGCGCGCG TGTGCGCGTC GCCGCGGTTG GCGTTTGAAC ATCCGCTGTC GAGCTATCAT CGCGTCTGGA CGTCGACGTT CTCGCACGGG AGCTTCCCGC ACGCGCTGCT AAATTGTCTA GCGTTCGTTC CGATGGCGTC GGCGCTCGAG CGATCGATCG GGACGACGCA CTTCGCGTGG TTATTCGCGA CGTTCGCGCA CGCGGCGTAC GCGCTCTCGG CGAGCGCGGC GACGGCGCTT TGGATGGCGC TCGGATATCG CGCGTCGTAC GAGAGCTGCG CCATAGGAAT GTCTGGGGTG GTGTTCGCGC TGATCGTGTG CGAGACGAAC GTGAACGACG TGGAGCGGCG AAGCGTGTTC GGGTTGTTCA CAGTGTCGAG CGAGTATTAC CCGATCGCTC TTCTGCTTTT CATTCAACTT TTGATGCCTG GCGTCTCTTT CATCGGTCAC GCGGGTGGTA TCGCGGCTGG ATGGTTGTAC GTTCGCGGAT ACTTGAACTT TTTGCTCCTG AAGGAGACGC ACGTGGAGTA TTTAGAGAAA TTAGCGATTT GCGCGCCCGC GCGCGCCCTG GCGTCGTTCG TGCCGTCAAA CGCCGATCGG GGCGCGCGGC CGAACGCGGA GGCGAGTTCG ACCGCATTTC CCGCGTTTTC GACGGTTAGA GCGATCCCGA CTCGTATGAG TGAGGTGACG CGCAACGCGT TCGCGGGTAA TTTCCCGGGC CAAGGACGGA AACTTGGCGG CGACGGGTCC ACGACGGGTG AGATGGCGAA TCTCGTTCGG GTCGATCCTC GGGCGTTGGA TACGTTAGTA GAGCTGGGAT TCGCAGAACA CGCCGCGCGG CGAGCGTTGC AAGAATGTGA CGGCGACTCG CAGCGTGCGA TCGAGTTGTT GACGGAATCA GCGGCGCACG ACGCGAATAG CGACGAAATA GTTTAG
|
Protein sequence | MSAVARSAQG TPLDGAPLAT TAVVIAVAIS FALALAANGD FARVCASPRL AFEHPLSSYH RVWTSTFSHG SFPHALLNCL AFVPMASALE RSIGTTHFAW LFATFAHAAY ALSASAATAL WMALGYRASY ESCAIGMSGV VFALIVCETN VNDVERRSVF GLFTVSSEYY PIALLLFIQL LMPGVSFIGH AGGIAAGWLY VRGYLNFLLL KETHVEYLEK LAICAPARAL ASFVPSNADR GARPNAEASS TAFPAFSTVR AIPTRMSEVT RNAFAGNFPG QGRKLGGDGS TTGEMANLVR VDPRALDTLV ELGFAEHAAR RALQECDGDS QRAIELLTES AAHDANSDEI V
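
Both sequences are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup, together with a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and the demo input is only the first two display blocks of the gene sequence above.

```python
def clean_sequence(displayed: str) -> str:
    """Remove the spaces and line breaks used only for display grouping."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Short demo: the first two display blocks of the gene sequence above.
demo = clean_sequence("ATGTCTGCGG TCGCGCGAAG")
demo_gc = gc_fraction(demo)
print(len(demo), demo_gc)
```

Applied to the full gene sequence, the cleaned string should be 1056 characters long if the record is internally consistent, and its GC fraction can be compared with the reported GC content of 62%.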