Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25644 |
Symbol | |
ID | 5006115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | + |
Start bp | 51512 |
End bp | 52750 |
Gene Length | 1239 bp |
Protein Length | 385 aa |
Translation table | |
GC content | 60% |
IMG OID | 640421536 |
Product | predicted protein |
Protein accession | XP_001421825 |
Protein GI | 145355138 |
COG category | [K] Transcription |
COG ID | [COG5641] GATA Zn-finger-containing transcription factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.10622 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTTGACGCGT CGCGGCGCGG CGCGTCTCGG CGCGCGTCGC GGCGCGACGA CCGGTGCGCG CGCCCTGGAC GACGCGGTCG ATGCGCGCGG AACCGCTCGA CGCGACGGCG TCGAACGCGT CGGCGTCGTC GCCGCGGGAG ACGAGGAAAC GCGCGAGGGA GGCGATTTTC GCGCGCTTCG ACGCGCTCGC CACCGCGCGC GGGGCGAAGC TGGCGAAGAG GCGAGGAAAA AGGAACTGCG TCGAGCTGGA GTTAGAGGCG TTGGGAGGCG TGAGCGACGA GACGCTCGCG AAGTTTGCGA CGTACGGGGA CAAAGTGCAG AAAATGATGC AAAAAGCGAC GCCGCGAGGG GCGACGCCCG GGCTCGCGCC GATTCGGTAC GCGGCGCGAG GGACGCTTTT TAACGAGCCG TATCGCGTGT CGGTGGACGA GCAAGGCGTG CATCGCGTGG TTTGGGGGAA GGAAGGCGAG TCGCGCAGGG TCGTGCGGTC GAAAGATGGG GCGCGTACCC CGCAAGAGGC GGTGGAATGC GTGCACGTCG AAATTCTGAG GCGAATGAAT GAAAACATTG CCTCAGTCAG CGACGAATCG CATGATGGTA CATTCGAAGC CCGCATATGC AGAAATTGTC TATGCGACTG CTCGAAGACG CCACTGATGC GTCGTGGTCC CGATGGGATC GGTACGCTTT GCAACGCTTG TGGGCTGTGG TGGAGTCGAC ATCAAACGAT GCGCGAGTAT CCGTCGGTGG TGCCAGAAGA AACGCCGCAC AAGGCGATAT TTATCCGAAA TCCAGTCAAA AGCCGAAGAG CACTGAAAAC GTTAGACGTT TTTGGATACT ATTCATCGAG CGTGCAAGCG ACGCTCGCCA AGGCTTGCGC CGCCGTGCTC CAAGAAGAAC GGCGTTCACT TCGGCTTCCG CGCGTCAAGT CGAAACCAGA CGTACGATAC GACTTCAAGC GAGCTATCCA CGACGTGTTC GATCGGTGCG ACTGTGTCTT CGCGGACTCC CCTTCTCATT CAGATGATTT AGGATGGGAA CATTACCCCC CATCGATCAC AGACTTCGCA GCTGAACAAC TCGAAGAACT CGAAGGAGTC AAACTCGAAG GAGTCAAACT CGAAGGAGTC AAACTCGAAG GAGTCGAACT CGAAGGAGTC GAACTCGAAG AACTCGAACT CGAAGAACTC GAAGCTTTCG CGTTCGGTTT CGAGCCCCCG CGGTTGGATC CAGTCTGAT
|
Protein sequence | MRAEPLDATA SNASASSPRE TRKRAREAIF ARFDALATAR GAKLAKRRGK RNCVELELEA LGGVSDETLA KFATYGDKVQ KMMQKATPRG ATPGLAPIRY AARGTLFNEP YRVSVDEQGV HRVVWGKEGE SRRVVRSKDG ARTPQEAVEC VHVEILRRMN ENIASVSDES HDGTFEARIC RNCLCDCSKT PLMRRGPDGI GTLCNACGLW WSRHQTMREY PSVVPEETPH KAIFIRNPVK SRRALKTLDV FGYYSSSVQA TLAKACAAVL QEERRSLRLP RVKSKPDVRY DFKRAIHDVF DRCDCVFADS PSHSDDLGWE HYPPSITDFA AEQLEELEGV KLEGVKLEGV KLEGVELEGV ELEELELEEL EAFAFGFEPP RLDPV
|
| |