Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_24854 |
Symbol | HIRA3501 |
ID | 5003001 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | - |
Start bp | 649526 |
End bp | 652333 |
Gene Length | 2808 bp |
Protein Length | 878 aa |
Translation table | |
GC content | 57% |
IMG OID | 640418422 |
Product | predicted protein |
Protein accession | XP_001419001 |
Protein GI | 145349146 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0065395 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGTGGTGG TGTATAAACC CGAGTTCGTC GCGCACGACG GGACGGCGCC GATATTCTCC GTGGACGCCG CGCCGGATGG GTCGAGATTC GCCACGGCGG GAGGCGATCA AAAGGTGAAG GTTTGGGCGC TCGCGCCGGT GCTTGAACGC GAGATTGAGG CGGATGAAAA CGCGCCGAAG TGCTTGGCGA CGCTGAGCGA TCACTTCGGA CCGGTGAATT GCGTGCGGTT CTCGCGGAAT GGACGGTATT TGGCGAGCGG GTCGACGGAT ACGAGCGTGT TGGTGTACGC GCTGCGAGAG GGACCGGGAA AGGCGGCGTT CGGGAGCGCG GATGCGCCGA ACGTGGAGAA TTGGACGATC GCGGCGAGGT ATAGGGGACA CGGATCGGAT GTGATCGATA TCGCGTGGTC GCCGGATGAT TCCATGTTGG CGTCGTGCTC GCTCGACAAC TTGGTGATTA TCTGGGACTG TCGCACGGGG AATCCGGTGG CGACGCTTCG AGGGCACACG AGCTTTGTCA AAGGCGTGGC GTGGGATCCG ATAGGGAAAT TTCTGGCGAC GCAGAGCGAT GATAAGACGT GCATCATATG GCGTACGGAT GATTGGACGC AAGTGGCCAA GGTTGAGGAA CCGTACCAGG CGTCCATGGG CGCAACGTTT TCGATGCGCC TGTGCTGGAG CCCGGACGGC AAGGCGGTGA CGACTTGTAA CTCGTACAAA AAGCCGTCTC ACACGGCATC GGTGTTGGAA CGCGGGTCTT GGGGGAGCAA CTTTGACTTC GTCGGACACA AAGGACCCGT CGTTGCCGTG CGCTTCTCGC CCGTGCTTTT CCACGACGAG AAGCGAGATA AAGTTCACAC AGTCATCGCG TGTGGGTCGC AAGACTGCAA ACTGACAATT TGGACCACGA ACAGGCCAAA GCCAGTGTGT ATCGTGCGTA AATGCTTCTC GCAGTCCGTG GTGGACTTGT GCTGGACTCC GGATGGGTAC ACATTGCTGG CGTGCAGCAC CGACGGGACA CTGTGCACGT TTAAGTTTGA CCCGGCTGAA ATCGGTGAAA AGTTGGACGA CACCGCCGCT GAAGCCTTTT TGTCCGAGAC GTACGGTGAT ATCAAACGCA ACCGCGCGCC CATGCTGGAA GATCCCACGT TGCTTGGATT CGCGGCGGCT GCGCCGGGCG AGTTAGACGA CGTGTCCAAG GGTGTGAAGA CAACTCCGCC GCCCCCGAAG CGAATTCAGC CCGCGCCGAT TCGAAGTAAT AATGGCGTAG CACAAAACGG AGTGGCGACA ATATCAGCGC AACGCGAGAC GGTGAGTAAC GACGGTCGTC GACGCATTGT TCCCGTCGCA GCCACCGGTC CATCTGTGGG CACTCCAGTC AAGGCACCGC AACGCGTGCA GCTCCAGCCT TTGCCAGCGA ATCCGCAGCC TCAAAATTTG CAGGATGCTA CGAATGGATT GAAACGTAGA ATAGAACCTA CGGCAATGAA TGGGGGCAAT GGAGTGCAGA GTGCTTTTGA GGCGCCATCT GCGCCCGCAC CGAAGCGAGT GGCGCTTCAA CCGACGCAAG TGCAGCAGGT TCCACCGACT ACTCCGTCGA TGCCGCAGCG TTCGCAAATT CCTACTACAG CATTGCCCGC ACCACCGCCT CTTCCCGTGG CTCCGGCGCC CGGTTCGATG CACATACAAC TCGTTGAGCC AAACTTAGAG GCGGATGAAG GAAGCGAGGA ACGCGCACCA CTACTTCTCG AGGCAAAGAA CCACCATGAG CGCTCTTATG TGGAGCTGGT TTGCTCGCAA GCGGGTAAGA TGAAGTGGAC GGATCGCATC GATATGAAAG CTACTCATCT CAGTGGAAAT AAGAACTTCT GCGCTGTTGC TCTTGAGGAT GGTTCATTGC AACTCTACAC CCCATGCGGT CGGCGTTACA TGCCGTCTCT GCTTTTGCCT GATCGAGCTG CGTTCTTAGT GTCCGGCAAA GATGATTCGT CCCTTTTGAT CGTTACTAGA GACTACACGC TCCTTGTGTG GAACGTGGCC GTGGGCAAAG AAACTTGCAC GCTCAAGGCG GACTGCTCGG CGCTCGTGCG TAGTTCCTCG CGCTCTGGAG TTGCGCTCGC GACGGTGCGA CTGTCAAAGT CCGGGGCTCC GATTTTCACC TTCACGAATG GTTACGCTTA CGTCTATCAC AGTGATCTGC AAACGTGGGC TCGAGTTGCC GACCAAAGCT TCATGCGATC GGAGTTTACA TCTCGGCTTC GTCAGCCGGG TAGTTCAGGC TTTGGGGAAA TGCAAGCATT GCAAATCGCC GCTGCCCGCG CTGCCGCGCA CGTAGGACCA GCTGCACTGC TGCAGACAAG TGGACTGACA GCACGCCGCG AAACTGGCAG ACACTTAGAA ATTCTAGTTG CGGGTGCAGC CATGTTGCAA TCCGCGGACG AATTCAAATC GTGGCTTGCG GCGTACGTGC GTCACTTGGC GACTGAGTGC GGTGATGATC CTCGCACTAC GTTTGTCGGG GCAGAATCAC AACTTCGCGA AGTTTGCACC GAATTCCTCG GGCCTCTGAG TGCGAGCACG TCAGTCGACA CGTGGGCTTC AGAAATCCTC GGCATCAAGA AGCGCACGCT CCTTCGTGAA GTGATAATTC CGACCATCGC GGCGAATGGC AGCGCTCAGC GACTCATCGC CGAAGTTGTT ACGCTGTTGG AAGCCGCGGA ACAGCGCGAG AAGGACGCAA CAAAAATTTG AGCGAGTTCA TGATGACTTC TAGATGTTTG TGTAAAATTA CAATCAAATC AATACTCT
|
Protein sequence | MVVVYKPEFV AHDGTAPIFS VDAAPDGSRF ATAGGDQKVK VWALAPVLER EIEADENAPK CLATLSDHFG PVNCVRFSRN GRYLASGSTD TSVLVYALRE GPGKAAFGSA DAPNVENWTI AARYRGHGSD VIDIAWSPDD SMLASCSLDN LVIIWDCRTG NPVATLRGHT SFVKGVAWDP IGKFLATQSD DKTCIIWRTD DWTQVAKVEE PYQASMGATF SMRLCWSPDG KAVTTCNSYK KPSHTASVLE RGSWGSNFDF VGHKGPVVAV RFSPVLFHDE KRDKVHTVIA CGSQDCKLTI WTTNRPKPVC IVRKCFSQSV VDLCWTPDGY TLLACSTDGT LCTFKFDPAE IGEKLDDTAA EAFLSETYGD IKRNRAPMLE DPTLLGFAAA APGELDDVSK GVKTTPPPPK RIQPAPIRSN NGVAQNGVAT ISAQRETVSN DGRRRIVPVA ATGPSDATNG LKRRIEPTAM NGGNGVQSAF EAPSAPAPKR VALQPTQVQQ VPPTTPSMPQ RSQIPTTALP APPPLPVAPA PEADEGSEER APLLLEAKNH HERSYVELVC SQAGKMKWTD RIDMKATHLS GNKNFCAVAL EDGSLQLYTP CGRRYMPSLL LPDRAAFLVS GKDDSSLLIV TRDYTLLVWN VAVGKETCTL KADCSALVRS SSRSGVALAT VRLSKSGAPI FTFTNGYAYV YHSDLQTWAR VADQSFMRSE FTSRLRQPGS SGFGEMQALQ IAAARAAAHV GPAALLQTSG LTARRETGRH LEILVAGAAM LQSADEFKSW LAAYVRHLAT ECGDDPRTTF VGAESQLREV CTEFLGPLSA STSVDTWASE ILGIKKRTLL REVIIPTIAA NGSAQRLIAE VVTLLEAAEQ REKDATKI
|
| |