Gene Information |
Locus tag | OSTLU_50593 |
Symbol | TAFV3501 |
ID | 5004170 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 386953 |
End bp | 388926 |
Gene Length | 1974 bp |
Protein Length | 617 aa |
Translation table | |
GC content | 60% |
IMG OID | 640419591 |
Product | predicted protein |
Protein accession | XP_001420161 |
Protein GI | 145351604 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
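The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. The variable names are illustrative and not part of the record, and treating the start and end positions as 1-based inclusive coordinates is an assumption (it does reproduce the listed gene length). The gap between the gene length and the coding length implied by the protein is consistent with the gene being flagged as spliced, although the record itself does not give intron positions.

```python
# Values copied from the Gene Information table above.
start_bp = 386953
end_bp = 388926
gene_length_bp = 1974
protein_length_aa = 617

# Assumption: start and end are 1-based inclusive coordinates.
span_bp = end_bp - start_bp + 1

# Coding length implied by the protein: one codon per residue plus a stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span not accounted for by the coding sequence.
noncoding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, noncoding_bp)
```

Under these assumptions the span matches the listed gene length, and the remaining bases would correspond to non-coding sequence within the gene.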
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.594901 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.592668 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGCGG ACGCGGCGAT CAGCGATCAC TTGCTGTACC ACGCGGAGAC GGAGACGGAT CCGCGAGGCG CGGCGAGCGG ATACGCGACG CTGCGAGAGT GGGCGCAAAA TTCGTTGGAT ATGTACAAGA GTGAACTGTG TAGGGTGCTG TTCCCGATGT TCGCGCACGT GTTGCTGGAT TTGGTGAAGA AGGGATACGC GGAGGACGCG CGAGAGTTTT ACGCGACGCA CCACGCGGAC CACATGCGGC TGTACGGGGA GGATTTGGCG GCGTTGAGCG CGCTGAGCGA TCCCGAGCAC GTGCTCACGG ACCCGGTCGC GAGGAAGTTT TTGGATAACA AGGTGAGCCT GAAAATGACG CAGTATTCGT TTGAGTTGCT GGTCAAGTTT TTACACGCGC AAAATCTCAT CTCGATGTTG GCGGTGTTGA ACGCGCACCT GTGCGTGGAA GTGTTGAAGG GCGAACCGAG TCCGTACGAG GACGAGAAAG AGGGTTCTGT GGTCATCACC GGACGCGCGC CTGAGGCGCA ACGCAAATTT AACAGCATCA AGGGCCGATG GGGATTAGTG GACTCGTGCT TGGAAATCGA GAGGGTGAAG CAGTTTAGAA AGGAATCAAA GGAAAAGGAA AAAGAAGCGG CGGCGAAGGC GCGCGAAGAG GCGAAGAAGT CGAAGAAGCA GAAGAAGGAA GAAGCCGCCG CCGCTGAAGA ACGAGACGCC GCGGGTGGGG ATGAGGAGAT GGAGGACGCG CCCGAGAAGA AAACGGAAGC CGAGTATCCG GTGGCCATGC TCGAAGATCC GCCGAACGTG ATCAAAGGCA AGGTACCGGT GCCACAGCTC GATTACGACG CATTCGAAGA GGCTTTGGAG GATTTGCGGT GGCGAATAAG GCTTTCGAAG GATGTCCTAC CCACGGTGGC GTTCTTCACA TTCACGCACG CGCACGGATT GTTAATCTGC GCCGACGCCA CGGGCGACGT CAAGCACGTG GCTGGTGGCT TTTCGGACTC TGTCGTGCGC GTTTGGCATT TAGAAGAAGA AGATGAAGAC GACAAGGACA AAGAGATTCC GGTTGATCGC GACGCAAAGA CGCCGCGATC GATTCCGATG ACTGAGTTCG TCGGCCACAG CGCGCCCGTG CAACAAGTCG CGTTTAGCCC GTGTGGAAAG TTTTTGTTAT CGGTGTCGCG AGACTGCCAC GTGCGCGTGT GGAGCATGGA GCTGAAGATA TGTCTGTGCG CGTACGAAGG TCACCTGCAT CCGATTTGGG ACGTGCAGTG GAGTCCCTTC GGGCACTACT TCGCCACCGC GTGTCACGAC CGCGTGGCGC GCGTGTACGC GATGGACGCG CCTTTCCCGC GTCGTATGTT CGTGGGTCAT CTTTCGAACG TCGATTGCGT GGCGTGGCAT CCGAACTGTA ATTACGTCGC CACCGGCTCC GCCGATCGCA CCGTGCGTCT GTGGGACATG TTCGATGGCG AATGCGTTCG CGTTTTCGCC GGTCACGCCG CTGGTGTGCG CGCCATAGTA TTCGCGCCCG ACGGTCGAAC CATCGCCAGC GCCTCGGATG ACGGTCGCAT TTGCATGTGG GATCTCAGGC GCGCCTCGTG CGTGATCTCG TACAAAGGTC ACGTTGGTCC GGTGTATTCC ATGGACTTTG CCGGCGGCGG CAATTTACTC GTCTCCGGTG GTGCGGACGA CACCGTGCGC GTCTGGGACG CAACCGTCCC CGCCGACGAG GTAGAAAAGC TAAAAGAAGT CCCCGCGGAC GCGGACGCCG CGACGCTCGC CGCCGCCGCG 
GCTGCGGCCG CCGCCGCGAA AGCCGCCGCC GACGCCGCCG CGGCCAGACG CGAAGTCATC CGCCGCGTCC CGCTCGAGAC GTTCCCGACC AAGAACACCC CGGTGTTCCG CGTCAAGTTT TCAAGGCGTA ACTTGTGCTT AGCCATGGGC GCCCGCAGAT CGACCAAGGA ATAA
|
Protein sequence | MGADAAISDH LLYHAETETD PRGAASGYAT LREWAQNSLD MYKSELCRVL FPMFAHVLLD LVKKGYAEDA REFYATHHAD HMRLYGEDLA ALSALSDPEH VLTDPVARKF LDNKVSLKMT QYSFELLVKF LHAQNLISML AVLNAHLCVE VLKGEPSPYE DEKEGSVVIT GRAPEAQRKF NSIKGRWGLV DSCLEIERVK QFRKESKEKE KEAAAKAREE AKKSKKQKKE EAAAAEERDA AGGDEEMEDA PEKKTEAEYP VAMLEDPPNV IKGKVPVPQL DYDAFEEALE DLRWRIRLSK DVLPTVAFFT FTHAHGLLIC ADATGDVKHV AGGFSDSVVR VWHLEEEDED DKDKEIPVDR DAKTPRSIPM TEFVGHSAPV QQVAFSPCGK FLLSVSRDCH VRVWSMELKI CLCAYEGHLH PIWDVQWSPF GHYFATACHD RVARVYAMDA PFPRRMFVGH LSNVDCVAWH PNCNYVATGS ADRTVRLWDM FDGECVRVFA GHAAGVRAIV FAPDGRTIAS ASDDGRICMW DLRRASCVIS YKGHVGPVYS MDFAGGGNLL VSGGADDTVR VWDATVPADE VEKLKETFPT KNTPVFRVKF SRRNLCLAMG ARRSTKE
|
| |
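The sequences above are displayed in space-separated groups of ten letters, so they need to be cleaned before any computation. The sketch below shows one way to strip the grouping and compute a GC percentage. The function names `clean_sequence` and `gc_percent` are hypothetical helpers, not part of any database tool, and the example runs only on the first three groups of the gene sequence, so its result is not expected to match the whole-gene GC content listed in the record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into 10-letter groups."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three groups of the gene sequence above, exactly as displayed.
fragment = "ATGGGCGCGG ACGCGGCGAT CAGCGATCAC"
seq = clean_sequence(fragment)
print(len(seq), round(gc_percent(seq), 1))
```

Applying the same two helpers to the full gene sequence field would give a figure comparable to the GC content row, assuming that value was computed over the entire gene.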