Gene Information |
Locus tag | OSTLU_31950 |
Symbol | |
ID | 5002597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 16303 |
End bp | 19902 |
Gene Length | 3600 bp |
Protein Length | 1199 aa |
Translation table | |
GC content | 55% |
IMG OID | 640418018 |
Product | predicted protein |
Protein accession | XP_001418349 |
Protein GI | 145347799 |
COG category | [O] Posttranslational modification, protein turnover, chaperones; [Z] Cytoskeleton |
COG ID | [COG5234] Beta-tubulin folding cofactor D |
TIGRFAM ID | |
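The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS includes its terminal stop codon; the record does not state either convention explicitly, though the gene sequence listed further down ends in TAG and the gene is marked as not spliced. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Protein length for an unspliced CDS.

    Assumes the CDS length is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (OSTLU_31950).
length = gene_length_bp(16303, 19902)
print(length)                     # 3600
print(protein_length_aa(length))  # 1199
```

Under these assumptions the computed values agree with the Gene Length (3600 bp) and Protein Length (1199 aa) fields.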
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGCCA TCGTTCGCGC GTTGACTCGG CGAAGCGCGC GTGGACGCGA CGAAGAAGAC GACGCCGCGG ACGAAGACGA CGTGCGGGCG TTCGTGGGAG TGATTGAAAA GTATAGAGAA CAGCCGACGG TGCTGGATCC GATGCTGGGG GGCGTGATCG AGCCTTTGAT GGACGCGGTG GCGCGAGCGT CGACGGAGGC GAACGAAAAC GAAAACGAGA ACGCGAACGC GAAGGCGAAC GCGAACGCGT GCTGCAGGGC GCTGGATGCG CTGTCGAGCG TGCGCGGATG GAAGACGTGC GTGCGGTTCT ATCCGAACGC GGCGAAGTAT TTAGAGCCGG CGGTGCGTCT GCTGCGGGAA GCGCGAGTGC GTGGAGACAA TACGTGGGAA ACGCAACGCG TGCTCACGAG TTGGTTGTCG ATTTTGGCGT TGGCGCCGTT TGACTTGGTG TCGATCGATA GCGCGATCGA TCCGCATTCG TCGCGTTCGA AAATTCCGAG CGTCGTGAGT GATTTGATGC GAGAGTGCAA GCACTTTTTA GGCGATCCGT CCGCGGTGCG CGACGTGGCG GCGCAAACGC TCGCCAAGCT TCTCACGCGA CCGGACATGA GCGAGGCGTT GCGCGAGTTC ATGACCTGGT CGAGCGCGAC GCTTCGTGGA GACGTCAATG ATGAAAAGGA AAGAGAAATG ATTTTCTTAG TACCGGGTGT ACTGCGGGCG CTCGCGGCGA TATACAAGAT TGGATCTCGA GAGCAGTTGC TGCCGTACGC GGAGGGAAAC TGGGACGACG CGCAGTATTG CGCGACGCGG CTAAGTTTGG CGAAACGCTC GACCATGGTG AGACAACTGA GTATCAAGCT CGCCAGTCGC GTGGGTTTGG TTTTTATGAA ACCTCGAGTG GTGTCGTGGC GATATGATCG TGGTGCGCGG TGTTTGCAAG ATAACTTGAG CGGGGCGATG CAAAAGCCGC CGACGAAGCA ACTCACCACG GCGGCGGACG AAGATGATAA ATGCGACGTG CACATGGCTG TTGATGATAT TGTTGAAATA TGTCTCGTCG GTTTGCGAGA TGCGGAGACT ATCGTACGCT GGACATCGGC GAAAGCGTTG GGGAGAATTA GCTCTCGACT CCCGCGTGAT TTTGGTGACG AAGTCGTTGG AGCGGTTTTG GCGTGCTTAT CGGTTATCGA AAGCGATTCA ACTTGGCACG GCGCATGTTT AGCTCTGGCT GAGCTCGCTC GACGTGGATT GTTGTTGCCG AATAGATTGG TGGAGGCCGT ACCGCGATGC ATGGACGCTC TCATCTACGA CGTTCGACGA GGAGCGCACT CAATCGGTGC GCACGTGCGA GATGCGGCAG CATATGTATG TTGGGCGTTT GCGCGCGCAT ATGAACCGGG CGTTTTCGAA CCTTTTGTCG ACCAACTTGC ACCGAGGCTT CTCATGATAT CGTGTTTCGA TCGTGAAGTT AATTGTCGCC GAGCCGCATC CGCTGCCTTT CAAGAAGCCG TCGGACGGCT CGGCAAGTTT CCTCACGGCA TCGACATTGT CACCGTGGCG GATTACTTTT CGCTTGGATC GCGAACCCGA GCTGCGTTGA CGGTGGCACC ATTCATCTGT CAGTTTGAAG AATATAGGCG TTCGTTACTC GAGCACGTGT TGGACACGAA GCTCACGCAC TGGGAACTTG CCACGCGGCA ACTCGCGACG AAAACAATCA GAGCTCTGGG TAATTTAGAC CCGCAGTGGA TCGGTGACGT AGGCATAAAA ACAGTTCTGT CGCGCGCGAC GAGTTCTGAC 
TTGTCGACGC GCCACGGCGC TGTGCTTTCG ATCGGTGAGA TGTTACTCGT GACACAGCGT GCAAAAACAA AACTCGAAGA CGACTGTTTC GAACGAGTAG CCGATTTAGT TCAAAGTATG GAAAGGGAGA AGATGTACAA AGGAAAAGGC GGCGAAATAA TGCGTGGCGC GACGTGTAGG CTCATTGAGT GCGTGTTTTT GTGCTGCGAC GAGAATCACA AGATTGACTC GAAGGCGACA GATGCCTTTG TCTACTTTGC GGAAGAGAGT CTGCAGTGCT GCAACGGAGA CGTACAGGCT GCCGCCTCAG ACGCCATCGC CGCATTTACA GAGACGAATT ACGCTTCGCG TGGATCTCAC CGTGCTCATT GCTTGCTATT GAGGCACGCT GAAATCGTCG TCAACGATCT CGTAGGTGTC GTGCGGCGTG GATCGGCGCT CGTATTAGGC GGATTTCCTG TGACAAGTCT TCTCGCCGCG AAAAATAGTG AAGATAAAAG TGCAACTCTG CGCGCGGTCA TCACGGCGCT GTCGGTAGCC ACGAAACCGG AAGAAGATGT AGAAATGAGA GACGCCGAGA CGCGGGTGAA CGCAACGATC AGTCTCTCGG AGCTGAGTGT GAAATTAATG TGTGCAGAGT GCCATGATAT CGATGACGAC GACATCGCCT TTGTATCGGA CACTGCGATC GCCACTTTAC TCGGATGTTT GTGCGACTAC AGCGTTGACA ACCGTGGCGA CGTCGGCTCG TGGGTGCGAG AATCGGCGAT GAAATGCTTC CCTGTGCTTG TCGCCGCTTT GCAAATGCGC AATGCTTTAG CTGCAGATCA GTCGCAAAAC ATCATGACAG CACTTCTCAA GCAAGCATTC GAAAAAATCG ACCGCATTCG ATGTCAAGCG CTTGTGACGC TCGTGCAGCT CGTGCGTGGT GGTGACGCGA TTCGAGTTAG AATGCGAGTG CAGGCCAAAC TAACAGTACA CGCGCTCTCT GGTGTGCCAG ACTACGATGT TTTACAATGT TGCTTGCCAG CGACTGTCGA AACAGCGCCA GATGCTTCCC ATGTCTCGAC AATTTTCGCG ACGTTAACTC CCGTTCTCGG CGCAGAGGCG TACGTCAACG CCGCGTTGAG CGGATGGTTT CTGAGTTGCG GAAGTGTAGG CGACAGTCTG GTGCGCTTTT CGACCGATGC GTTGTTACGA GCGATTAGGC GGTTTGAGGG CCTTCCAGAT ATTGTTGTGG CATCGATTAT ACAAGATCTG TGTCAAAACA AGCACGTTGA TCGCGTTACG GTACCGGCTT TACGAGTGTG CGATGCCCTA ATTTCGCACG GCGCGTTGGA TCAGGCGCAC ACGCACGCGA TTCAACTCAT CGAAGCTATT CGTTGTGAGT GTTTCTCGAG CAGAGATATT TCAAAGCTCG TCACTGGGAG CGCATGCCTG GCTCACTTCG TCGGTGCTGC TGATAGCGTC GTTCACGAAT CAGCATCGAT GGGGCTACTC GCACTCATGG CGAACCGCTT CCCTCGCGTG CGTTGCGCAG CGGCGGAGCA TTTGTACATT GCCCTCCTTG CTGTCGCCGA ACCGAGTCGG GGAACTGAAA ACGCAGCTGA AACCCTGTCG TTGAATTCGT GGGACGCACC ACCGAGTGTC ATGAAAGAAA CACGCAAAAT AATTTATAGT TTACTAGGAC TAGACCTCCC AGCTTTCATG CTGAAAGCTT CGGGAAAACT TCGGGACCGA CGAGCGGATG AAAGAGAGAA CTCGACGTAT GCGTCGCTCG TTGGAGATAC CGGTTATTAG
|
Protein sequence | MDAIVRALTR RSARGRDEED DAADEDDVRA FVGVIEKYRE QPTVLDPMLG GVIEPLMDAV ARASTEANEN ENENANAKAN ANACCRALDA LSSVRGWKTC VRFYPNAAKY LEPAVRLLRE ARVRGDNTWE TQRVLTSWLS ILALAPFDLV SIDSAIDPHS SRSKIPSVVS DLMRECKHFL GDPSAVRDVA AQTLAKLLTR PDMSEALREF MTWSSATLRG DVNDEKEREM IFLVPGVLRA LAAIYKIGSR EQLLPYAEGN WDDAQYCATR LSLAKRSTMV RQLSIKLASR VGLVFMKPRV VSWRYDRGAR CLQDNLSGAM QKPPTKQLTT AADEDDKCDV HMAVDDIVEI CLVGLRDAET IVRWTSAKAL GRISSRLPRD FGDEVVGAVL ACLSVIESDS TWHGACLALA ELARRGLLLP NRLVEAVPRC MDALIYDVRR GAHSIGAHVR DAAAYVCWAF ARAYEPGVFE PFVDQLAPRL LMISCFDREV NCRRAASAAF QEAVGRLGKF PHGIDIVTVA DYFSLGSRTR AALTVAPFIC QFEEYRRSLL EHVLDTKLTH WELATRQLAT KTIRALGNLD PQWIGDVGIK TVLSRATSSD LSTRHGAVLS IGEMLLVTQR AKTKLEDDCF ERVADLVQSM EREKMYKGKG GEIMRGATCR LIECVFLCCD ENHKIDSKAT DAFVYFAEES LQCCNGDVQA AASDAIAAFT ETNYASRGSH RAHCLLLRHA EIVVNDLVGV VRRGSALVLG GFPVTSLLAA KNSEDKSATL RAVITALSVA TKPEEDVEMR DAETRVNATI SLSELSVKLM CAECHDIDDD DIAFVSDTAI ATLLGCLCDY SVDNRGDVGS WVRESAMKCF PVLVAALQMR NALAADQSQN IMTALLKQAF EKIDRIRCQA LVTLVQLVRG GDAIRVRMRV QAKLTVHALS GVPDYDVLQC CLPATVETAP DASHVSTIFA TLTPVLGAEA YVNAALSGWF LSCGSVGDSL VRFSTDALLR AIRRFEGLPD IVVASIIQDL CQNKHVDRVT VPALRVCDAL ISHGALDQAH THAIQLIEAI RCECFSSRDI SKLVTGSACL AHFVGAADSV VHESASMGLL ALMANRFPRV RCAAAEHLYI ALLAVAEPSR GTENAAETLS LNSWDAPPSV MKETRKIIYS LLGLDLPAFM LKASGKLRDR RADERENSTY ASLVGDTGY