Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_39912 |
Symbol | |
ID | 4999547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 133130 |
End bp | 134632 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | |
GC content | 55% |
IMG OID | 640414968 |
Product | predicted protein |
Protein accession | XP_001415398 |
Protein GI | 145340576 |
COG category | [R] General function prediction only |
COG ID | [COG1078] HD superfamily phosphohydrolases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.694161 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACTGC GCGCGGAGGA CGACGATGAT GATCCGTTCA TCGCGCAGTT GAGTCAGATC GCGCACGAAC CGCGGGTGCA CGTGGGGGAG GAGAACGCGG CGGCGACGGC GTATTGCGCG GGGAGACGGT CGCGAGGAAA GACGTTTAAC GATCCCGTGC ACGGACACAT GTACTTTAAT CCGAAGCTGT GCGATGTGAT CGATACGCCG CAAATGCAAC GGTTGCGAGA GTTGAAGCAG TTGGGGACGT CGTATTACGT GTTTCCGGGA GCGGGGCATA ACAGGTTCGA GCACTCGTTG GGAACGTGTC ACCTGGCGAA CACGGTGTTC GAGTCCATCA AGCGCAGCGC GCCCAGGCAC GGGTTAGGGC TGACTGTGGA GGATAAGTTA TGCGTGCAAC TCGCGGGGCT GTGTCACGAC ATGGGCCACG GGCCGTTTTC GCACGTGTTC GATAACGAGT TTTTGCCGTT GAGACACGGT TGGGATCCGA AAGTCGTGGC GCCGTGGAAT CACGAGCGCA TGGGGGTGGA CATGTTTTCT TGGTGCTTAG ACGATAACCA CATTGATTTA GAGCCTCAAG TCGTGCGGCG CGTGTGCGAT TTCATCACGA GCAACGAGGA AGGAGCGAAG GAGAAGCGAT TTTTGTTTGA CATCATCGCC AACAAACAAA ACGGCATCGA CGTGGACAAG TTCGAGTACC TGTTGCGAGA TTCTTACCAG GCCGGCGTGC GCATGAGCGT GGATACGATG CGATTGACGT CGCACATGAA GGTGATCGAT GACAGGATTT GCTTCAAGTC GAGTGAGGCG AACAACGTGT ACGCGTTGTT CCACTCTCGA GCGTCCATGC ACCAGAGCGT GTACACGCAC AAAAAGGCCA AGGCGGTGGA ATATATGGTG GTCGATGCAT TAGTCGAAGC CGACATCGCG TGGAACGGGC GAATTAGCAA CTCCATTTGG AGCGTTGAGG ATTTCATCGC GATGGACGAC ACGCTGCTCA AACAGATTGA ATTTTGTGAC GATCCCGCGC TCGCCAAGGC ACGAGACATC GTGCGACGCA TCCGTCGTCG CGAGTTGTAC CGATTCGTGA ATGAATACAC CGTGCCCGAG GATCAAGTGG TGGATTTCAA GCCGGTCGAG GCGAAAGACA TCACGTCATG CCAAGGAACG AACAACATCC CGGGCGGTTT GAAACCAGAC GACATCATTG TGCAGTGCCT GAAGATTGAT TACGGCCAAA AAGGACACAA AGATCCCGTG GAGAACGTCA GGTTTTTCCA CTACTGGGAC GACGAAACCT CGTGCAGCAT CGCCAAAGAG CAAATCAGTT CGCTCTTGCC GCGAAATTTC GTCCATCGCG TCGTTCGCGT CTTCAGTCGC CGCCGCGAAC CAGAGTACAT CGAAGCCACC GCGCAAGCCT TCTCGAATTT CCAGCGCCGT CAGCTCGGCA AAGAGGCGCA AATCACCCCG GTGAAGCGCC AGAGATTTTC GAACGATAGC TAA
|
Protein sequence | MELRAEDDDD DPFIAQLSQI AHEPRVHVGE ENAAATAYCA GRRSRGKTFN DPVHGHMYFN PKLCDVIDTP QMQRLRELKQ LGTSYYVFPG AGHNRFEHSL GTCHLANTVF ESIKRSAPRH GLGLTVEDKL CVQLAGLCHD MGHGPFSHVF DNEFLPLRHG WDPKVVAPWN HERMGVDMFS WCLDDNHIDL EPQVVRRVCD FITSNEEGAK EKRFLFDIIA NKQNGIDVDK FEYLLRDSYQ AGVRMSVDTM RLTSHMKVID DRICFKSSEA NNVYALFHSR ASMHQSVYTH KKAKAVEYMV VDALVEADIA WNGRISNSIW SVEDFIAMDD TLLKQIEFCD DPALAKARDI VRRIRRRELY RFVNEYTVPE DQVVDFKPVE AKDITSCQGT NNIPGGLKPD DIIVQCLKID YGQKGHKDPV ENVRFFHYWD DETSCSIAKE QISSLLPRNF VHRVVRVFSR RREPEYIEAT AQAFSNFQRR QLGKEAQITP VKRQRFSNDS
|
| |