Gene Information |
Locus tag | OSTLU_44172 |
Symbol | |
ID | 5004340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 197412 |
End bp | 200042 |
Gene Length | 2631 bp |
Protein Length | 876 aa |
Translation table | |
GC content | 55% |
IMG OID | 640419761 |
Product | predicted protein |
Protein accession | XP_001420443 |
Protein GI | 145352201 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0058] Glucan phosphorylase |
TIGRFAM ID | [TIGR02093] glycogen/starch/alpha-glucan phosphorylases |
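The length fields above are consistent with the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (the record does not state either convention), the arithmetic can be checked as follows. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS of the given length."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon (assumed present) encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(197412, 200042)
print(length)                  # 2631
print(protein_length(length))  # 876
```

Both values match the Gene Length (2631 bp) and Protein Length (876 aa) fields of the record.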
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.079411 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGAAAGCG ATCGCGCGGA CGTGACCGAG TCATCGTTCT TGGAGAGCGC GACGACGATC CTCGATCGCC CGCGTCAGTA CGTCGACGAT GCAGAGATGG GCGTCGATCA CTTGGCGCAC AAGGAGTTGT TGTGGAAGCT ATCGAGCACG TACTTGCCAA ACGACGTGGC GAGTTTGCAG CGATCGTTGG TGCGGCACGT GGAGTATACG CTCGCGCGAA GGAGGTACAA GCTCGATACG AGCTCGTTTT ATCAAGCCAC CGCGCACTCG GTGCGAGATC GATTGATCGA GCGATGGACG GACACGCAGC AGTATTCGGC CAAGAAAGGG GCGAAGAAAG TGTACTATCT GTCGCTCGAA TTTTTAATCG GCCGCTCGCT CGGCAATGCC GTCTCAAATC TCGGGTTACG AGGCGCCTAC GCCGAGGCTT TGCGGCAGAT TGGATACGAT TTGGAAGATA TCATGTCAGA GGAAAAAGAG CCAGCGCTCG GGAACGGGGG TCTGGGGCGC TTGGCGTCTT GTTTCTTGGA CACGCTCGCG ACGCAAAATT ATCCCGCGTG GGGGTACGGC ATTCGCTACA AGTACGGAAT GTTTGAGCAA AGAATCGTGA ATGGTAAACA AGTGGAGTTT CCAGATTATT GGCTTACCGA TGGGAACCCG TGGGAAGTTG AACGATTAGA CGTGCAGTAC CCGGTTCGTT TATTCGGCCA CGTTCGAGAG TTTCAAGATC CGGACGGGAA CACGTTGTAC GCGTGGGAGG GCGGCGAAGT CGTCATGGCG CAGGCGTACG ATACCCCGAT TCCAGGCTAC GGTACGTACA ACACCAACAA TATGCGACTG TGGAGTAGCA AGCCATCGCA CGAATTCAAT CTCGCCTCGT TTAACGCCGG TAATTACTAC GGCGCGGTCG AGGCGAAGGA ACGTTGCGAG AGTATCACGT CAGTGCTGTA CCCGAACGAC GCCACGGAAG AAGGTAAGCG TTTGCGATTG AAGCAACAAT ACTTCTTCGT CAGCGCGACT TTACAAGACA TCTATAGACG TTTCAAGAAG AACGTCGGTC GTGGGTCTAC GACGATGAAG AATATGCCGG ACAAGGTGGC GATACAGCTC AACGACACGC ACCCGGCGAT TGCTATTCCC GAACTCATGC GTTTGCTCCT CGACATCGAG CGTCTACCTT GGGACGAGGC GTGGGAAATA ACGAGAAAAG TCTTCGCGTA CACCAACCAC ACCATTCTCC CGGAGGCACT GGAGAAGTGG CCGGTGCCGA TGATCACCGA GCTCTTGCCG CGACACATGC AAATCATTTA CGAAATCAAC CACCGGTTTT TGCTTGAGGT TCAAAAGATG TGGCCAAACG ACACCGCGCG CATGTCGTCG ATGAGTATCA TCGAAGAGTC ATCACCCAAG ATGGTACGCA TGTCCAACTT AGCCGTCATC GGGTCGCACA CCGTAAACGG GGTGGCGATG ATCCACACCA AGCTTTTGAA GTCTTTGCTA TTCCCCGACT TTTTGTTGAT GTGGCCAGAA AAGTTCATCA ACGTGACGAA TGGTGTCACG CCGCGACGAT GGCTGTTGCA AGCGAATCCA GCGCTCGCGA GCATTTACAC GGGCATGGTC GGCCCTGGGT GGGTAAACGA TCTCAAGCGA CTCGAGCCGA TCAAGACGAT GGCGCAAGAT CCGCAATTTC GACAGCGTTG GCGCGCGGCC AAGCAAACGA ACAAGCAAGC GCTTGCGGAA TGGCTGTACC GTTCGATGAA TATCCGCGTC GATCCCAACG CCTTGTTCGA CATGCAAATC 
AAACGCATAC ACGAATACAA GCGCCAGTTG TTGAATGTCC TTGGAATCGT GCACCGCTAC GCCGAAATCA CGCAGGCTAC GCCCGAACAG CGCAAAACAA TGGTGCCGCG CGTGCACATC ATCGCCGGCA AAGCTGCGCC CGGATACGTC ATGGCGAAAA ATCTTGTCAT GCTCGTATGC GCCGTGTCTG AAGTCGTGAA CTCCGACGCG GCGTGTCGTG ACTTACTCAA GGTGGTGTTC GTTCCCAACT TCAACGTGTC TCTAGCGGAG ATTCTCATTC CCGCGAGTGA TATCTCGCAA CACATCTCCA CTGCGGGTAT GGAGGCGAGC GGCACCGGGA ACATGAAGTT CGTCATGAAC GGCGGATTGA TCGTGGGTAC AATGGACGGG GCGAACATCG AGATAGAACA AGCGATTGGC GAGCATAACA TGTTCACGTT TGGCGCGAAG GCGGATCAAG TCGCGGCGAT TCGACGCAAG ATGGCGCACG ACCCACCAAA AATTGATCCT CGTCTTCACC GCGCCATGGG AATGATTCGC GCCGGCATCT TCGGCAAGCC GGACGACGGA GCGTACAACC AGCTCTTGGA CGCCATCGAT CCGAGGAAAG ACGTCTACCT CACCGCGCAT GATTTTCCGT CGTATCTAGG GGCGATTGCA GAAGCCGATG CGGCGTATCA GTACGAAGAG AAGTGGACCG CAAAGTGCAT CGAAGCTGCG TGTTCGATGT GGATGTTTTC ATCCGATCGC ACGATTCGGG AGTACGCGGC GAAGATTTGG AACGTCGAAC CTTTGCCGTT TCGACCGCCC AGACACCACA GTGAACGTTG A
|
Protein sequence | MESDRADVTE SSFLESATTI LDRPRQYVDD AEMGVDHLAH KELLWKLSST YLPNDVASLQ RSLVRHVEYT LARRRYKLDT SSFYQATAHS VRDRLIERWT DTQQYSAKKG AKKVYYLSLE FLIGRSLGNA VSNLGLRGAY AEALRQIGYD LEDIMSEEKE PALGNGGLGR LASCFLDTLA TQNYPAWGYG IRYKYGMFEQ RIVNGKQVEF PDYWLTDGNP WEVERLDVQY PVRLFGHVRE FQDPDGNTLY AWEGGEVVMA QAYDTPIPGY GTYNTNNMRL WSSKPSHEFN LASFNAGNYY GAVEAKERCE SITSVLYPND ATEEGKRLRL KQQYFFVSAT LQDIYRRFKK NVGRGSTTMK NMPDKVAIQL NDTHPAIAIP ELMRLLLDIE RLPWDEAWEI TRKVFAYTNH TILPEALEKW PVPMITELLP RHMQIIYEIN HRFLLEVQKM WPNDTARMSS MSIIEESSPK MVRMSNLAVI GSHTVNGVAM IHTKLLKSLL FPDFLLMWPE KFINVTNGVT PRRWLLQANP ALASIYTGMV GPGWVNDLKR LEPIKTMAQD PQFRQRWRAA KQTNKQALAE WLYRSMNIRV DPNALFDMQI KRIHEYKRQL LNVLGIVHRY AEITQATPEQ RKTMVPRVHI IAGKAAPGYV MAKNLVMLVC AVSEVVNSDA ACRDLLKVVF VPNFNVSLAE ILIPASDISQ HISTAGMEAS GTGNMKFVMN GGLIVGTMDG ANIEIEQAIG EHNMFTFGAK ADQVAAIRRK MAHDPPKIDP RLHRAMGMIR AGIFGKPDDG AYNQLLDAID PRKDVYLTAH DFPSYLGAIA EADAAYQYEE KWTAKCIEAA CSMWMFSSDR TIREYAAKIW NVEPLPFRPP RHHSER
|
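The gene and protein sequences above are printed in blocks of ten characters. The following is a minimal sketch of how such a listing could be cleaned, summarized, and translated. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field of this record is empty. The helper names are hypothetical.

```python
# Standard genetic code (assumed: NCBI table 1), indexed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
prefix = "ATGGAAAGCG ATCGCGCGGA CGTGACCGAG"
print(translate(prefix))  # MESDRADVTE
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` is one way to cross-check the GC content field.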