Gene Information |
Locus tag | OSTLU_12474 |
Symbol | |
ID | 5001351 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 626708 |
End bp | 630174 |
Gene Length | 3467 bp |
Protein Length | 547 aa |
Translation table | |
GC content | 64% |
IMG OID | 640416772 |
Product | predicted protein |
Protein accession | XP_001417301 |
Protein GI | 145345617 |
COG category | [C] Energy production and conversion |
COG ID | [COG0281] Malic enzyme |
TIGRFAM ID | |
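
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the reported gene length of 3467 bp. The function names are hypothetical and are not part of any database API. The coding-length estimate assumes three bases per residue plus one stop codon; it being much shorter than the gene span fits the record's "Is gene spliced: Yes" flag.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Estimated coding length in bp: 3 bases per residue, plus a stop codon (assumption)."""
    return 3 * protein_aa + (3 if include_stop else 0)


if __name__ == "__main__":
    # Values taken from the record above.
    print(gene_length(626708, 630174))  # 3467
    print(coding_length(547))           # 1644
```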
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.161702 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGATC GAGGGCCGGT GGCGCGCGCG GCGCCGCGAC CGCCGTGCGG GCCGGGATAC GCGACGCTGC GCGACGCGGG AACGTATCGA GGATTGGCCA CGGTCGATCG CGAGGGACGC GGTCTGCGAG GGTTGTTGCC ACCGAACGTC GTCGAGGACG ACGTGGAGGT GCGCCGGGCG AGGGAAGCGC TCGGGCAGTG CGACGGCGCG TTCGAGTCGT ACAAGCAGTT GGTGGCGCTG CAGACGACGG ACGAGCGGAC TTTTTATAAA TTGTTGCGAT CGGACACCGA GCGGTTGCTG CCGCTGTTGT ACACGCCCAC GGTCGGCGAG GCGTGTCTGA GATTCGGGAC GCTGATACAA CGACCGATGG GGGTGTGGGT AAGCTCGGAC GACGCGGGGA ACGTGCGGGA ATTGATGCGC AATTGGCCGC GCGACGACGT GAAAATCGCC GTGATCACGG ATGGGGAGAG AATTTTAGGG CTCGGAGATC AAGGAGCGAA CGGGATGGGA ATTAGCGCGG GAAAGAGCAT GGTGTACGCC GCATGTGGAG TGCCGCCGAG TGCGCTGTTG CCGATTCAGA TTGACACCGG TACAAACAAC AAGACGTTGC TCGACGATCC GTTGTACATC GGGCTCAAAC GCGAGCGCGA CCGCTCCCAG GCCTACGACG CGCTCTTGGA TGAAGTCGTC GTTTCGATTC GCGAGCGATT CGGTCCGAAC ACGATCATAC ACTGGGAAGA CTTCGCGCCG AGGAACGCGT TCAGGGTATT GAAAAAGTAT TACTCCGCGC CCGATGTCGT GACGTACAAT GACGACATCC AAGGCACCGC AGCGGTTACG GTGAGCGGGA TATTGGCGTC GGTGCGAGCG CTAGAGAGAG GTGGTGACGT CACTCAGCAG CGCGTATTGT TCTTTGGCGC TGGACAGGCG AATATCGGCG CCGCTGAACT CTTCGTCCTC GCGTTGACGA AGCGCGGCGT CGCGGAAGCC GACGCTAGGA AACGGGTGTG GCTGTTTGAT TCAAAAGGCC TCGTCGTGCG CTCGCGCGCG TCGCAGCTGA GTGATGACAA ATTGCCTTTT GCACAAGACG CGAGGGAGGA AACTGACCTC GCATCAGCGA TCGAAAGCAT CAAGCCGACG GTGTTGGTCG GCGCCGCCGC GGTGCCCGGG GCGTTTACTC AAAAAGTCGT CAAGTCGATG AGCAAACTCA ACGACCAACC GATCATTTTC GCGCTCTCCA ACCCGACGTC CAAGGCTGAG TGCACCGCCG AGCAGGCGTA CGCGTGGAGC AACGGCCGAG CGATATTCGC GAGCGGCACC CGATTTCCGC CGGTGCGATT CTCGGCTGAA AAGACGTTCG CCCCCGGATT CGCCAACAAC GCCTTCATCT TCCCACCCAT CGCCCTCGCC ACCATCGTCA GCGGCGCGAA ACAAGTCACT CCAGACATGT TTCTCACCGC CGCCGAAGCC TTGGCCGAAT CCGTCGACGC CAACCTCTTC GCCGTCGGCG CCGTCTACCC GCCCGTCGAT CGCATCGCCT CCTCCGCCGT CGCCGTCGCC GCGCGCGTCG CTCGCGCCGT CGACCCATCC ATCGATCTCG ACACCTGGCG CGCCCGCGTC CAGGACTACA TCGCATCCAA CGATTTATTC GATAGCCTGT AATCGTGCTT TCGCGCTCGC GTTCTCCCCT CGCCCTTCGT CGACGCGCGT CCCGTCCCGC TCGCGGGTTA TCCACTCACT CCGGTCGCCA TCGACGTCGC GTCCAAGCCG CGTCGCGCCG CACTTCACCG CGCGTTCGCG 
ACCGGCGCGG CGACGATGCG CGCGGGACCG CGCGCGACGC GCGTCGGCGT CGGCGTCGGC GTCGGCGTCG GCGTCGCGCG GCGTCGGCGC GCGCGCGTCG ACGCGACGCC GAGCCCGCGC GGCGACGTCG ACGCGAGCTC GAGCGCGCGC GAGCGCGCGA CGGCGCGCGA CGACGGCGTC GTGCGCACGC GCGTCGTGCG CGAGCGAGGG AGCGAGACGC CGAGAGAGGC GCGCGCGAGC GCGAGCGCGG AGGGGAGGAG GACGACGAAT CGCGAGGCGC CGCGCGGCGC GACGGGCGGC GGCGCGCGAG CGCGACGACG ACGCGCGACG ACGACGAAGG AGACGCGCGA TGGCGTCGTC GACGCGACGA TGACGACAGA AGGCGCGCCA GGGAGCGAAC GCGGCGCTGG AGGGAGGGAT CGACAGACGA AGAAGGAACG CGCGAGGACG CGCTGGGCGG ACGCGCGGTC GCGAGCGAGA GGGACGTCGA TGGGTCAAGA TGAACGTATG AGCGTCGAGG ACGAGCGAAA TCTCGGGCGG GCGATACAGG CGTTGTTGCG AATCGAGCGC GAGGCGGAGA TTTTGCGAGA TGAGGCGGCG GCGACGAGGA TTCAAGCGAC GGAGGCGCTG GGCGCGGTCG AGGCGTTGGC GCTGCGAGAG AAAGCGGCCA AGCAGTCCAA GCGTCTGTTC GAAGACACGC TCGCGTTGCG ATTAGAGTAC GACAGCCCGG CGGCGATGAG AGCTGCGGAG CGCGAGGGGA AGATGGCGCG GAAGAGGCTC GTGACGGCGA ACATGGCCTT TGCAGCGAGC ATCGCGCGAA AGCTGTACGA TAGAATTCCC GCCATGGAAA AGGCGGGGAT GAGTAGGCAA GACATGGTTC ACGAGGGCGC CACCGGTCTC GTTCGCGCGA GTGAGCTATT CGATCCTTCA AAAGGATTCA GATTCACGAC GTACGCTCAC GCATGGGTTC GTCAGGCCAT CGTACGCGGC GTGCACGCCA ACGGCAGAAC GATTCGCGTC CCTTCGCACG CTCACGTTCT CAAGAAGGCG GCTTACGAAA AGCAGCTCGA GATGCAAAGC GAACTCGGTC GCAGCGCGAC GAAAGAAGAA ATTGCCGACT CTTCGTCGCG CTTCACCGTG GAGATGCTGG AGAAGTTCAA GGATTTGAGC ATGGATCCTC TCAGCCTGGA CGCGAACGCC TCGCAGGAAA CGGACGAACT CGATCTCGCA AGCCCCGACG ACGCCGCCGA ACTCGGTCCA TCCGATGCCG TGGCGCGGGA GATTGAGAGG GATTTCCTTC GAAACGAGAT CAGAGCGAGT TTGGACCGAC TACCCGCGCC GCAAGCGTAC GTCCTCCGAC GCCGTTTCGG TCTCATCGGC GACGGTCCGA TGACGCTCGC CGAAATCGGC GCCGCGCTCA ACAAGACCGG TGAAGGTGTC CGCTACCTCG AGAAGAAAGC GCTCGAAAAC CTCAAACGAC AAGATGGTAT CACTGTCGAT GGCGTCACCG TCTCGCTCGC ACAGCATTTG GATACGTCCA ACGACGCGAG GGCGTCGATG GCGACGGCGC CCGCGAAACC AAAACGCAGA CGCGCCAAGA CCAAGACCAC GACGATCGCG CCGTCCGCGC TCGATCTCGT GCGCTCGATC GATTTAGCAT AGCATAG
|
Protein sequence | MADRGPVARA APRPPCGPGY ATLRDAGTYR GLATVDREGR GLRGLLPPNV VEDDVEVRRA REALGQCDGA FESYKQLVAL QTTDERTFYK LLRSDTERLL PLLYTPTVGE ACLRFGTLIQ RPMGVWVSSD DAGNVRELMR NWPRDDVKIA VITDGERILG LGDQGANGMG ISAGKSMVYA ACGVPPSALL PIQIDTGTNN KTLLDDPLYI GLKRERDRSQ AYDALLDEVV VSIRERFGPN TIIHWEDFAP RNAFRVLKKY YSAPDVVTYN DDIQGTAAVT VSGILASVRA LERGGDVTQQ RVLFFGAGQA NIGAAELFVL ALTKRGVAEA DARKRVWLFD SKGLVVRSRA SQLSDDKLPF AQDAREETDL ASAIESIKPT VLVGAAAVPG AFTQKVVKSM SKLNDQPIIF ALSNPTSKAE CTAEQAYAWS NGRAIFASGT RFPPVRFSAE KTFAPGFANN AFIFPPIALA TIVSGAKQVT PDMFLTAAEA LAESVDANLF AVGAVYPPVD RIASSAVAVA ARTRQDQDHD DRAVRARSRA LDRFSIA
|
| |
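
The GC content field in this record is given as a percentage. The following is a minimal sketch of how such a value could be computed from a sequence laid out in space-separated 10-base blocks, as the gene sequence is here. The function name `gc_content` is hypothetical, and the record does not state whether the database computes the value over the full gene span or only over the coding region, so the result may differ slightly from the reported figure.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace and case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record, as a short demo.
    demo = "ATGGCGGATC GAGGGCCGGT GGCGCGCGCG"
    print(f"{gc_content(demo):.1f}%")
```

To check the whole gene, paste the complete gene sequence string into `gc_content` in place of the three-block demo.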