Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_39392 |
Symbol | |
ID | 5004887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | + |
Start bp | 423448 |
End bp | 425028 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | |
GC content | 62% |
IMG OID | 640420308 |
Product | predicted protein |
Protein accession | XP_001420686 |
Protein GI | 145352721 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0169] Shikimate 5-dehydrogenase [COG0710] 3-dehydroquinate dehydratase |
TIGRFAM ID | [TIGR00507] shikimate 5-dehydrogenase [TIGR01093] 3-dehydroquinate dehydratase, type I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.541352 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTGC GCGCGCGCGC GGCGTGCAAA CTGACCACGT CCGTCATCGC GCCGAACCTG GAGCTCGCGC TGCGGGACGT GGACGACGCC GTGAGGAAGG GCGCGGACGT CGTGGAGCTC AGGGTGGATT TCTTGCGCGA CGACGCGGCG CGCGGGACGC TGGGCGACGC GATCGAAGCG CTGATTAAAG CGTCGCCGGT GCCGGTGATC GTGACGAATC GACCGACTTG GGAGGGAGGG CAAGATGATG GGGACGAGAG CGCGCGGTTG GAGGCGCTTT GGAGGGCGCA CGAGTGCGGG GCGGCGTACG TGGATTGCGA GGCGCTCGCG GCGGAGCGAT TCTTCGCGGC GAAACCGGCG ACGGCGACGC TGAGGAATGG GAAGACTAAG ATCATTTTGA GCTCGCATAA TTACGATGAG ACGCCGAACG ATGAGATTTT GGCGGAGATT CACGCGAAGT GCGTGCGCTT GGGGGCGGAT ATCGTCAAGA TGGCGTCGGT GTGTAACGCG GTGGAGGACG TGGCGCGATT GGAGAAGCTC TTGCGCACGA AGGGAAGGGA GATCGAGACG ATCGTTTTGG GCATGAGCGA GCACGGACAA GTGTCTCGAT TGCTCGCGGC GAAGTTTGGA AGCTTTTTGA CGTTCGGGGC GATTCGAAGA GGGGAAGAGA GCGCACCGGG GCAGCCGTTG CTCGAAGAGC TGCGAGATTT GTATCGCGTG CCGACGCAGA CGGCGGCGAC AAAGGTGATG GGCGTGATCG GGAACCCGAT CGGACACAGT AAGTCACCGG CGTTACACAA TCCTTGTCTC GCCGCCGCGG GCGTGGATGC GTGCTACGTC CCGTTACTCG TCAAGGATAT CAAAACCTTT CTCGCGTCGC CGCTCTTCGG GTCGAAGGAC TTTGTGGGGT TTAGCGTGAC GATTCCGCAT AAGGAAGACG CCTTGGAGTG CTGCGCCGAG GTCGACCCCG TGGCGAAGCA AATCGGCGCG GTGAACACGT TGGTTCGTCA ACCGGATGGA TCGTTAAAGG GTTACAACAC CGATTATGTC GCCGCGATCG AGGCTATCGA AAACGCGATG GAGAAGAAAA CGGGCGTCGC CGCCGCGAAG TCCTTGGCTG GGAAAACAGT CGTCGTCATC GGCGCCGGCG GTGCGGGCAA GGCTTTGGCG TTCGGCGCTA AGTTTAAGGG CGCCAACGTC GTCATCGCCA ATCGCAGCGT CGAGCGCGCA CAGGCGCTCG CTGACGCGTG CGGCGGCGTC GCGGTGTCGC TCGAAGATTT AGCGAGCGGT AGCGTCGTCG GCGACGTCTT GGCCAATAGC ACCTCGGTCG GGATGCAACC GAACGTTGAA GACACGCCGA CGCCAGCGTC TGTACTCGGA GGGTTTTCCG TCGTCTTCGA CGCGGTGTAC ACCCCGCTCG AGACGCGGCT CTTGCGCGAA GCCAAGGCGA GTGGGTGCGA AATCGCGAGC GGGCTGGACA TGTTCGTCGG GCAAGCGGCG AGGCAGTTCG AGCTCTTCAC CGGGAAAGAG GCCGAGGTTG AGCTCATGCG CGACGCCGTG TTGTCGAGCA TAAAAAGGTA A
|
Protein sequence | MRVRARAACK LTTSVIAPNL ELALRDVDDA VRKGADVVEL RVDFLRDDAA RGTLGDAIEA LIKASPVPVI VTNRPTWEGG QDDGDESARL EALWRAHECG AAYVDCEALA AERFFAAKPA TATLRNGKTK IILSSHNYDE TPNDEILAEI HAKCVRLGAD IVKMASVCNA VEDVARLEKL LRTKGREIET IVLGMSEHGQ VSRLLAAKFG SFLTFGAIRR GEESAPGQPL LEELRDLYRV PTQTAATKVM GVIGNPIGHS KSPALHNPCL AAAGVDACYV PLLVKDIKTF LASPLFGSKD FVGFSVTIPH KEDALECCAE VDPVAKQIGA VNTLVRQPDG SLKGYNTDYV AAIEAIENAM EKKTGVAAAK SLAGKTVVVI GAGGAGKALA FGAKFKGANV VIANRSVERA QALADACGGV AVSLEDLASG SVVGDVLANS TSVGMQPNVE DTPTPASVLG GFSVVFDAVY TPLETRLLRE AKASGCEIAS GLDMFVGQAA RQFELFTGKE AEVELMRDAV LSSIKR
|
| |