Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_38791 |
Symbol | |
ID | 5002134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 288023 |
End bp | 289129 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | |
GC content | 60% |
IMG OID | 640417555 |
Product | predicted protein |
Protein accession | XP_001417727 |
Protein GI | 145346505 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0409062 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGGCG TCGACGTCGT CAACGTCGAT CTGGGCGATC GATCGTACCC GATCTACGTC GGGACGGGGC TGCTGGACGA CGGGGACGCG CTGCGCGCGC ACGTCGCGGG GTCGACGGCG CTCGTGGTGA CGAACGAAAC CATCGCGGGG CTTGGATACC TCGATCGCAC GGTGAAAGCG CTCACGGCGA AGGATTCGAA ACTGCGCGTG GAGACGGTGG TGCTGCCGGA CGGAGAGGAG CATAAGAATT TGGAGGTGCT GAACGCGGTG TACACGAGGG CGCTGGAGAC GCGACTCGAC CGCGGGACGA CGTTCGTGGC GCTGGGGGGG GGCGTGATCG GTGATATGAC GGGATACGCC GCGGCGTCGT ATCAGCGCGG GGTGAAGTTC GTGCAAATAC CGACGACGGT GATGGCGATG GTGGATAGCT CGGTGGGGGG GAAGACCGGG GTGAACCACG CGCTCGGGAA GAATATGATC GGGGCGTTTT ATCAGCCAGA GTGCGTTTTG ATCGATATCG ATTCGTTGAA GACGCTTCCC GATCGAGAGT TCGCGAGCGG GATCGCAGAG GTGGTGAAAT ACGGTCTCAT TCGCGATGGG CCGTTTTTCG AATGGCTCGA GGCGAACGTC GATAAGCTTC TCGCGCGCGA TACGCAAGCC ATCGCGTACG CCGTCGAGCG ATCGTGCGTG AACAAGGCGG AAGTCGTCGC CGCGGATGAG AGGGAGGGCG GCGTTCGAGC GACGCTGAAT CTTGGGCACA CGTTCGGTCA CGCGATAGAA ACCGGTCTCG GCTACGGCGA GTGGTTGCAC GGCGAAGCGG TGAGCGCCGG TATGTGTATG GCGGCGGATA TGTCTCTTCG ACTCGGTTGG ATCGACGCCT CGCTCAAGGA GCGCACGATC GCCTTATTGA ACAAGTGCAA AACCCCGATC GACGTCCCTG AAAAGATGAC GGTTCAAATG TTCATGGACT TGATGGCGGT GGATAAGAAG GCTGCGAATG GGAAATTGCG CTTGATTTTG TTAAAGGGCG AGCTCGGCGA GTGCGTCTTC ACTGGGGACT TCGACCAAAG CAAGCTCCAG GAAACCTTAG ACGCGTACGT CAAGTAA
|
Protein sequence | MDGVDVVNVD LGDRSYPIYV GTGLLDDGDA LRAHVAGSTA LVVTNETIAG LGYLDRTVKA LTAKDSKLRV ETVVLPDGEE HKNLEVLNAV YTRALETRLD RGTTFVALGG GVIGDMTGYA AASYQRGVKF VQIPTTVMAM VDSSVGGKTG VNHALGKNMI GAFYQPECVL IDIDSLKTLP DREFASGIAE VVKYGLIRDG PFFEWLEANV DKLLARDTQA IAYAVERSCV NKAEVVAADE REGGVRATLN LGHTFGHAIE TGLGYGEWLH GEAVSAGMCM AADMSLRLGW IDASLKERTI ALLNKCKTPI DVPEKMTVQM FMDLMAVDKK AANGKLRLIL LKGELGECVF TGDFDQSKLQ ETLDAYVK
|
| |