Gene Information |
Locus tag | OSTLU_40785 |
Symbol | |
ID | 5002206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 536665 |
End bp | 538158 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | |
GC content | 59% |
IMG OID | 640417627 |
Product | predicted protein |
Protein accession | XP_001418517 |
Protein GI | 145348146 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
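
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS includes its terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Amino-acid count for an unspliced CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # The stop codon is part of the CDS but does not encode a residue.
    return codons - 1 if includes_stop else codons


length = gene_length_bp(536665, 538158)
print(length)                     # 1494
print(protein_length_aa(length))  # 497
```

Both values match the Gene Length (1494 bp) and Protein Length (497 aa) fields of this record.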
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.725056 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.204992 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACGAG GGCGAACGAC GACGAACGCG AGGACTAACC CGGAGCGCGG CGCGCGCGAC GCGAGCAGGG CGCAGGCGAC GGCGAGCAAG TCGAAGGCGA GGAGCGCGGA CGTCAGCGCG ACGGGGGCGA ATTTGAAGGA GTGGAACCCG AAGTCTTGGC GACAGCGCGA GGCGCTGCAA CAACCGAACT ATGAAAATCA GGCTGAGCTG GAGGAGGCGC TGAAGGTGAT CGCCAACCGG CCGCCGCTCG TGTTCGCGGG AGAAGCGCGC GACTTGCAGG AAAAGCTCGC GAACGCGGCG GCTGGTAACG CGTTCGTCTT GTTCGGTGGT GATTGTGCGG AAAGTTTTAG AGATTTCACG TCGGATAACG TGCGGGATAC GTACCGGGTT TTGCTGCAAA TGTCCGTCGT GTTGATGTAC GGCTCGGGCG TGCCCGTGGT GAAGCTGGGA CGAATGGCTG GCCAGTTCGC CAAACCCCGT TCCGAAGACT TGGAAACGAT CGATGGGCTC TCGTTGCCGT CGTACAGAGG CGATAACATC AACAGCTGTG AGTTCACCCC GGAAGCGCGT CGGCCGGACC CTTCGCGTTT GGTCAAGGCG TACGATCAGT CGTGCGCCAC GCTCAACTTG CTGCGAGCTT TCAGTAACGG AGGGTACGCC GCGATGACGC GCGTGAGCGA TTGGAATTTG GATTTTATGG AAAACACCGA ACGAGGCAGT CAATATGAGG ATCTTGCGCA GCGCGTCGAC GCCGCGATTG ACTTTATGGC GGCGTGCGGC ATCGACGAAA CTCATCCGTC GATGCAAGAG ACATCCTTCT TCACGGCGCA CGAAGCTCTT CACCTGGGTT ATGAAGAATC GCTCACGCGT TTGGACTCCA CGACCGAGGA GCATTACGGC TGTTCCGCGC ATTTCTTGTG GTGTGGTGAG CGCACGCGCC AACCCGAAGG CGCACACATG GAATACTTCC GCGGTATTTC CAACCCGATC GGCATCAAGA TTTCCGACAA GAGCGACGGC GAGGGTGTGG TGAGCTTGGT GAAGAAGTTG AACCCGGACA ACGTCCCGGG TCGCATCACC CTCATCTCTC GCATGGGTGC TGCCAAGTTG CGCGAGCATC TTCCGCGTCT CATCACCGCC ATTGAAGACG CCGGGCTCAA CGTGTTGTGG GTTACGGATC CCATGCACGG GAACACCATC AAGACTGATA ACGGTTTCAA GACGCGTCCG TTCGAGGCGG TGCGCGACGA AATTATGGCA TTCTTTGAAG TGCACGAAAA GATGGGTACT TATCCTGGTG GGGTTCACTT AGAGATGACG GGGCAAAACG TCACCGAGTG CACGGGCGGC ATCATGGACG TTTCGGTGTC TGATTTGGAA AAGCGCTACC TCACCCATTG TGATCCGCGC TTGAACGCGA GCCAAGCCAT CGAGCTTGCG TTTTTGATGG CTAGCGAGTT GAACGATATG CGTCGTCGCC GCGCGGCGCA ATAA
|
Protein sequence | MGRGRTTTNA RTNPERGARD ASRAQATASK SKARSADVSA TGANLKEWNP KSWRQREALQ QPNYENQAEL EEALKVIANR PPLVFAGEAR DLQEKLANAA AGNAFVLFGG DCAESFRDFT SDNVRDTYRV LLQMSVVLMY GSGVPVVKLG RMAGQFAKPR SEDLETIDGL SLPSYRGDNI NSCEFTPEAR RPDPSRLVKA YDQSCATLNL LRAFSNGGYA AMTRVSDWNL DFMENTERGS QYEDLAQRVD AAIDFMAACG IDETHPSMQE TSFFTAHEAL HLGYEESLTR LDSTTEEHYG CSAHFLWCGE RTRQPEGAHM EYFRGISNPI GIKISDKSDG EGVVSLVKKL NPDNVPGRIT LISRMGAAKL REHLPRLITA IEDAGLNVLW VTDPMHGNTI KTDNGFKTRP FEAVRDEIMA FFEVHEKMGT YPGGVHLEMT GQNVTECTGG IMDVSVSDLE KRYLTHCDPR LNASQAIELA FLMASELNDM RRRRAAQ
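
The sequences above are displayed in blocks of ten characters separated by spaces, so they need cleaning before any computation. The following is a minimal sketch of stripping the whitespace and computing a GC fraction; the helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input. Applying `gc_fraction` to the full cleaned gene sequence is how one would compare against the GC content field of the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence, as displayed in the record.
sample = clean_sequence("ATGGGACGAG GGCGAACGAC")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.65
```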