Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_16515 |
Symbol | |
ID | 5003228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 471058 |
End bp | 472050 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | |
GC content | 59% |
IMG OID | 640418649 |
Product | predicted protein |
Protein accession | XP_001419176 |
Protein GI | 145349512 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01802] monofunctional chorismate mutase, eukaryotic type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.0913572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCGC CCGCCTCGCT CGCGAGCTCG AGCTCGATCG CGCGCGCGGC GAGCCTGCGC CGCGCGACGC GTCGCGGTGG TGACGGCGCG CGGGACGCCG CCGACGCGCG ACGCCGCCGA CGCGCGCTCG GCCGCGACGC GCGCGCGACG GCGGCGAGGG ATCGCGTGAA CGGAGGCGGC GTGAACGAGT CCGTGGATTA CGAAGACCTG AGCGATCGCT TGAAGCTGGA TAACGTGCGG CAGAGCTTGA TTCGGCAGGA GGATTCCATC ATCTTCGCGC TCATCGAACG CGCGCAGTAT AAGCTGAATT CCGCGATATA TGCGAAAAAT GCGGTGCCAG TGCCGTGCTT CGCGCCGAAC GGTGATCGAG CGTCGATGCT GGAGTTCATG CTTCGAGAGG TGGAGCAGAG TCACGGGAAG ATTCGAAGGT ATACGTCGCC GGATGAGCAC GCGTTTTATC CAGAGGCGCA GCCGCCGTTG GTGATTCCGC CGATCGCGTT TAAGGACGTT TTGCACCCGT GCGCGGAGTC GATTAACATT AATGATCGCA TCATGGAAAT GTACGTCGAT AATCTTTTGC CGGAGATGTG CGAGGGCGGG GATGATAACA ATTACGGGAG CGCGAGTCTG TGCGACTTGT CGTGCTTGCA AACGATTTCC AGGCGAATTC ATTACGGCAA GTACGTCGCT GAGTCCAAGT TTTTGGCGCA GCCGGAAGAA TACACGGAGT TGATCAAAGC GCAAGACGCC GACGGTTTGA TGGCACTTCT CACGAACCAA GCCGTGGAGG ACCGGGTCGT TCGACGGGTG GCAAACAAGG CGGCGGTGTA CGGTTCGGAT ATCAGCGAAG ACATTCCCGA TACTTTGGCC TTACCCGTCG GTTCTGAATC GTTAAAATTG GCCCCGGAAA AAGTGGGCGA GTTATACTAC CGCTGGATCA TGCCCATGAC GAAGGATGTT CAAGTGAAGT ACCTGTTACG TCGCCTCGAC TAA
|
Protein sequence | MTPPASLASS SSIARAASLR RATRRGGDGA RDAADARRRR RALGRDARAT AARDRVNGGG VNESVDYEDL SDRLKLDNVR QSLIRQEDSI IFALIERAQY KLNSAIYAKN AVPVPCFAPN GDRASMLEFM LREVEQSHGK IRRYTSPDEH AFYPEAQPPL VIPPIAFKDV LHPCAESINI NDRIMEMYVD NLLPEMCEGG DDNNYGSASL CDLSCLQTIS RRIHYGKYVA ESKFLAQPEE YTELIKAQDA DGLMALLTNQ AVEDRVVRRV ANKAAVYGSD ISEDIPDTLA LPVGSESLKL APEKVGELYY RWIMPMTKDV QVKYLLRRLD
|
| |