Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31872 |
Symbol | |
ID | 5001997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 676844 |
End bp | 677992 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | |
GC content | 56% |
IMG OID | 640417418 |
Product | predicted protein |
Protein accession | XP_001418082 |
Protein GI | 145347241 |
COG category | [R] General function prediction only |
COG ID | [COG0384] Predicted epimerase, PhzC/PhzF homolog |
TIGRFAM ID | [TIGR00654] phenazine biosynthesis protein PhzF family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.888244 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.28032 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGA GGCACGCGTA CGCGATCGCG AACGCGTTCC CGTCGGCGAC GAAAGGCGAC GACAGTGGGA ATCCCGCGGC CGTGGTGCTT TTGCCGGACG ATGGACCGTG GCCGCCGGAC GCGGTGATGC AGGCGGCGGC GAGCGAATTA GGCTTGAGCG AGACGGCGTT CGCGAAGCGG GCAGACGTGG AGGTCGTCGG CGCGATGGTG CATCGGATGT ACGACATACG GTGGTTTACG CCAAAGTGCG AGATCAATCT GTGCGGGCAC GCGTCGATGG CGACGGCGCA CGAGATTTTT CGAGCGCCGG GGAACGAGCA CGTCACTAAG ATTGGATTTT TATACGGCAA GCCGCGTGAT GTGTGCGATG GAACACAAGT GAAAGCCGCT TACGAGTCGT TGTTCGTGTG GAAAGATTTG GACGGCGGTG TAGAGTCGTC CTACGCGATG GCACTTCCGG AGGAACGCGC GAGATCCTTT CCGGACGCGC TTGGTGAGGA TGGGTTCTTA CACCCTCAAG AAATCGTGAA CATGATTCGA GGCTGCTTTG GCGACGACCA CGAGCGCTCG AACTCGCAGA GCTCTGCGAC TTCCAAGGCG AAGTACGAGC TTCGGTACAA CTTCATCGGC GACTTGTTTT TCATAATCGA CACGGATGAT GCACCCGACG AGGTGTTTGA GGCGTGTTTC GACCGCTTCA TGAATCACGC GCCCGATCTG AAGGCGATAT CGGAAGTGGG GAGGTGCTTC ATCGAGGAAA TGAAGTACGA CTTTGAAATG GTTCAAGGTT TTCGTGGTTT GTGCGTGCTC TTGACGGTGA AAAATCGGCC GAATCATTCG TACGATTTCT ACACGCGATG GTTTGGGCCG GATGTCGGTA TCGACGAAGA TCCCGTGACG GGTAGCGCGG CGAGCGGCTA TGCTAGATTT CTTGATGACA AGTTGCCCGA GGTCGTCGGT AAAAAGCGCG GTTGTCAAAT GTCGAAACCC AGGGGCGACA TCACGGTGAG TCTCTCGGAC AACCCCTACC CCGACACGGA TGTTCATGTC GAGGTTGTCG GCAAAGTTGC GACGAGATCG AGTGGTGTTC TCGAGGTTTC TGTTCTAAAG GATGGTGATA TATCAGTCGT GAATCGACGA AACTCCTAG
|
Protein sequence | MTTRHAYAIA NAFPSATKGD DSGNPAAVVL LPDDGPWPPD AVMQAAASEL GLSETAFAKR ADVEVVGAMV HRMYDIRWFT PKCEINLCGH ASMATAHEIF RAPGNEHVTK IGFLYGKPRD VCDGTQVKAA YESLFVWKDL DGGVESSYAM ALPEERARSF PDALGEDGFL HPQEIVNMIR GCFGDDHERS NSQSSATSKA KYELRYNFIG DLFFIIDTDD APDEVFEACF DRFMNHAPDL KAISEVGRCF IEEMKYDFEM VQGFRGLCVL LTVKNRPNHS YDFYTRWFGP DVGIDEDPVT GSAASGYARF LDDKLPEVVG KKRGCQMSKP RGDITVSLSD NPYPDTDVHV EVVGKVATRS SGVLEVSVLK DGDISVVNRR NS
|
| |