Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_34045 |
Symbol | |
ID | 5001008 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 231846 |
End bp | 233432 |
Gene Length | 1587 bp |
Protein Length | 505 aa |
Translation table | |
GC content | 58% |
IMG OID | 640416429 |
Product | predicted protein |
Protein accession | XP_001416591 |
Protein GI | 145344131 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.790257 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCTGC AGATGACGCC CGCGAGCGGG GCCCCGTTCG GGGGGTCCAG GGTGGTGGAC TTTGTCGCGG CGTGGGACGC GATATCGCCG CGATTGGCGG CGCGCGAACA CGGAGAGGCG GACAAGGAGA TGCTGCTGAG CGCGTTGACG CTCGCCATAC CGCGGCTGCA GCAGATCGTG CGAACGGAAG GGAAAGGAAA GAAGCCGGCG CACGAGCGAG CGGTGGACGT GACGCTCACG CTCGTGGACA TGGGGATGGA CGCGGAATGC ATCTCCGCGG GATTGCTTCG AGAGGCCGTG GTGCACGGGA CGGTGAGTTT GGATGAGATA GAGGACGTGC TCGGGGAGCG GGTGATGCGG TTGGCGCACG ATGTGGGGCG AGTGCACGAT TTACCGAGAC GAGTGCACTC GTACGACGAA AACGCGGCGG AACGCTTGCG GTCGTTTTAC CTGTCGTTTC ACGACATAAG AGCGATCGTG GTGGAGTTGG CGTGCAGACT TGATACTTTG AGAAACATCG ATGAGCTTGA GCCGATCCAG CGGACGACGA TTGCGTTGGA GACGATGCAG ATTTACGCCC CCATGGCGCA CGCGTTGAAC ACGGGCGCCC TTTGCGCAGA GTTGGAAGAT TTAGCGTTTA AGATTTTGTT TCCGACGTCT TACGCCTCGC TGGAGAGGTG GCTGACGGCA AAAAGTCCAG GGGACTCGGC CATCTTGGAT AAGATGACAA AGATGCTCAC GGATACGATG AACGCCGACA CGACGATGAA CGCGTTGATC GGCCGCGGAG GGGTCAAGGT TTTGGCGAGA AGGAAATCTC GTTACTCCAC CATGAAGAAG ATCATTCGCG ATGGGCGAAA GCGTGAAGAG GTGCACGATT TGCTCGGACT ACGCCTCGTT TTGACCCCGC AACCTGGGAG TGGGGCGGAA TTGCCCGGCA TGGGTCAGGT GTATTCGGGT GAGGTTACGT ACGAGCACAT GGAGGCGAGA GCGTTAAAAG CGGCGAACGC GGCGTGCTAT CGCGCACAGC AAATAGTGCA CGGTTTATTT CCAGCCGTAA GCGGACGAAC CAAGGACTAC ATCAGTGATC CCAAGGCGAA CGGATACTCG TCCTTGCACA GCACTCTCAA GGTCGCGTTT GGCGACGACG GCAAACCTTT GGCGTCGAGC GAAGCGTACA AACGCGGCGT GAATGTCGAG ATGCAGATCC GAACGGCAGC CATGCATCTC GCCGCTGAAG CGGGCACTGC GTCGCACAAT TCGTATAAGG GTGGCTTGAA GGAAGACACG GGTATGGCGG GCTCGCTCGC GGACTTGGCC TCGGCAGCGA ATCGCGCCGC GGAGGAAAAG TTTGGCGCTT TCACGCACGC CGATTTGCGC GAACGCGACG AACTCCACGA CAGATTGTTC GAAGCATTCG ATCTTGATGG TGATGGTCGC GTCACGATGA GTGAGCTGCG TACGGTACTG GAGAAAATAT GGGACACGGA GATGAACGAT TTGCGCGAGG AAGCTCACGC GTTGATGGAG CTCCTCGACG TTAACCAGGA TGGTACTATC GACGCTGATG AATTTGCAAA GTTCCGC
|
Protein sequence | MELQMTPASG APFGGSRVVD FVAAWDAISP RLAAREHGEA DKEMLLSALT LAIPRLQQIV RTEGKGKKPA HERAVDVTLT LVDMGMDAEC ISAGLLREAV VHGTVSLDEI EDVLGERVMR LAHDVGRVHD LPRRVHSYDE NAAERLRSFY LSFHDIRAIV VELACRLDTL RNIDELEPIQ RTTIALETMQ IYAPMAHALN TGALCAELED LAFKILFPTS YASLERWLTA KSPGDSAILD KMTKMLTDTM NADTTMNALI GRGGVKVLAR RKSRYSTMKK IIRDGRKREE VHDLLGLRLV LTPQPGTLKA ANAACYRAQQ IVHGLFPAVS GRTKDYISDP KANGYSSLHS TLKVAFGDDG KPLASSEAYK RGVNVEMQIR TAAMHLAAEA GTASHNSYKG GLKEDTGMAG SLADLASAAN RAAEEKFGAF THADLRERDE LHDRLFEAFD LDGDGRVTMS ELRTVLEKIW DTEMNDLREE AHALMELLDV NQDGTIDADE FAKFR
|
| |