Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_18554 |
Symbol | |
ID | 5006063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | + |
Start bp | 264676 |
End bp | 266331 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | |
GC content | 70% |
IMG OID | 640421484 |
Product | predicted protein |
Protein accession | XP_001421895 |
Protein GI | 145355286 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.0184372 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0131926 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGC GCCTGCAGCC GTGGGCGAGC GTCCGGGAGT GGCGCGACGT CCGCGACGCG CTCGCGCGCG CGACGGAGGA CGAGGCGCGC GGCGACGGCG CGCGACGCGG CGCCGCCGGG GACGTCGAAA TCGACGCCGC GCTCGGCGTC GTGCGCGCGT GGCGCGCGCG CGGGCGCGCG CCGCTCGCCG CGGACGTCAC CGCGATGCTC GTGCGCGCGT CGCGCGCGCG CGAGGGCGTC GATCGAGGCG ACGCGGACGC GGCGCGGTCG GCGACGGCGA TGACGCTCGC GAGACTGGTG AACGGCGTCG TCGACCCGAA GCAAAAGGGA CGGTACGCCG CGCCGATCGC GACGCTGGCG CGAGAGGTGG GACTGCCGCG ATTTTTAGTG GATCTGCGAC ACGAGTGCGC GCACGGGACG ATGCCGAGCG CGGGAGCGCT GCGGAGGGGC GCGCGGCGAG CGTTGGCGTG GTGTCGACGA TGGTATTGGG ACGAACAGGC GCGGGCGTTC GACGCGGCGT TCGAACGCGT GCGGGGATGC GTGCGAGCGA TGTGCGCGTG CGAAAAAGAC GCGAGAGGGC TGCGGGCGCG GGAGGGACGG GGCGCGGTGG AGTCGTCGAG CGAGGAGATG GACGAGGACG GGGCGAGCGA GGGCGAAAAC GCGGGACGGG AGAGCGAGGG CGGGACGTCG TTTAAGGACG TGCGAGAACG ACGACGACGG GCGATCGGAA CGTTGAGCAG CGTGTGTCCG AAGGGCGCGG CGCACGTCGT GGCGGAGGCG TTGCTCGACG GGGGGTGGCT CCGCGTCGTG GAGGACGAGA CGGCGAGCGA CGTCGACGAC GCCGACGAAG CGACGTTTCG CGCGAGCGCG GAAGATTGGC GGCCGACGCT CGAGCGACTG TGTCAGAAAT GGACCGGTTT GTTCGCGTAT CTGTTCGACG CCGCGATTCG AGGCGAGAAA CCGGGAAACG ATGTGGGGTT TCAAACTTTG TTGAGCGTCG CGGCGAGCGG CGACTTCGCC GCGAACGGCG ATCAGCGCGT CGCCGCGTTC CACGCGTGCA AACGCGCGCT CGCGAGCGTG CACGAAGACG ATTGGAGCGA TGACCCGGCG GTGGCGAAGA GAACGATCCG GACGCTGCAA AAAATTGCCG GCGTGTCAAA GGATGAAATC AAATCCGCGT CGCGTCGCGG CGCGGCGCCC GCGGACGCGC TCGCGAGCGC GAGAGCCGAC ATCGAGGCGC TTCGCGCGAC GCTTCAGTCC GGTCGTAAAC GCAAGCGCGA TTCGCGCTGG GAACGGGCGG AAGATTGGAC GCCGTCACCC ATAGGCGTCG TCGCGGGCGT CTCGGCGCGC GCGTTGGTCG ACGTCGCGCC GTCGTCGCGA ACGATTCGCG TCACATCCGG TGTCAGCGCG TCGAGCGACG CGGGGTATCG AAGCGCCACG AAATCGGCGA CGTATCCACG CGGCGACGAC GGTGACGACG ACGACGCCGC CGACGACGAC GAGGACGAAA ACGACGACGA GGACGAAAAC GACGACGGCG GCGAACCGTC CGAACGCGTC GGCGTCGCCG CCGCGCTCAA CGTCGCCGGC GGTCGCGTGG AGCTCTCAAA ATCGCAAGCC GCCGCCGTGG CGGCGTCTGT GGCGTGTCTG CTTTAG
|
Protein sequence | MSARLQPWAS VREWRDVRDA LARATEDEAR GDGARRGAAG DVEIDAALGV VRAWRARGRA PLAADVTAML VRASRAREGV DRGDADAARS ATAMTLARLV NGVVDPKQKG RYAAPIATLA REVGLPRFLV DLRHECAHGT MPSAGALRRG ARRALAWCRR WYWDEQARAF DAAFERVRGC VRAMCACEKD ARGLRAREGR GAVESSSEEM DEDGASEGEN AGRESEGGTS FKDVRERRRR AIGTLSSVCP KGAAHVVAEA LLDGGWLRVV EDETASDVDD ADEATFRASA EDWRPTLERL CQKWTGLFAY LFDAAIRGEK PGNDVGFQTL LSVAASGDFA ANGDQRVAAF HACKRALASV HEDDWSDDPA VAKRTIRTLQ KIAGVSKDEI KSASRRGAAP ADALASARAD IEALRATLQS GRKRKRDSRW ERAEDWTPSP IGVVAGVSAR ALVDVAPSSR TIRVTSGVSA SSDAGYRSAT KSATYPRGDD GDDDDAADDD EDENDDEDEN DDGGEPSERV GVAAALNVAG GRVELSKSQA AAVAASVACL L
|
| |