Gene Information |
Locus tag | OSTLU_16330 |
Symbol | |
ID | 5003387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 44085 |
End bp | 45305 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | |
GC content | 64% |
IMG OID | 640418808 |
Product | predicted protein |
Protein accession | XP_001419053 |
Protein GI | 145349256 |
COG category | |
COG ID | |
TIGRFAM ID | |
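The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the record lists the gene as unspliced). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    coding_bp = gene_length - 3 if has_stop_codon else gene_length
    if coding_bp % 3 != 0:
        raise ValueError("coding length is not a multiple of 3")
    return coding_bp // 3


length = gene_length_bp(44085, 45305)
print(length)                     # 1221
print(protein_length_aa(length))  # 406
```

Both values agree with the "Gene Length" and "Protein Length" rows of the record.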
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0411346 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCGAAAC TGGAGACGGC GAAGACGGCG CTGCGCGGCG CGAGCGGGCC GGACGACGGC GATCGCGCGA TCGAACTCGC GGGCGAGGCG CTGGAAAAGT TTATGGAGAT CGCGAACGAC GCGGACGCGG ACGAGGCGAG CGTGCGACGG GAGACGGCGC GCGCGCACTT TGTGTACGGA GAGGCGCTGT TTCGAGGCGC GCAGGCGCAA AACACGGTGT TCGGGGAACA GGTGCGGGCG AACGCGGAGG CGAGCGGGAC GAGGCTGGAG GACGCGCCCG AGGACGAAGA CGTGGGCGAG GAAGACGAGG AAGAAGGAGC GGGCGCGACG GAGGGCGAGA AGAACGGTAA GGAAGCCGCG GATGACGAGG ACGAGGGCGA GGAGGATGAG GAAGAGTCTG ATATGGAGTT GGCGTGGAAG ATGTTGGAGA CGGCGCGCGT GATGTTTGAA GAGGACGCGA ACGCGGCGTT GGAATTGGCG GATGTTTTAG AGACGATCGG GGAGTTGAAC ATGGAACAGT CGCAATTTGA TACGGCGTTG TCGGATTACA AGTCGGCGTT GAAGCTCTTG GAAGAAAACT TGGAAGCGAC GGATAGGCGT TTGGCGAGCG CGCTGTATTC GATTTCCATC GCTAATCAAA TGATGGAGGC GAATGACGAC GCGCTCGCGG CGAACACGCG CGCGATCGAA ATCTGTGACG CGCGAATCGC GGAGCTCAAA GCTGGGACGG CGCGCGTGAG CAAGGGTGCG CGCGAGAACG CGGATGAAGT CGTCTCGCCC GAGGCTGCCA TCGCCGAGTT GGAGCAAATA ATGGGCGTGG CGTCCGATTT GAAGGAGCGC CAACTCGAGC TGAAAGAGCT CGTCAGTGCG GACAACTCCA CGCGCGAAGC TCTCCGACAG GCGTTCAAGG CGATCGGCGG TGCAGCGCCC CCGGGTGCGT CGGAGCCGGA GGAGAGCGCC GGTTTCGCCG CTCCGACGCT TACGTCGAGC GTTCCCGTGC AAGCGGCGCC TGTTCGCCGC GTATTACCCG CGCCTGTTCG CCGCGTCGAG GTCGCACCGC TTCAAGAAGC GCCGGCGAAG CGCGTGGAGC CGCAGCAAAC GTCCGCGCCC GCTGCCGCTC CAGAGGCGAA GAAGATGAAA CCGACGCCCG TGGACAAAGC CGCGTTGATT GGTGCGACTG CTCCCAAGGA CGCCGAACCA AACGGATGCC CGCAGCAGTA G
Protein sequence | MAKLETAKTA LRGASGPDDG DRAIELAGEA LEKFMEIAND ADADEASVRR ETARAHFVYG EALFRGAQAQ NTVFGEQVRA NAEASGTRLE DAPEDEDVGE EDEEEGAGAT EGEKNGKEAA DDEDEGEEDE EESDMELAWK MLETARVMFE EDANAALELA DVLETIGELN MEQSQFDTAL SDYKSALKLL EENLEATDRR LASALYSISI ANQMMEANDD ALAANTRAIE ICDARIAELK AGTARVSKGA RENADEVVSP EAAIAELEQI MGVASDLKER QLELKELVSA DNSTREALRQ AFKAIGGAAP PGASEPEESA GFAAPTLTSS VPVQAAPVRR VLPAPVRRVE VAPLQEAPAK RVEPQQTSAP AAAPEAKKMK PTPVDKAALI GATAPKDAEP NGCPQQ
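The gene and protein sequences are shown in space-separated blocks of ten characters. As a minimal sketch, the helpers below strip that spacing, compute the GC fraction, and translate a coding sequence. Because the "Translation table" field of this record is empty, the standard genetic code is used here as an assumption. All function and variable names are hypothetical.

```python
# Standard genetic code (assumed; the record does not state a translation table).
# Codons are enumerated in TCAG order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence from the record above.
prefix = "ATGGCGAAAC TGGAGACGGC"
print(translate(prefix))          # MAKLET
print(gc_fraction("ATGGCGAAAC"))  # 0.5
```

The translated prefix matches the first six residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the "GC content" and "Protein sequence" rows to be checked in the same way.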