Gene Information |
Locus tag | OSTLU_39526 |
Symbol | |
ID | 4999868 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 326136 |
End bp | 327806 |
Gene Length | 1671 bp |
Protein Length | 536 aa |
Translation table | |
GC content | 64% |
IMG OID | 640415289 |
Product | predicted protein |
Protein accession | XP_001415457 |
Protein GI | 145340698 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase |
TIGRFAM ID | [TIGR00479] 23S rRNA (uracil-5-)-methyltransferase RumA |
|
|
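The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that Start bp and End bp are 1-based and inclusive (which matches the listed Gene Length), and that the coding sequence ends in a single stop codon. The helper names `span_length` and `unaccounted_bases` are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 326136
END_BP = 327806
PROTEIN_LENGTH_AA = 536


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def unaccounted_bases(gene_length: int, protein_length: int) -> int:
    """Bases in the gene span not explained by the protein's codons
    plus one stop codon (assumption: exactly one stop codon)."""
    return gene_length - 3 * (protein_length + 1)


gene_length = span_length(START_BP, END_BP)
extra = unaccounted_bases(gene_length, PROTEIN_LENGTH_AA)

print(gene_length)  # 1671, matching the Gene Length field
print(extra)        # 60
```

Under these assumptions, 60 bases of the 1671 bp span are not translated. That is consistent with the record flagging the gene as spliced, although the record itself does not list intron coordinates.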
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.612078 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCGCG CGTTCGCCCG CGTCGTCCTC GCGCGCGCGC TCGCCGCCGC GCGCGCCCCC GTCGCGCACG CCGCGCGTCG TCTCGCCGTC GCGCGTCCGC CCTGCGCCGC GAAATCCACC GCCGCCGCCG CCGCGTCCGA CCGCGGCGAC GTTCGCGTCG GCGACGAAGT CTCCGTGGAG TGCGTCGATT TCGCCACGTC GGGCGAGGGC GTGTGCAAGC TCGCGAATGG GATGGTTTTG CTCTGCGACG GCGCGACGCC CGGAGAGGTG GTGCGCGCGC GGGTGACGAA GCTGCGGAAG AAGCTGGCGC ACGGCGCGAA GACGGCGACG GAACGCGCGG CGCCGAACGC GGTGACGGCG CCGTGCCCGC ACCACGAACA GTGCGGAGGG TGCGCGTGGC AGCACGTGAA TTACGACGCG CAGGTGGGGC ATAAGCGGAA TCGGGTGGTG GACGTGCTGG CGAGAATATA TAAAGCGGGC GAGGACGCCG AGGCGAAAGT CGGCGCGTGC GTCGGCGCGG ACGAGACGTC GAGGTATAGG AATAAAATGG AGTTCGCGTT CGCGAGTGGA ACGAGAGGAA AGACGGTCGT CGGGTTGCGA CCGCGAGGGG CGAACGATTC GGTGGTGGAT TTAAGCGGTG GATGTCTGTT GCAGAGCGAG GAGGCGGATC GCGTGTTGGC GGCGATTCGG GAGACGCTGG AACGCGCGGA CGGGCGCTTG GAGGCGTTCG ATCGCACGAG CGGCGAAGGC ACCCTTAGAA GCGTGACGAT TCGCACGGCT GGTGGCGAGC GCGGCGGTGA GAAGGCTGTG ATGGTCGATT TAGCGACGAC GGCTTCGCCG AACGAACTGA AAACGGGACC GCTCGCGGGA CTCATCGACG TCGTCTCGAA GGTACCGGGC GTGGTGTCCG TGGTGCACAC TTCCGTGCCG AGCGAAGCCG AACTCCGACG CGCGGGCGGT GGACGCTCGT CAAAGTTCGT CAAGGGTGGC ACGACGACGA AGGCTGGGTC AACTAAGAAA GTCGAGGCGG TGTTCGGCGA GAACAAATTA GTCGAAACGC TCAACGGCAT CGACTTTGAA CTTTCTTCGG CGTCCTTCTT TCAGACCAAC ACTGAGCAGG CTGCGAGATT GGTGCGACAG GTTCGCGAGG CGTGCGCTTT CAGCGGCGAT AAGTCTGAGA TCGTGCTCGA CCTGTTTTGC GGTGTCGGTA CGATGGGACT CAGCGTCGCG AGCGACTGCT CGCGAGTGAT GGGATGGGAA GTCGTTCCAG AGGCGGTGAA AGACGCGAAA CGCAACGCCG AGCTGAACAA CATCACGAAC GCCAAATTCT ATCGCGTTGA TTTGGCGAGA TTAAATCCGT CCAAAGGCCC GAAAGGTCTT CTCACGACGC CGAAAGGCAA AGAGCTTCCC ATGCCGGACA TCGTCATCAC GGACCCGGCG AGGCCTGGTA TGGACTCCGC ACTCATCGCG ATCCTGCGCA CAATCGGTGC TCGGCGTATC GTCTACGTGT CGTGTAATCC CGCGACGCAA GCGCGAGACT TGCTACTTCT CACGGCGCCG TCGGAGGGCG CGGACGACGT CGCGTACGAG CTCAAAACCG TCACGCCCGT CGATATGTTT CCTCACACGA CGCACGTCGA GTCCGTCGCC GTGCTCGAAC GCAAAGCTTA G
|
Protein sequence | MPRAFARVVL ARALAAARAP VAHAARRLAV ARPPCAAKST AAAAASDRGD VRVGDEVSVE CVDFATSGEG VCKLANGMVL LCDGATPGEV VRARVTKLRK KLAHGAKTAT ERAAPNAVTA PCPHHEQCGG CAWQHVNYDA QVGHKRNRVV DVLARIYKAG EDAEAKVGAC VGADETSRYR NKMEFAFASG TRGKTVVGLR PRGANDSVVD LSGGCLLQSE EADRVLAAIR ETLERADGRL EAFDRTSGEG TLRSVTIRTA GGERGGEKAV MVDLATTASP NELKTGPLAG LIDVVSKVPG VVSVVHTSGG TTTKAGSTKK VEAVFGENKL VETLNGIDFE LSSASFFQTN TEQAARLVRQ VREACAFSGD KSEIVLDLFC GVGTMGLSVA SDCSRVMGWE VVPEAVKDAK RNAELNNITN AKFYRVDLAR LNPSKGPKGL LTTPKGKELP MPDIVITDPA RPGMDSALIA ILRTIGARRI VYVSCNPATQ ARDLLLLTAP SEGADDVAYE LKTVTPVDMF PHTTHVESVA VLERKA
|
| |