Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_43434 |
Symbol | |
ID | 5006696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 41894 |
End bp | 43483 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | |
GC content | 65% |
IMG OID | 640422117 |
Product | chloroplast lysine N-methyltransferase |
Protein accession | XP_001422460 |
Protein GI | 145356486 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.000449884 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.898898 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTCGC GCGTCGCGCG CGACGTCTTC GCGGTCGCGC GATCGCCGCG CGCGACGCGC GGGACGTCGA GACGGCGCGC GCGATGGGGC GACGCGACGA CGTCGAAAAC GCGTCGCCCG CGGACGCGCG CGAGGCGCGA CGCGGCGTCG AGCGCGGATC ACGACGCCCT GCACGAGTGG TTATCCGCGA ACGGCGCGGA CGTGGCGAGC GTGGAGTTTT ACGACGCGCG CGCGGGCGAC GAGGACGACG GCGGCGACGC GGGGTGGGGC GCGCGAGCGA CGCGAGCGCT GGCGCGAGGC GCGAAGGCGA TCGTGGTGCC GAAATCGCTG TGGATCACGC CCGAGGTCGG GATGAACGAT GATGAACTCG GGAAGGCGCT GCGGGACGAG GACGTCGCCG GTGGATTGGC GCGATGGACG ACGTTGGCGT TGACGCTGCT GAAGGAACGC GAACGCGGAG AAGAGTCAAA GTACGCGGCG TACGTGAAGA CGCTGCCGGA AGTTTTACAC TCGCCGTTGT TTTGGAACGC GGAGGAATTG AGTGAGATTC AAGGGACGCA GCTGTTGGAT AACGCGGCGG GGTACGATGG GTACGTGCGG GGGGTGTACG AGACGTTGAG AACGGGGATG TTCGCGAAGC ACGCCGACGT CTTCGACGTC GAGGGCGCGT TCAGCGAGGA CAACTTTCGA TGGGCGTTCG GGATCTTACG CTCGCGCACG ATGGCACCGT GCGATGGGGC GAACATCGCG CTCGTGCCTG GCGTAGATTT AGTCAATCAC AGCTCGTTGA GTCAGGCGAG ATGGCGAGTG AGCGGCGGCG TCGCGGGCGC CGTCGCCGGA TTGTTTGGGG GCGGAAAGGG CGACGACGGC GTCTCGGCGC GCGTCGAGTG CGATCGAGCG CTGAACGTGA ACGAGCCGTT GTACGTAAAT TACAACCCAG AAGGCACGGA CACTTCGTTT GCGCTCGATT TCGGGTTCGT GGACACCATC ACCCCGAGCC CCGGGTACGC GTTGTCGCTC TCCGTGCCCG AAGACGATCC AAATGTCTTC GACAAGCTCG ACGTTTTAGA CGTGTGCGGG CTCGGCGAGA CCCCGACGTT TACGCTCCGC GCGTACAGCG ATCCAGATCC CGACCTCAGA ACGTTTTTGC GATTGCTCAA CTGCAAAAAT CAGGACGCGT TCTTGCTCGA GGCGTTGTTT CGTCAACAGT GCTGGTCGCT CATCTCCGAG CCGCTCTCGC GCGAGAACGA AGCCGACTGC TGCGCGTCCA TGACGGACGG CGTCGCCGCC GCCTTATCCG CCTACGCCTC TCGCGCGTTG GACGAAGAAA AAGCCTACCT CATGTCTCCC CCGAGCGCGC GTCGCGCCGC CGCCGGCGAC GACGAGCGCG CGCGCGTTCG CAAGGACGTC GCCGTCCGCG TCCGCCTCGC CGAAAAATCC ACCCTCATCG AAACCGCCTC CTTCTTCGAC GTCATCGCGA GCGGACTGGA CGGCATGGAG TACTACCAAG AGCGTCGGCT TCGCTCGTTA AACTTACTCG ACGAAGACGG TTCGAGCACG TACGATCCGT TCAACGAAAC CATGGCGTGA
|
Protein sequence | MTSRVARDVF AVARSPRATR GTSRRRARWG DATTSKTRRP RTRARRDAAS SADHDALHEW LSANGADVAS VEFYDARAGD EDDGGDAGWG ARATRALARG AKAIVVPKSL WITPEVGMND DELGKALRDE DVAGGLARWT TLALTLLKER ERGEESKYAA YVKTLPEVLH SPLFWNAEEL SEIQGTQLLD NAAGYDGYVR GVYETLRTGM FAKHADVFDV EGAFSEDNFR WAFGILRSRT MAPCDGANIA LVPGVDLVNH SSLSQARWRV SGGVAGAVAG LFGGGKGDDG VSARVECDRA LNVNEPLYVN YNPEGTDTSF ALDFGFVDTI TPSPGYALSL SVPEDDPNVF DKLDVLDVCG LGETPTFTLR AYSDPDPDLR TFLRLLNCKN QDAFLLEALF RQQCWSLISE PLSRENEADC CASMTDGVAA ALSAYASRAL DEEKAYLMSP PSARRAAAGD DERARVRKDV AVRVRLAEKS TLIETASFFD VIASGLDGME YYQERRLRSL NLLDEDGSST YDPFNETMA
|
| |