Gene Information |
Locus tag | OSTLU_50879 |
Symbol | |
ID | 5004474 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 201526 |
End bp | 203421 |
Gene Length | 1896 bp |
Protein Length | 607 aa |
Translation table | |
GC content | 62% |
IMG OID | 640419895 |
Product | predicted protein |
Protein accession | XP_001420444 |
Protein GI | 145352203 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1190] Lysyl-tRNA synthetase (class II) |
TIGRFAM ID | [TIGR00499] lysyl-tRNA synthetase, eukaryotic and non-spirochete bacterial |
| |
Plasmid Coverage Information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0733722 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCTCG CGCGCGCGTT CGCGCGAAGG ACGGTCTCGA CGCCGCGCGC GTCGCTCGCG GCGCGCCCGC GGGCGTCGAC GCGCGCGAGC GAGCGCGAGA GCGCGCGCGC GACGGCGACG GCGGCGACGA CGCGACGACG AGGCGCGACG ACGAGGACGC GCGCGCGCGC GAGCGACGCG GGAACGAATA AAAAGCGCGA CGGCGGCGCG GGGAATCGAA AGCGCGGCGA GGGCGGCGGA CGAGACGCGA GCGGGGTGAC GAGCTCGCCC GAGGAGGTCA AGGCGCTGCG CGCGCAAAAG CTGGACGCGC TCGGCGCGCT CGGACAGCGG GGATTCGATT ACCGGTTCGA TCGCACGAAA TATTGCGATG CGCTGCAGAG GGAACACGAA GGATTGGAGA ACGGGGTCGA GATCGAGGGC TCGAGCGAGG CGGTGTGCGG AAGGGTGATG GCGAAGCGGT CGTTCGGGAA GTTGGCGTTT TTGTCGCTGG TGGACGAACG GGGAAGCGTG CAGTTGTTTT GCGATAAGAA ACGGTTGGAT GAGACGAGCC CGGGGGCGTT TGAGATGATC ACGGAACTCG TGGACGTCGG GGACATCATC GGCGTGCACG GGAGCGTGAA AAGGAGCGAT AAGGGGGAGC TGTCCATCGT GCCGGCCAAG GTGCAAATGT TGACAAAGGC GCTGCTGCCG TTGCCGGATA AGTGGCACGG ATTGCAGGAC GTGGAGAAGC GATACCGGCA GCGATACGTG GACTTGATCG TGTCGCCCGA GGTGCGAAAC ACGTTCAAGG CGCGCTCGAA CATCATCTCC ACCATTCGTC GAATGCTCGA CGACGACGGG TTTTTGGAGA TGGAGACGCC CGTGCTGCAC ACGCAAGCGG GCGGCGCGGA CGCGAAGCCC TTCAACACCT TCCACAACGC GCTCGGCATG CAGCTCACGC TTCGTATCGC CACCGAGTTG CATCTCAAGC GACTCGTCGT CGGCGGTTTC GAACGCGTGT ACGAGCTCGG ACGCGTGTTT CGCAACGAAG GCTTGAGCAC GCGACACAAC CCGGAGTTTA CGTCCATAGA AGTGTATCAG GCGTACGCCG ACGTCACGGA CATGTTGGAG CTCACGGAAG AGATGATTTG TCGATGCGCG ATGAAGGCGT GCGGGACGCT GACGATTCCT TACGGCGACG TGACGATTGA TTTGAGCCAA CGCCCGTGGC GCCGGGCGCC GATGAACGAT CTAGTCAAGG AAGCCACGGG CGTAGACGTC ATGGCGTTCG GGGATGACTT GGAAGGAGCC AAGGCAGCGG CAATTCCCGC GCTCAAGGCG CACTCTAAGA AAGCTGGGGA AGGCATCAAA GGTGTCAAGT TGGCGGCGAG CGTCGGTCAC GTGCTGAACG AAATGTTTGA AGCCGCGTGC GAAAGCGATT TGATTCAGCC CACGTTCGTT CTCGACCACC CGCTCGAGAT TTCTCCTCTC GCGAAACCGC ACCGTAGTAA ACCGGGAGTC ACCGAACGGT TCGAGCTCTT CGTCGTCGGT CGCGAGCTCG CGAACTCGTT CAGCGAGTTG ACCGATCCCA TCGATCAACG TAAACGCCTC GAGGCGCAGA TGGTGACGCA CGCGAAAACG AGCGCGGCGC AGCGCGAAGC GGCGGCGGCG AGTGGTAAAG ACGAACTCAA AGCGCTCGAA GACGAGGCGT ACGACGTCGA AATGGATGAA GATTTCGTCG CCGCGCTCGA GTACGGCATG CCCCCGACCG CGGGTATGGG TCTCGGCGTC GATCGTCTCG TCATGCTTCT CACGAATTCG 
Gene sequence (continued) | CCGTCGATTC GCGACGTCAT CGCGTTCCCG CTTCTTAAGA AGCAAGACTC GTAAGTCATA CGCGAGAGTA TATATATTCT TAGCGGCGCT ACGACA
|
Protein sequence | MLLARAFARR TVSTPRASLA ARPRASTRAS ERESARATAT AATTRRRGAT TRTRARASDA GTNKKRDGGA GNRKRGEGGG RDASGVTSSP EEVKALRAQK LDALGALGQR GFDYRFDRTK YCDALQREHE GLENGVEIEG SSEAVCGRVM AKRSFGKLAF LSLVDERGSV QLFCDKKRLD ETSPGAFEMI TELVDVGDII GVHGSVKRSD KGELSIVPAK VQMLTKALLP LPDKWHGLQD VEKRYRQRYV DLIVSPEVRN TFKARSNIIS TIRRMLDDDG FLEMETPVLH TQAGGADAKP FNTFHNALGM QLTLRIATEL HLKRLVVGGF ERVYELGRVF RNEGLSTRHN PEFTSIEVYQ AYADVTDMLE LTEEMICRCA MKACGTLTIP YGDVTIDLSQ RPWRRAPMND LVKEATGVDV MAFGDDLEGA KAAAIPALKA HSKKAGEGIK GVKLAASVGH VLNEMFEAAC ESDLIQPTFV LDHPLEISPL AKPHRSKPGV TERFELFVVG RELANSFSEL TDPIDQRKRL EAQMVTHAKT SAAQREAAAA NEAYDVEMDE DFVAALEYGM PPTAGMGLGV DRLVMLLTNS PSIRDVIAFP LLKKQDS
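The numeric fields in this record can be cross-checked against each other with simple arithmetic. The sketch below is not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the spaces inside the sequence fields are only display separators between 10-character blocks. The function names (`span_length`, `strip_blocks`, `gc_percent`) are hypothetical helpers introduced for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def strip_blocks(seq: str) -> str:
    """Remove the whitespace that groups residues into display blocks."""
    return "".join(seq.split())


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (blocks allowed)."""
    bases = strip_blocks(dna).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record: Start bp 201526, End bp 203421.
print(span_length(201526, 203421))  # 1896, matching the Gene Length field

# First two display blocks of the gene sequence, used only as a short example.
first_blocks = "ATGCTGCTCG CGCGCGCGTT"
print(len(strip_blocks(first_blocks)))  # 20
print(gc_percent(first_blocks))
```

To compare against the GC content field, `gc_percent` would be applied to the full gene sequence after joining its two lines. Because the record marks the gene as spliced, the protein length (607 aa) is not expected to equal the gene length divided by three.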