Gene Information |
Locus tag | OSTLU_37953 |
Symbol | |
ID | 5004117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 25191 |
End bp | 26684 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | |
GC content | 56% |
IMG OID | 640419538 |
Product | predicted protein |
Protein accession | XP_001420057 |
Protein GI | 145351377 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0215] Cysteinyl-tRNA synthetase |
TIGRFAM ID | [TIGR00435] cysteinyl-tRNA synthetase |
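
The length fields above can be cross-checked against the coordinates. The short Python sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the record states neither convention explicitly. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 25191
end_bp = 26684

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1494 bp) and Protein Length (497 aa).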
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.150647 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGCGAA AGAAGGAGAT TTTTACGCCG CGAGACCCGG CGGGGAAAAA GGTGCAAATG TACGTGTGCG GCGTCACGGT GTACGACTAT TCACACATCG GTCACGCGCG CGTGTACGTC GCGTTCGACG TATTATATCG ACAATTGATG CGTTTAGGGT ACGACGTGAC GTATTGCCGA AATTTCACCG ACATTGACGA CAAGATTATC AAGCGCTCAA ATGAGAGCGG GGAGACGTGC GAGGCGCTCA CGGATAAATT CATAGAGGCA TTCCACGAAG ACATGGCGGC GCTCGGATGC CTGCGTCCGA CGCTCGAGCC TCGTGCGACG GAGTGTGTGG ATGACATCAT CGCGTTCATC GAGCGTTTAA TCGCCAAAGG TAACGCGTAC GAGACGGAAG GGGACGTCTA CTTTTCCGTC GACACCTTGC CCGCATACGG GGCGTTGTCA GGGAGAAATC AAGAAGACAA TCGCGCGGGT GAGCGCGTGG CCGTGGACGG GCGCAAGAAA AATCCAGCCG ACTTTGCGCT ATGGAAGACT GCAAAACCGG GTGAGCCAAC GTGGACGAGT CCGTGGGGCG AGGGACGACC GGGCTGGCAC ATTGAGTGCA GCGCAATGAT TGAAAAAATG CTAGGACCGA CGATTGATAT CCACGGTGGA GGCCAAGACT TAGTTTTTCC GCATCACGAA AACGAGCTGG CGCAGTCTTC GGCGGCGTGC GGTTGTGGAG CGCACGCGGA TGAGAATCCG TTTGTGCGTT ACTGGGTGCA TAACGGCTTC GTCAAGGTGG ATTCTGAGAA GATGTCCAAG TCGCTCGGCA ACTTTTTCAC TATTCGCGAA GTGTTGGACA AGTACCATCC GTTCGTGCTA CGTTTCATGC TTCTTGGCGC GCACTACAGA GCGCCCATCA ACTACACACA GCGCGCGCTG GAGGAGGCTT CCGATCGCGT TTACTATTTG TACCAAACAG TTCACGATGT ACGAGCAATT CTTCGCGATG CCGCGGCGGA AGAGCCAGCT AAAAAGCCGG TACCGCTCGT TGCGGATGCG CTGAAGCTCG CGAGTGAGGC TGAGAAGCAA GTGTCCGAGG CTTTGAATGA CGACATGAAC ACGCCCGGAG TGATCGCGAC GCTCTCCGCG CCGCTCAAAT CAATGAATGA TTTCATGACC ACCAAGGCTG GAAAGAAAGC AGTCGGTCGT GTTGGGGCGC TTCAGTCGTT GTTGAGCACT GTCGAGGGTT TAATGGAGGC GGTTGGCATG CCCAAGGATG AAGAAAACGT CATTCTTGCG GAGCTTCGCG CGCGCGCGTT GCACCGCGCG GGCTTGACTG AGGACGATCT CTTAGCTAAA ATAGAAGAAC GCAATAAGGC GCGCGATGCG AAAGACTTTG CGGAGTCCGA CCGCTTACGC GACGAACTTT CCGCGCGCGG CGTTGGTCTC ATGGACGGTT CTGCGGTGCC GTGGCGTCCA GTTCCAGTCA TCGACGCGAC GTAG
Protein sequence | MTRKKEIFTP RDPAGKKVQM YVCGVTVYDY SHIGHARVYV AFDVLYRQLM RLGYDVTYCR NFTDIDDKII KRSNESGETC EALTDKFIEA FHEDMAALGC LRPTLEPRAT ECVDDIIAFI ERLIAKGNAY ETEGDVYFSV DTLPAYGALS GRNQEDNRAG ERVAVDGRKK NPADFALWKT AKPGEPTWTS PWGEGRPGWH IECSAMIEKM LGPTIDIHGG GQDLVFPHHE NELAQSSAAC GCGAHADENP FVRYWVHNGF VKVDSEKMSK SLGNFFTIRE VLDKYHPFVL RFMLLGAHYR APINYTQRAL EEASDRVYYL YQTVHDVRAI LRDAAAEEPA KKPVPLVADA LKLASEAEKQ VSEALNDDMN TPGVIATLSA PLKSMNDFMT TKAGKKAVGR VGALQSLLST VEGLMEAVGM PKDEENVILA ELRARALHRA GLTEDDLLAK IEERNKARDA KDFAESDRLR DELSARGVGL MDGSAVPWRP VPVIDAT
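
The two sequences can be checked against each other. The gene sequence is listed in the coding direction (it begins with ATG and ends with TAG) even though the gene lies on the minus strand, so no reverse complement is needed. The Python sketch below strips the 10-character block spacing, computes GC content, and translates codons. It assumes the standard genetic code, because the Translation table field is blank in this record. The function names are hypothetical and are not part of any database tool.

```python
# Standard genetic code (an assumption: the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last blocks of the gene sequence, copied from the record.
head = "ATGACGCGAA AGAAGGAGAT TTTTACGCCG"
tail = "GTTCCAGTCA TCGACGCGAC GTAG"

print(translate(head))  # MTRKKEIFTP
print(translate(tail))  # VPVIDAT
```

The two printed fragments match the first and last residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison for the whole record.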