Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_88696 |
Symbol | |
ID | 5004294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 585015 |
End bp | 586592 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | |
GC content | 59% |
IMG OID | 640419715 |
Product | predicted protein |
Protein accession | XP_001420556 |
Protein GI | 145352440 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACCG AAGGCGACGA ACCCGCTGTC GCCCCGGACG CGGCGGCGAG CGGTGAAAAG AGCGAGAAAC TTCTCAAGCG TGAAGCCGAA AAAGCCGCCA AGAAGGCGGC GAAGGACGCC GCGAAAGCGT CCAAAGCGGC CGCCGCCGAA CAGCGCTCGC GGGGGCAAAA CGCCGTCGTG AGCGTGCAGT GCTCGACGAC ACCGCCGGTC GAGCTCGAGG CGTGCAGCGG CACGCGCGAT TGGTATCCCG AAGAGTTTCG TTTGCAGCGA TGGCTTTACG AAAAGTTTCG AGCCACTGCG CGAGCGACCG GGTTCGAGGA ATACGACGCG CCCGTGCTCG AGAGGCAGGA GCTTTATAAA CGCAAAGCCG GCGAGGAGAT TACGCAGCAA ATGTACGCGT TCGTGGACCA AGATGGGGTG GAGGTGACGT TGCGACCCGA GATGACGCCG ACGCTCGCGC GCATGGTACT CGGTCGCGCG CAGTCGATGA TGTTGCCTTT GAAGTGGTTT TCTATTCCGC AATGCTGGCG TTTCGAGACG ACGCAGCGCG GTCGTAAGCG CGAGCACTAC CAGTGGAACA TGGATATCAT CGGGTGCAAG TCTGTGAGCG CAGAAACCGA GCTGTTGTTC GCGGTGTGCG AGTTCTTCAA GTCGATCGGG ATCACGTCCG CCGACGTCGG CATCAAGGTG AACTCGCGCA AGGTCATGGC GAGCGTGTTG GATTCATACG GAATCACCGC GGAAAAGTTT GCGCCTGTGT GCATCGTGAT GGATAAGTTG GACAAAATCG GCGCCGATGC CGTCAAGGCT GAGCTCGTGG ACACGCAAGG ATTACCCGCG GAGACGGCTG CGAAAATCGT AGAGTGTTTG GCGTGCAAGA CGGTGAGCGA CCTCGAGGCG CTCTGCGGCG AGGGTGCCGA TCAAACCGGC ATCGATGAGT TGAAGAGGCT TTTCGAGCTC GCCGAAGATT ACGGCTACGG TGATTGGCTG ATTTTCGACG CATCCGTCGT GCGAGGTTTA GCTTATTACA CCGGCATCGT CTTCGAGGGC TTCGACCGCG CCGGGGAGTT GCGCGCCATT TGCGGTGGCG GTCGCTACGA TAGGTTGCTC TCTTTGTACG GTGCCGTGAC CGAGGTGCCG GCGTGTGGTT TCGGCTTCGG CGATTGCGTC ATCGTGGAGT TGCTCAAAGA TAAAGGATTG CTCCCTGAGC TTCCCAAGTC GATCGAGTTC GTCGTCGCCG CGTTCAACGA AGGCATGCAG GGCAAGGCGA TGAAGGCGGC GTCGATGATT CGCGCCGGAG GTTCGGATGT GGATATGCTT CTCGAGCCGA AGAAGAAAGT AGCGAGCACT TTTGATTACG CCAATCGTAT CGGCGCTCGA TACATCGTCT TCGTCGCGCC GCAAGAGTGG GAAAACGACA TGGTGCGAAT CAAGGATTTG CGCGCCGATT ACACGGACAA AGACGAAGAA AAGCAACTCG ACGTCAAACT TAGCGATCTC GGTAGGGTGT CGGAAGTATT AGCGGCGCAC GCGGCGGCGA TCGGCGCTGC GAACAAAATG GGCGGAATGG CCGTTTAG
|
Protein sequence | MTTEGDEPAV APDAAASGEK SEKLLKREAE KAAKKAAKDA AKASKAAAAE QRSRGQNAVV SVQCSTTPPV ELEACSGTRD WYPEEFRLQR WLYEKFRATA RATGFEEYDA PVLERQELYK RKAGEEITQQ MYAFVDQDGV EVTLRPEMTP TLARMVLGRA QSMMLPLKWF SIPQCWRFET TQRGRKREHY QWNMDIIGCK SVSAETELLF AVCEFFKSIG ITSADVGIKV NSRKVMASVL DSYGITAEKF APVCIVMDKL DKIGADAVKA ELVDTQGLPA ETAAKIVECL ACKTVSDLEA LCGEGADQTG IDELKRLFEL AEDYGYGDWL IFDASVVRGL AYYTGIVFEG FDRAGELRAI CGGGRYDRLL SLYGAVTEVP ACGFGFGDCV IVELLKDKGL LPELPKSIEF VVAAFNEGMQ GKAMKAASMI RAGGSDVDML LEPKKKVAST FDYANRIGAR YIVFVAPQEW ENDMVRIKDL RADYTDKDEE KQLDVKLSDL GRVSEVLAAH AAAIGAANKM GGMAV
|
| |