Gene Information |
Locus tag | OSTLU_29417 |
Symbol | |
ID | 5006740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 199254 |
End bp | 200790 |
Gene Length | 1537 bp |
Protein Length | 405 aa |
Translation table | |
GC content | 64% |
IMG OID | 640422161 |
Product | predicted protein |
Protein accession | XP_001422517 |
Protein GI | 145356603 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.024328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00343587 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | CGCGACGCGC GCGCGCGTCG TCGACGATGC GCGCGCGACT CGTCGCGTCG AGCGCGCGGT TGTGCGCGCG ACGCGCGGTG AGGGCGCGAC GGGAGTGCGC GCGGGCGTGC GCGACGACGC CGGGGGCGAG AGTCGCGCCG CGACGGGCGT GGGGGACGCG AACGCGCGCG ACGTCGTCGC GAGGGGGCGA CGGCGGACGG ACGACGACGA CGGTGGATCC GACGCGCGCG AGCGAGCGCG GGGCGAAGCG GGCGACGATC GATCTGCAGC CGCCGAAGGG GACGCGAGAT TTCCCGCCGG AGGAGATGCG ACAGCGGTCG TGGCTGTTTG GACACTTTCG AGAGTGCGCG AAGGTTTTCG GGTTCGACGA GTTCGACGCG CCGGTGCTGG AGAGCGAGGA ACTGTTCACG AGGAAGGCTG GGGAAGAGAT CACGACGCAG TTGTATAACT TTTCGGATAA GGGCGATCGC AGGGTGGCGC TGAGGCCGGA GTTGACGCCG TCGTTCGCGC GGTTGATTTT GCAGCAAGGC AAGTCGTTGG CGTTGCCGGC GAAGTGGTTC GCGATCGGGC AGTGCTGGAG ATACGAGCGC ATGACGCGAG GAAGACGTCG GGAGCATTAT CAGTGGAATA TGGACATCGT CGGCGTGAGC GGGGTGGAGG CGGAGGCGGA GTTGTTGGCG GCCATTACGA CGTTTTTCAA GAGGGTGGGG GTGACGAGCG CCGACGTAGG CATCAAGGTG AGCTCGCGAA AGCTGTTGCA GGAGGTGTTG ACGCGGTTCG GGATCGACAG CGAATCTTTC GCGCCCGTGT GTGTGGTGGT GGATAAGATT GAAAAGCTCC CGCGCGAAAA GATTGAGGAA GAGCTCAGAG AGCTCGGCGT GAGCGACGAG GCGGTGGAGG GCATCTTGGC GGCGACGTCG ATGCGCACGG TAGAAGAGCT CGAGGCCCTC ATCGGCCCGG ACGCGGAGGC GGTGAAGGAC TTAAAGAAGC TTTTTGAGTA CGCCGATGCG TACGGCTACC GAGATTGGCT CGTGTTCGAC GCGTGCGTCG TTCGCGGTTT GGCGTACTAC ACGGGCATCG TCTTCGAAGG TTTCGATCGC GCGGGCGAAC TTCGCGCCAT CTGCGGTGGC GGGCGATACG ACATGCTTCT TGGGGCGTTA GGCGGCGAGA ATCAACCCAT GGTCGGGTTC GGGTTCGGCG ACGCCGTCAT CGTGGAGCTG CTCAAGGATA AGGGTTTGAT GCCCGACTTT TCCAAGGGCG ACGTCCAAGA CTTGGTGTTC CCGCTCGGCG AGTCGCTGCG CCCGGCGGCG ATGCGCGTCG CCGCCCAGCT TCGCGACGCC GGTCGCACCG TTGATCTCAT CCTCGAAGAC AAGAAAGCGA AATGGGCGTT CAAGCAAGCC GAACGCGTCG GCGCCCAACG CGTCATTTTG CTCGGCGAAA AGGAATGGGA AGCGGGGAAC GTTCGCGTCA AAGACTTAGC CAGCCGCGAA GAGGTCGACG TCAAATTGGA AGATCTCAAA TAATTAATAA ATGATGC
|
Protein sequence | MRQRSWLFGH FRECAKVFGF DEFDAPVLES EELFTRKAGE EITTQLYNFS DKGDRRVALR PELTPSFARL ILQQGKSLAL PAKWFAIGQC WRYERMTRGR RREHYQWNMD IVGVSGVEAE AELLAAITTF FKRVGVTSAD VGIKVSSRKL LQEVLTRFGI DSESFAPVCV VVDKIEKLPR EKIEEELREL GVSDEAVEGI LAATSMRTVE ELEALIGPDA EAVKDLKKLF EYADAYGYRD WLVFDACVVR GLAYYTGIVF EGFDRAGELR AICGGGRYDM LLGALGGENQ PMVGFGFGDA VIVELLKDKG LMPDFSKGDV QDLVFPLGES LRPAAMRVAA QLRDAGRTVD LILEDKKAKW AFKQAERVGA QRVILLGEKE WEAGNVRVKD LASREEVDVK LEDLK
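
As a quick consistency check on the fields above, the following minimal Python sketch recomputes the gene length from the start and end coordinates and strips the 10-character grouping from a sequence field before measuring it. It assumes the coordinates are 1-based and inclusive, which agrees with the reported 1537 bp. The function names `span_length` and `ungroup` are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def ungroup(seq_field: str) -> str:
    """Remove the whitespace that splits a sequence field into 10-character groups."""
    return "".join(seq_field.split())


# Coordinates copied from this record (Start bp, End bp).
gene_length = span_length(199254, 200790)
print(gene_length)  # 1537, matching the Gene Length field

# First two groups of the protein sequence field, used as a short example.
first_groups = "MRQRSWLFGH FRECAKVFGF"
print(len(ungroup(first_groups)))  # 20
```

Applying `ungroup` to the full protein sequence field and taking its length should reproduce the reported Protein Length of 405 aa.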