Gene Information |
Locus tag | OSTLU_1214 |
Symbol | |
ID | 5004001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | + |
Start bp | 286930 |
End bp | 288012 |
Gene Length | 1083 bp |
Protein Length | 361 aa |
Translation table | |
GC content | 54% |
IMG OID | 640419422 |
Product | predicted protein |
Protein accession | XP_001419962 |
Protein GI | 145351179 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00737] putative TIM-barrel protein, nifR3 family |
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | CGCAAAGAAC TCGATTTCAA GGGCAAGCTG TACGTGGCGC CGTTGACGAC GGTTGGGAAC TTGCCATTTC GGCGAGTTTG CACCGATCTC GGCGCCGACA TTACCGTATC AGAGATGGCG ATGGCGAGTA ATTTGCTCAA GGGGGATCGT AAAGAGTGGG CGCTTCTTCG TCGTCACCCA AGCGAGAAGT GCTATGGTAT ACAAGTTTGC GGTGGATATC CAGATCTGAT GGCAAGGTGT GCGGAATTGA TCGACAACGA AGTCTCGTGT GATTTCATCG ACGTCAATAT GGGATGCCCA ATCGACGGCG TTTGCGCCAA GGGTGCCGGT AGCAGTCTCA TGCGCGATAC TGATCGATTG AAAAACGTTG TACGGACAAT GGCGGCGGTT TCTTCGACCC CCGTAACAAT CAAGCTCCGC ATGGGCTACT TTGACGACCC CTCGAAGTAC GTTGCGCACG ACATCATCCC GCGAGCGAAA GCTTGGGGAG CGTTTGCGGC AACTCTACAC GGGCGCACTC GTGAGCAACG TTACTCGCGC CTCGCGGATT GGTCTTACAT TCATCGTTGC GCCGACGTGG CGGCAAAGAG CGAGTTCACA CTCATCGGTA ACGGAGATGT GTACACGTAC GAAGATTACA ACGCCCAAGT CGCCGACAAC AAAGTGGCGA CGTGTATGAT CGGTCGCGGT GCCATCATCA AGCCCTGGCT CATGACTGAA ATCAAAGAGC AGCGTCACTG GGACATAAGC GCCAATGAGC GATTAGATTT GTTCAAGGAC TTTTGCCAAT ATGGGCTCGA ACACTGGGGC AGCGACTCGA TGGGAGTGGA GAAGACGCGC CGCTATCTCC TCGAGTGGAT GAGTTACACC CATCGATACG TCCCAATAGG ACTATTGGAG CAAAACGTCG TTCCGAAACT CCACTTGCGT CCGATGCGTT ACGTCGGACG ATCGGACCTC GAAACCAAAC TCGCGAGCGA CAGACTCGAA GATTGGCTCG AGTTGAGTGA AATTTGCGGG CTTGGCAAAC CCGACGCGTC GTTCAAGTTT GTTCCAAAAC ACGCTTCGAA TAGCTATACA AAA
Protein sequence | RKELDFKGKL YVAPLTTVGN LPFRRVCTDL GADITVSEMA MASNLLKGDR KEWALLRRHP SEKCYGIQVC GGYPDLMARC AELIDNEVSC DFIDVNMGCP IDGVCAKGAG SSLMRDTDRL KNVVRTMAAV SSTPVTIKLR MGYFDDPSKY VAHDIIPRAK AWGAFAATLH GRTREQRYSR LADWSYIHRC ADVAAKSEFT LIGNGDVYTY EDYNAQVADN KVATCMIGRG AIIKPWLMTE IKEQRHWDIS ANERLDLFKD FCQYGLEHWG SDSMGVEKTR RYLLEWMSYT HRYVPIGLLE QNVVPKLHLR PMRYVGRSDL ETKLASDRLE DWLELSEICG LGKPDASFKF VPKHASNSYT K
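
The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The following is a minimal Python sketch and not part of the original record: the function names `gene_length` and `gc_percent` are hypothetical, and it assumes that Start bp and End bp are 1-based, inclusive coordinates.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the Start bp and End bp fields of this record.
start_bp, end_bp = 286930, 288012

length = gene_length(start_bp, end_bp)
print(length, length // 3)  # prints "1083 361"
```

Under that assumption, the 1083 bp span corresponds to exactly 361 codons, which matches the reported protein length. The listed gene sequence ends in AAA (lysine, the final K of the protein sequence), so no stop codon is included in the span. Passing the Gene sequence field to `gc_percent` should give a value comparable to the reported GC content, although the record does not say how that figure was rounded.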