Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_37521 |
Symbol | |
ID | 5006092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | + |
Start bp | 377040 |
End bp | 378044 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | |
GC content | 67% |
IMG OID | 640421513 |
Product | predicted protein |
Protein accession | XP_001421923 |
Protein GI | 145355344 |
COG category | [R] General function prediction only |
COG ID | [COG0656] Aldo/keto reductases, related to diketogulonate reductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 0.414085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00268426 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGGCGCG CGCTCAGCGC GCGACGCACG AAGGCGCGCG CGGTGACGCC GCGCTGCGAC GCGTCGCGCG TCGCGCGACT CGCGACGGGA CGCGTCGTGT CGCGCGTCGG ATTCGGCACC GCGGCGTGGG GCGACGAGAC GCGCGGGTTC GGGACGCGCT ACCGCGAGCG CGACCTCGCG GCGGCGCTAT CGCGCGCGCT CGAGCGAGGC GTCACGTTCG TCGACACCGC GGAGACGTAC GGGGCAAGCG CGCGGGCGTT CGAACAGGGC GCGGAGGAGA TGGTGAGACG CGCGAGGACG ACGGCGAGGC GAGACGACGC GCGAGACGAC GCGTTCGTCG GAACGAAAGT GTTGACGGTG CCGTGGACGA ACGTGAGCGC GGGAGGGGAC GTGCGATCGA CGACGAAGAG CTTGGTGGAC GCGATCGAGG CGTCGGTGGG GAGGAACGGG GGGGAGGCGT ACGATTTGGT GTCGATTCAT TTTCCGTTTC CGACGTGGAC GCAGAGCGCG CTGTGCGACG CGCTCGCGGA GGCGACGGAG CGAGGGCTGT GTCGGGCGGT GGGGGTGAGT AATTACGACG TAAGGCAGAT GACGGAGGCG CATGGGTTGT TGGCGAAGCG TGGGATCGCG TTGGCGACGA ATCAGGTGAA ATATTCCGTG CTCGATCGAG GCGCCGAAAA GAGCGGGGTG CTCGCCGCGG CGCGGGATTT AGACGTCGCC GTCGTGGCGT ATTCGCCCTT GAGCGGTGGG GCGCTGCGGA CGAGCGCGGA CCCGGAGATT CGCACGTTGG ACAAGTTGCT CGAGTTCATC GGCGCCGTCA ACGGTGGTTG GACGTCGGCG CAGGTGGCGT TGAACTATCT CGTCCGCAAG GGCGCGATTC CGATTCCGAG TTGTACGAGC GTCGCGCGCG CCGACGCCAT CGCGGACGTC CTCGAATTCG AGCTCGGCGT CGAAGACATC GAGACTATCG ATGAAAAAAT GGATTACATC GAACGCAAGT CGTGA
|
Protein sequence | MRRALSARRT KARAVTPRCD ASRVARLATG RVVSRVGFGT AAWGDETRGF GTRYRERDLA AALSRALERG VTFVDTAETY GASARAFEQG AEEMVRRART TARRDDARDD AFVGTKVLTV PWTNVSAGGD VRSTTKSLVD AIEASVGRNG GEAYDLVSIH FPFPTWTQSA LCDALAEATE RGLCRAVGVS NYDVRQMTEA HGLLAKRGIA LATNQVKYSV LDRGAEKSGV LAAARDLDVA VVAYSPLSGG ALRTSADPEI RTLDKLLEFI GAVNGGWTSA QVALNYLVRK GAIPIPSCTS VARADAIADV LEFELGVEDI ETIDEKMDYI ERKS
|
| |