Gene Information |
Locus tag | OSTLU_18471 |
Symbol | |
ID | 5006012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | + |
Start bp | 95793 |
End bp | 96881 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | |
GC content | 68% |
IMG OID | 640421433 |
Product | predicted protein |
Protein accession | XP_001421838 |
Protein GI | 145355166 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related |
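The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming a terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(95793, 96881)
print(length)                     # 1089
print(protein_length_aa(length))  # 362
```

Both values match the Gene Length (1089 bp) and Protein Length (362 aa) fields in the record.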
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.23754 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGCGCCG ACGCGCGCGC GGGCGAAGGC GCGAGACTGT TGGCGGCGTT CGCGACGCGC GTCGTCCCGC GATTGGCGCG CGCGCGCGCG CACAAGGGGT CGCACGGCGG GAAGATCGCC GTCGTCGGTG GGAGCGAACT GTACGCGGGC GCGCCGTACT TCGCGAGCGC GGCGGCGATG CGCGCGGGGT GCGATCTGTG TCACGTGTTC ACGCACGCGA AATGCGCGCC GGTGATGAAG GGGTACGGGC CGGACCTGAT CGTGCACGAG GCGTGGTCGC GGGACGCGCG CGAGGCGACG CGCGGAGCGA AGACGGAGAC AGAGACGGAG AACGAACGAT CGATCGATCT CGTCGAGGCG TTCGGGAGGT TTAGGATCGA TAACGCGGTG ATCGGACCGG GATTGGGGCG CGGGGCGGCG CTGGAGGCGG TGGAGGCGCT GAGAGAGGTC GCGGCGGCGT GCGTCGTGGA CGCCGACGGG TTGAAGGCGC TGGAACCGAC GAGCGCGGAC GAGGACGGCG CGGAGGCGGC GCGAGGGAGA AATCCGACGG CGCTGGCGAC GCCAAATAAG ATGGAACTGT GGCGATTGGT GCGAAAGGCG TCGGGGGCGT TCGAGGGGGG GGTGACGACG ATGGATTTGA GCGCGCGCGA GGACAGGGAG AAAATAGCGA GCGCTCTGCG ACGGTACGCC GGCTATAATT TCCTCGTCAA GGGCGAAGAC GATTACTTAT TCATTCAACA CTGGGACGTC GCGCCGTCGG TGTGCGACAG CGAGCGCGCG GCGAGCGGCG ACGCGTCGAT CGTTCGGCTC CGTTTCGACG GCGTCGGCTC GCCGAAACGT TCCGGCGGTC AGGGCGACAT TCTCGCCGGC GTCCTCGCGG TTTTTCTCCT CTGGTCCCAG CGCACGGACG CCACGACCGC GAGCAATCGT TTAGACGATT ACGTCGCCGC CGTCGGCGCG GCGTGTTTCC TCGTGAAAGC CGCCTCGAGC GCGGCGTATC GCGAGTACGG TCGCGGCGCG CACGCGCAAG ACGTCCTCGC GCGCGTCGCC TCGACGTTCA TGGCGCATTT AGAACCCGAT CTCTCCTAG
Protein sequence | MRADARAGEG ARLLAAFATR VVPRLARARA HKGSHGGKIA VVGGSELYAG APYFASAAAM RAGCDLCHVF THAKCAPVMK GYGPDLIVHE AWSRDAREAT RGAKTETETE NERSIDLVEA FGRFRIDNAV IGPGLGRGAA LEAVEALREV AAACVVDADG LKALEPTSAD EDGAEAARGR NPTALATPNK MELWRLVRKA SGAFEGGVTT MDLSAREDRE KIASALRRYA GYNFLVKGED DYLFIQHWDV APSVCDSERA ASGDASIVRL RFDGVGSPKR SGGQGDILAG VLAVFLLWSQ RTDATTASNR LDDYVAAVGA ACFLVKAASS AAYREYGRGA HAQDVLARVA STFMAHLEPD LS
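The sequences above are displayed in 10-character blocks separated by spaces. The following is a minimal sketch, with hypothetical helper names, of how such text could be normalised and its GC fraction computed. It is an illustration only, not necessarily how the reported GC content was derived.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces between 10-character display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C, computed after removing display whitespace."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# The first two display blocks of the gene sequence in this record.
first_blocks = "ATGCGCGCCG ACGCGCGCGC"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_fraction(first_blocks))          # 0.85
```

Applied to the full gene sequence, the result can be compared with the GC content field reported for this gene (68%).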