Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_38921 |
Symbol | |
ID | 5001831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 385855 |
End bp | 386976 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | |
GC content | 59% |
IMG OID | 640417252 |
Product | predicted protein |
Protein accession | XP_001418003 |
Protein GI | 145347074 |
COG category | [R] General function prediction only |
COG ID | [COG1090] Predicted nucleoside-diphosphate sugar epimerase |
TIGRFAM ID | [TIGR01777] conserved hypothetical protein TIGR01777 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.71449 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0770912 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTCAG CAGTGACTTG TAACGCGTTT TATCGTGCTC CGGCGAGCAA TGCGACGCCC TGCGCGCGCC TCCGCTCGCC GTCCCGTTCG GTGGATAAGA CGTGTCGTCG CCATCTTTTC CGCTCGACGC GCGCACGTCG GGGCGCTCTA GCGCCTTCCG CGAGCGTGGA CGACGACCTG TCCGTAGCGT CGTCCGACGA TCGGCTCATC GTCGCCATCA CGGGCGCAAC CGGGTTCGTG GGTAGCAAGC TAGTGGAGAC GCTCCTCGAG CGGGGCGCCG AAGTGCGAGT GCTCACGCGA GACGTCAACC GCGCTCGCGC GAAGCTTTCT CCACAAAATC TCCCAAAGGG CGACGTCGCG TTTGTGTCTC CGGATAAGTG GCGACGCGGG TTGCTCGGCG CGACGCACGT GGTGAATTTA GCTGGCGAAC CAATCAGCAC GCGCTGGGAC CCGAAGGTGA AGGGTGAAAT CATGGCGTCC CGAGTGAAAA CCACCAAGGC GGTCGTCGAA CACGTGAATT CGATCACAAA CGACGCCAAG AGACCTAAGG TATTGGTGAA CGCTTCGGCG ATCGGGTACT ACGGGACGAG TGAGACAGAT ACGTACGACG AAGCGAGCGG GCCAGGCGCG GACTATTTAA GTCAAGTCTG CCAGGCGTGG GAACAAACCG CGAGTGGGGT TGAAGATTGT AGAGTGGTGC TGCTGCGATT AGGGATTGTG CTCGATCGAG ATGGTGGGGC GCTCGGGAAG ATGGTGCCGA CTTTCCAAGC GTTCATGGGC GGGCCCTTGG GCGACGGTCA GCAGTGGTTT AGTTGGATTC ATAGAGACGA CGCGGTGGGG ATCATAATGG AGAGCTTGAC AAACGTAAAA CTTGAAGGTC CGGTGAATTG CGTCGCGCCA ACGCCCGTCC GCATGCGAGA GATGTGCGAA TCCCTCGGCG AGACCTTAGG GAAACCGAGT TGGTTGCCGG TGCCAGATTT CGCGTTGCGC GCAGTTCTCG GCGAAGGATC GACTCTAGTT CTTCAAGGGC AGAGAATCCA ACCCAAAACC GCGCTTGATG TGGGTTATAA GTTCAAGTAC GAGAGGATCG ACCAAGCGCT GAAGCAGATT CTTCGCCGTT GA
|
Protein sequence | MTSAVTCNAF YRAPASNATP CARLRSPSRS VDKTCRRHLF RSTRARRGAL APSASVDDDL SVASSDDRLI VAITGATGFV GSKLVETLLE RGAEVRVLTR DVNRARAKLS PQNLPKGDVA FVSPDKWRRG LLGATHVVNL AGEPISTRWD PKVKGEIMAS RVKTTKAVVE HVNSITNDAK RPKVLVNASA IGYYGTSETD TYDEASGPGA DYLSQVCQAW EQTASGVEDC RVVLLRLGIV LDRDGGALGK MVPTFQAFMG GPLGDGQQWF SWIHRDDAVG IIMESLTNVK LEGPVNCVAP TPVRMREMCE SLGETLGKPS WLPVPDFALR AVLGEGSTLV LQGQRIQPKT ALDVGYKFKY ERIDQALKQI LRR
|
| |