Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33094 |
Symbol | |
ID | 5003468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 378957 |
End bp | 380039 |
Gene Length | 1083 bp |
Protein Length | 308 aa |
Translation table | |
GC content | 57% |
IMG OID | 640418889 |
Product | predicted protein |
Protein accession | XP_001419146 |
Protein GI | 145349451 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0647] Predicted sugar phosphatases of the HAD superfamily |
TIGRFAM ID | [TIGR01452] phosphoglycolate/pyridoxal phosphate phosphatase family [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCGCGCGTC TTCGTGCGCG AACGTTTGAT TTCCTAACAG AGCCGTCCTC GCGCATGTCT TGCGCTCGAA CGATCGCGCC GCACGCGCGC GTCGCCGCGC ACGCGCGTCG ACACACGACG AGTGGGAAAT CGACGATGTC CCGCTCCGCC GTCGTCCCGC GCGCGAAGGC GAACCGCTTG CAAGAGAAGA GCGCGCAGGA GTTGGTCGAC GCCACGGAAA CGTTCATCTT TGACTGCGAC GGTGTCATCT GGAAGGGTGA CTCGCTGATC GAAGGCGTCC CGGAGACGCT CGAACTGCTG CGCTCAATGG GCAAGCGATT GATCTTCGTG ACGAACAACT CCACCAAGTC TAGAGCCGGG TACACGAAGA AGTTCGAGTC GCTGGGATTG AAAGTGAACG CCGAGGAAAT CTTTTCTTCG TCCTTCGCCG CGGCGGCGTA TTTGGAATCG ATCGATTTCA AGAAGAAGGC GTATGTCGTC GGAGAGACGG GCATCTTGGA GGAACTCGAC GGCGTCGGCA TCAAGCACAT TGGTGGTGAA TCGGACGCTG GCAAGCAAGT CACGCTGGCG AGTGGTGAGT TGATGCACCA CGACGAGGAT GTCGGGGCGG TTATCGTTGG TTTTGATCGC AACATCAACT ACTACAAGAT CCAGTACGCG ACGCTTTGCA TCCGCGAGAA CCCGGGTTGC ATGTTCATCG CGACAAACAC CGACGCTGTG ACGCACTTGA CCGACGCCCA AGAGTGGGCT GGTAACGGTT CCATGGTCGG TGCCATCAAG GGTAGCACGA AGAGAGAACC GATCGTCGTC GGCAAGCCCG CGGCGTTCAT GTTGGATTAC ATTGCCAACA AGTTCCAGAT TCGCAAGGAT CAAATCACCA TGGTCGGCGA TCGCCTCGAC ACCGATATCC TCTTCGGTAA CGACGGTGGC TTGAATACTA TGCTCGTGTT ATCCGGCGTG ACCACGAAAG ACATGCTTTG CAGCGACGAC AACACCATTG CGCCGACTTA CTACACCGAT AAATTGGCTG ATTTATTGTG CGTCGGCAAG GTCGCAGCTT AAACGCGTTG TTATATACCT CTA
|
Protein sequence | MSRSAVVPRA KANRLQEKSA QELVDATETF IFDCDGVIWK GDSLIEGVPE TLELLRSMGK RLIFVTNNST KSRAGYTKKF ESLGLKVNAE EIFSSSFAAA AYLESIDFKK KAYVVGETGI LEELDGVGIK HIGGESDAGK QVTLASGELM HHDEDVGAVI VGFDRNINYY KIQYATLCIR ENPGCMFIAT NTDAVTHLTD AQEWAGNGSM VGAIKGSTKR EPIVVGKPAA FMLDYIANKF QIRKDQITMV GDRLDTDILF GNDGGLNTML VLSGVTTKDM LCSDDNTIAP TYYTDKLADL LCVGKVAA
|
| |