Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_44068 |
Symbol | |
ID | 5004423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | + |
Start bp | 502172 |
End bp | 503407 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | |
GC content | 56% |
IMG OID | 640419844 |
Product | predicted protein |
Protein accession | XP_001420362 |
Protein GI | 145352030 |
COG category | [R] General function prediction only |
COG ID | [COG0319] Predicted metal-dependent hydrolase [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00043] metalloprotein, YbeY/UPF0054 family [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00247631 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCACT GCGAGTTGAG CGTGGCGCTG TGCTCGGATG AGTACATTCG AAGCTTGAAC GCGGGGTATA GAGAGAAAGA TAGCGCGACG GATGTTTTGA GCTTTCCTGC GGAGAGTTTC GGGCCGATGG CAGTGCTCGG GGACGTCATC GTGAGCGTGG ACACGGCGAG CGCCCAGGCG CGGGAGGTGG GGCATTCTTT GCGGGATGAG TGCCGAGTTT TGCTCGTGCA CGGGACTTTA CATTTATTAG GTATGGATCA CGAAGTCAGC GAAAGCGAGG CGGAGGTAAT GGCGGCGGCG GAGCAAGAGG TCTTGAAGGC GCTCGGATGG AAAGTCACCG GGCTCACGAG ACGCGCGTCG GGAGAATCGA CGTCCGACTC TTCTTCGACG ACGCTCACGA CACAGAGAAG TGTGCTCGTG ACGGATTTAG ACGGCACGCT ATTAAATGAA AATAGTGTCA TCACGCCTCG AGTCGCCGAT GCTTTGCGCC GGGCGATGGC GTCGGGAGTC GAAGTTGTCG TCGCCACGGG CAAGGCGAGA CCGGCGGCGA TTAAAGCCGC CGCCACGCAA GGATTAGACG GCATTATCGT CGGTAAGAAC ACACCTGGGG TGTTCTTACA AGGTCTAGAA GTGTACGGTC GAGGCGGTGC TCTGGTCTAT GAAGCGAAAA TGCCCGAAGA CGTCACGAGA GATGCCTTCA TGATGATGGA TGACGTCGTG CACGACGGAT TGGCGCTCAC GGCGTTTTGT GGCGACAATT GCGCGACGCT TGCGCCGAGC GTACTCTTGG ACGAGCTCCA CCACACCTAT CACGAACCAG CCAGCGAAAT CGCCGGATCG GTGGATGAGA TACTATCCAA TAACACCGTT CGTAAACTAT TATTAATGGG ACCGAGCAAA GAGAGCATTG ACGGCGTTCG ATCGATTTGG GAAGCCGCAT TCAGGGGTCG AGCGGAGGTC ACACAAGCGG TGGCGGATAT GCTAGAAATA TTACCCCTTG GGAACGATAA GTCAAAGGGC GTTCGAGCCG TGTTGAAATC CATGGATGTG AATCCGATGA CGGACGTTGT CGCCATCGGC GACGGCGAAA ACGATGCCGA AATGCTTCGA TTCGTCGGTT GCGGCGTCGC CATGGCGAAC GCCACGGAAA AGACAAAAAG CGGTGCCGCA CACGTCCTCG ATGCTTCAAA CACGCAAGAC GGTGTCGCGG AAGCGATTGA TAGGTTTGTT TTGTAA
|
Protein sequence | MSHCELSVAL CSDEYIRSLN AGYREKDSAT DVLSFPAESF GPMAVLGDVI VSVDTASAQA REVGHSLRDE CRVLLVHGTL HLLGMDHEVS ESEAEVMAAA EQEVLKALGW KVTGLTRRAS GESTSDSSST TLTTQRSVLV TDLDGTLLNE NSVITPRVAD ALRRAMASGV EVVVATGKAR PAAIKAAATQ GLDGIIVGKN TPGVFLQGLE VYGRGGALVY EAKMPEDVTR DAFMMMDDVV HDGLALTAFC GDNCATLAPS VLLDELHHTY HEPASEIAGS VDEILSNNTV RKLLLMGPSK ESIDGVRSIW EAAFRGRAEV TQAVADMLEI LPLGNDKSKG VRAVLKSMDV NPMTDVVAIG DGENDAEMLR FVGCGVAMAN ATEKTKSGAA HVLDASNTQD GVAEAIDRFV L
|
| |