Gene Information |
Locus tag | OSTLU_27367 |
Symbol | CYSD |
ID | 5005486 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 147944 |
End bp | 149463 |
Gene Length | 1520 bp |
Protein Length | 434 aa |
Translation table | |
GC content | 58% |
IMG OID | 640420907 |
Product | O-acetylserine sulfhydrylase/homocysteine synthase |
Protein accession | XP_001421219 |
Protein GI | 145353863 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2873] O-acetylhomoserine sulfhydrylase |
TIGRFAM ID | [TIGR01326] OAH/OAS sulfhydrylase |
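The reported gene length can be checked against the coordinates in the table. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions; the record does not state the convention, and `span_length` is a hypothetical helper name used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
gene_length = span_length(147944, 149463)

# Nucleotides needed to encode the 434 aa protein plus one stop codon.
coding_nt = 434 * 3 + 3

print(gene_length)              # 1520
print(coding_nt)                # 1305
print(gene_length - coding_nt)  # 215
```

Under that assumption the span matches the listed 1520 bp. The 215 bp difference from the coding length suggests the gene span includes sequence outside the protein-coding region, which fits the unspliced status given in the table.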
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.365897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00396876 |
Fosmid hitchhiking | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | CCCGCACACG TCGACGCCCC GACGCGAACG ACGCGAACGA CGGAACGAAC GCCGCCATCA TGCCCGCGCC CGAGTTAAAG TTCGAAACCC TGCAGGTGCA CGCCGGCCAA GCGTCCGATC CCGCGACGAA CGCGCGCGCG GTGCCGATCT ACGCCACGAG CTCGTACACG TTCGATTCGT CCAAGCACGG CGCCGATCTG TTCGGGCTGC GCGCGTTCGG GAACATATAC AGCCGAATCA TGAACCCGAC GTGCGACGTG TTCGAAAAGC GCGTCGCCGC GCTCGAGGGC GGCGTGGGCG CGCTCGCGGT GTCGAGCGGG CAGAGCGCGC AGTTTTTGGC GATCTCGACG ATTTGCGGGA CGGGCGATAA CATCGTCGCG ACGCCGAGCC TGTACGGTGG AACGTACAAT CAGTTCAAGG TGACGTTTCC GAGACTTGGG ATCAACGTTA AGTTCGCGAA GGACGACGAT CCGGCGAGCT TTGAGGCGCA AATCGACTCG AACACGAAGG CGCTGTACGT GGAGACGATT GGGAATCCGC GATTTTCGGT GCCGGACTTT GCCAAGTTGA AGGCGATCGC TCAAAAGGCG GGGATTCCTT TGATTTGCGA CAACACGTTC GGTGCTTGCG GATACGTGTG TCAACCGCTC AAGCACGGCG CGGATATCGT GGTTGAAAGC GCGACGAAGT GGATCGGCGG CCACGGCACC ACCGTCGGCG GCGTCATCGT CGACGGCGGT ACGATGAACT GGAACAACGG TAAGCACCCA GTGATGACTG ATCCGTCTCC GGGCTACCAT GGATTGAAGT TCTGGGACAC TTTCGGCCCG GACGGCATTC TTGGTGCCAA CGCGACGTTC ATCATGCGTT GCCGCGTCGA AGGCTTGCGC GATCTCGGCA TGTGCCAAAA CCCGTTCGGA GCGTTCAACT TCATCCTCGG CTTGGAGACT TTGTCTCTCC GCATGGAACG CCACTGCTCG AACACTATGG CTTTGGCGCA ATACTTGGAA AAGCACCCGC AAGTGAGCTG GGTGTCTTAC CCAGGCCTCA AGGCGCACCC GTTCCACAAG CTCGCCATCG AATATTTCCG CGACGGCAAC TTTGGAGCCG TGCTCACGTT CGGTATCAAG GGTGGTCTCG ATGCGGGCAT GAAGTTCATC GACAATGTCA AGCTCGCTTC GCACTTGGCT AACGTCGGGG ATGCGAAGAC GCTCGTCATC CATCCGGCGT CGACCACGCA CGAACAGCTC ACTCCGGAGG AACAAACCGC CTCGGGCGTG ACTCCGGACA TGATTCGCGT CTCCGTCGGC ATCGAGCACA TCGATGACAT CTGCGCGGAT TTCGCCCAAG CGCTCACGGC GTAATCTTCC GGTGCTGATC CAATCAGAAT CGTCTCTTGC ACGAACTATT TACGAAAACT TTAATCGCGC GCGCGGCTCC GCAACGCAGC ATGTCTCTCA TATTACTATC ATACCACATC AATAAACTTG TAATGAACAC AAAAAACGAA AAAAATCGAA
|
Protein sequence | MPAPELKFET LQVHAGQASD PATNARAVPI YATSSYTFDS SKHGADLFGL RAFGNIYSRI MNPTCDVFEK RVAALEGGVG ALAVSSGQSA QFLAISTICG TGDNIVATPS LYGGTYNQFK VTFPRLGINV KFAKDDDPAS FEAQIDSNTK ALYVETIGNP RFSVPDFAKL KAIAQKAGIP LICDNTFGAC GYVCQPLKHG ADIVVESATK WIGGHGTTVG GVIVDGGTMN WNNGKHPVMT DPSPGYHGLK FWDTFGPDGI LGANATFIMR CRVEGLRDLG MCQNPFGAFN FILGLETLSL RMERHCSNTM ALAQYLEKHP QVSWVSYPGL KAHPFHKLAI EYFRDGNFGA VLTFGIKGGL DAGMKFIDNV KLASHLANVG DAKTLVIHPA STTHEQLTPE EQTASGVTPD MIRVSVGIEH IDDICADFAQ ALTA
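The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a string might be joined and its GC fraction computed follows. The helper names `normalize_sequence` and `gc_fraction` are hypothetical and not part of any database tooling, and the example uses only the first two blocks of the gene sequence rather than the whole record.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (input assumed to contain only A, C, G, T)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, copied from the record.
example = "CCCGCACACG TCGACGCCCC"
seq = normalize_sequence(example)
print(len(seq), gc_fraction(seq))
```

Applying the same two functions to the full gene sequence string would give a value that could be compared with the GC content listed in the Gene Information table.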