Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_94354 |
Symbol | |
ID | 5001895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 525742 |
End bp | 526905 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | |
GC content | 60% |
IMG OID | 640417316 |
Product | predicted protein |
Protein accession | XP_001417800 |
Protein GI | 145346654 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01139] cysteine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.25713 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00194835 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGATGC GAGCGCATCA CGCGCGCGCG ACGGTGACGA CGACGAAGAG AGGGCGATGC GAGCGACGCG ATAGGGCGCG AACGACGCCG CGGGCGCGGA TTTATGAAAA TATTCTCGAG ACGGTCGGGG ACACGCCGGT GATCAAGGTC AACCGACTGG CGCCGGCGGG GATCGATATG TACGTCAAGT GCGAGTATTT TAATCCGTTG AGCAGCGTGA AGGATCGATT GGCGGTGGCG GTGATCACGG ATGCCGAACG ACGGGGGTTG TTGAAGCCGG GGGATACGGT GGTGGAGGCG ACTTCGGGCA ACACCGGGAT CGCGGTGGCA ATGGCGTGCG CTCAACGCGG CTACAGATGC GTCATCTGCA TGGCCGAGCC GTTTTCTGTG GAACGTCGGA AGATCATGCG CATGCTCGGC GCGAAAGTCA TCGTGACGCC GAAGGGGGGT AAAGGTACGG GGATGGTGGC CAAGGCGGAG GAATTGGCGG AAAAGAATGG TTGGTTTTTG TGCCGACAAT TCGAGAACGA AGCCAATCCC GCGTACCACG CCTCGACTAC GGGGCCGGAA ATCTTGCGAG ACTTCGCGGG TAAGAAGCTC GACTATTTCG TCACCGGTTA CGGCACCGGC GGTACGTTCC AGGGCGTCGC GCGAACGCTC AAGGAATCTC GTCCGGACAC CAAGGTGATT TTGCTCGAAC CCGAAGCCGC GGCGTTGGTG ACTTCCGGCA TCAAGACCGA GCGTAAGCCC ACGGGCGCCC CGAATGGGTC TCATCCGGCG TTCGCGGCGC ACCCTGTGCA AGGTTGGACG CCCGATTTCA TCCCTTTGGT TCTCGAAAAT GGTTTGAACA TGAACCTCTA CGACGAACTC GTGAAGATCG AAGGCGGCGA CGCCGTCAAG ACGGCGCAAG CGTTGGCGAG AAGCGAAGGT ATCTTCACCG GTATTTCTGG TGGCGCCACG TTCGCCGGTG CGCTCAAGGT TGCCGAAAAG GCGCCGAAGG GCTCGGTGAT CTTGGCGATG TTGCCGGATA CGTCTGAGCG TTACATGAGC ACGCCACTTT ACGACTCGAT CGAGGCGGAC ATGAACGAGG AAGAGCTCGA GATCGCGAAG TCGACGCCGT CTTTCCAACT TATCCCGGGC CAAGAACCGA CGCTGCAAAT GTAA
|
Protein sequence | MTMRAHHARA TVTTTKRGRC ERRDRARTTP RARIYENILE TVGDTPVIKV NRLAPAGIDM YVKCEYFNPL SSVKDRLAVA VITDAERRGL LKPGDTVVEA TSGNTGIAVA MACAQRGYRC VICMAEPFSV ERRKIMRMLG AKVIVTPKGG KGTGMVAKAE ELAEKNGWFL CRQFENEANP AYHASTTGPE ILRDFAGKKL DYFVTGYGTG GTFQGVARTL KESRPDTKVI LLEPEAAALV TSGIKTERKP TGAPNGSHPA FAAHPVQGWT PDFIPLVLEN GLNMNLYDEL VKIEGGDAVK TAQALARSEG IFTGISGGAT FAGALKVAEK APKGSVILAM LPDTSERYMS TPLYDSIEAD MNEEELEIAK STPSFQLIPG QEPTLQM
|
| |