Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_45194 |
Symbol | |
ID | 5001052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 377405 |
End bp | 378693 |
Gene Length | 1289 bp |
Protein Length | 395 aa |
Translation table | |
GC content | 54% |
IMG OID | 640416473 |
Product | predicted protein |
Protein accession | XP_001416640 |
Protein GI | 145344231 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR02006] cysteine desulfurase IscS [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0622302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.609433 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGGCCGCTGT ACTTGGACAT GCAGGCGACG ACGCCGTTGG ACCCGCGAGT GCTGGACGCG ATGTTGCCCT ATTTCACAGA GCAATACGGG AATCCGCACT CGAGAACGCA CATGTACGGC TGGGAGACGG AGGATGCCAT CGAAAAGGCG AGAGGAGAAT TGGCGTCGCT CATCGGGGCG AACGCGAAGG AGATTGTGTT CACGAGCGGG GCGACGGAGT CGAACAACAT GTCGCTCAAG GGGGTGGCGC GCTTTTACAA GGATAAAAAG AAGCACATAA TCACAACGAC GACGGAGCAC AAGTGCGTGT TGGACTCGTG CAGACAGCTC GAACGTGAAG GTTTCGACGT GACGTATTTG CCCGTGAAGG AAAATGGATT GGTAGACTTG AAGGAGCTTG AAGCGGCGAT GCGCGACGAC ACCGCCATCG TCTCCGTCAT GGCGGTGAAC AACGAAATAG GGGTGATTCA GCCTTTGAAA GCGATCGGTG AGCTTTGCCG ATCGAAGAAA ATATTTTTTC ACACCGATGG CGCGCAAGCA GTTGGGAAGG TACCGATGGA TGTGAACGAT ATGAACATCG ACCTGATGTC GATTAGCGGG CACAAGTTTT ACGGTCCCAA GGGGATCGGC GCTTTGTACG TCCGTCGTCG TCCTCGAGTT CGGATGGAGC CTATCATCAA CGGCGGCGGT CAAGAGCGAG GGTTACGCTC GGGGACGCTA CCGACCCCGC TCATCGTCGG TATCGGTGAA GCTGCTCGCG TGGCGCAGAA GGAGTTGCAG CGCGACGAAG AGCACGTCAA CCGCTTGGCT AAGAGATTGA TAGAGGGCAT CGAATCTCGC GTCGAGCACA CGCAATTAAA CGGTGACCGT GAAGCGCGCT ACCACGGCAA CGTGAACATG TCCTTTGCAT ACGTGGAGGG TGAATCCATG CTCATGGGAC TTAAAGAAAT CGCGGTGAGC AGCGGCAGCG CGTGCACGAG TGCGTCTTTA GAGCCATCCT ATGTTTTGCG TGCGCTCGGT GTGAACGAAG AGATGGCGCA CACGTCGGTA AGATATGGAT TAGGCCGATT CACTACTGAA GCCGAGGTCG ATCGCGCCAT CGAAGCCACA GTGCGTCAAG TCGAAAAGCT TCGTGAGATG TCTCCGCTCT GGGAGATGGT CCAGGAAGGC ATAGATTTAA AGACGATCGA GTGGAGTCAA CATTAACAAG CTCGCGCGCG CGTCATTTTT CGTAGAATTC AATTAGTTCA GTTCATGTTA TCATCACCAC GTTTTGTTCG TAATATTCC
|
Protein sequence | MQATTPLDPR VLDAMLPYFT EQYGNPHSRT HMYGWETEDA IEKARGELAS LIGANAKEIV FTSGATESNN MSLKGVARFY KDKKKHIITT TTEHKCVLDS CRQLEREGFD VTYLPVKENG LVDLKELEAA MRDDTAIVSV MAVNNEIGVI QPLKAIGELC RSKKIFFHTD GAQAVGKVPM DVNDMNIDLM SISGHKFYGP KGIGALYVRR RPRVRMEPII NGGGQERGLR SGTLPTPLIV GIGEAARVAQ KELQRDEEHV NRLAKRLIEG IESRVEHTQL NGDREARYHG NVNMSFAYVE GESMLMGLKE IAVSSGSACT SASLEPSYVL RALGVNEEMA HTSVRYGLGR FTTEAEVDRA IEATVRQVEK LREMSPLWEM VQEGIDLKTI EWSQH
|
| |