Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_39543 |
Symbol | |
ID | 4999924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 550217 |
End bp | 551284 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | |
GC content | 55% |
IMG OID | 640415345 |
Product | predicted protein |
Protein accession | XP_001415530 |
Protein GI | 145340849 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.159789 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTCGG TTGCCAGCCA GCTCCCGGTG CTGTCGCTGC CGCGAGAAGA CGCCGACGAA AACTCACACC GCGCGTTCGC CGCGGAGCTG TGTCGAACGT GTCACGACGT TGGATTCTTT TATTTATCGA ATCACGGCGT CGAGCAGGCG TGCGACAACG TTCTCGACGC GTCGAGGGCG TTCTTCGCGT TGCCACTGGC TGAGAAGAAA GAGATAGACT ACACGCGAAG TCGAGCGTTT CGAGGATACA TGTACGACGG CGCCGAAAAC ACCGCTGGAA AGCCAGATCG AAGAGAGCAG ATTGAATTCG GTGTCGAGTG CGCCGAAACG TGCGCCACCG AAGGGCCATA TTACGAGCGA TTAAAAGGTC CAAATCAGTG GCCGGCTCAG GTGCCGCTTC GCGCACCAGT GGAAGATTTC CAAAACAAAA TGGCGACGTT GAGTCGAAGA ATCATGACTT ATTTGGCGAT CGGACTAGAC CTCGACGGCG GTTATTTCGA TTCCATGTTC GGCGACGAAC CGAACGTGCA GATGAAAATT TGCCGGTATC CGCCGAGCGA CGGATCCGTC GGCGAACACA GCGACACTGG AATTCTTTCT TTTGTCGTCC AAGACTCGGT CGGTGGTTTG CAAGTGCAGC TTCACGATAG CGGTGAATGG ATTGATGCGC CGCCGATCGA CGGAACGCTC GTCGTCAACT TGGGGGAGAT GATTCAACTA ATCACCGGCG GCTACTTTCT TGCCACGCCG CATCGCGTTC AAAACCTGAA CGGCAGCCAG GCGCGATACT CGGTGCCTTA CTTTTGGAAT CCGGAGCTAG ACTATCGAGT CGAATTTATA GAGTCCTCCA TATTGGACAG TTTGGTTTGG CATCGGCCGC GACCGTCGGA CGATTCCTTC AGAGCGACGG GATCGCACGG CGGCGCGAAT CGACTCATCT CAGCGTACGG CGAAAACGCG TTCAAATCGT TCGCACGGTC TCATCCCGTA GTCATGCGAA GGCATCATTC AGACTTAGTG ATGAACCCCG ACGGTAGTAT AACTAGTCCT AGCAGCTCAA TATTATAA
|
Protein sequence | MASVASQLPV LSLPREDADE NSHRAFAAEL CRTCHDVGFF YLSNHGVEQA CDNVLDASRA FFALPLAEKK EIDYTRSRAF RGYMYDGAEN TAGKPDRREQ IEFGVECAET CATEGPYYER LKGPNQWPAQ VPLRAPVEDF QNKMATLSRR IMTYLAIGLD LDGGYFDSMF GDEPNVQMKI CRYPPSDGSV GEHSDTGILS FVVQDSVGGL QVQLHDSGEW IDAPPIDGTL VVNLGEMIQL ITGGYFLATP HRVQNLNGSQ ARYSVPYFWN PELDYRVEFI ESSILDSLVW HRPRPSDDSF RATGSHGGAN RLISAYGENA FKSFARSHPV VMRRHHSDLV MNPDGSITSP SSSIL
|
| |