Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_17574 |
Symbol | |
ID | 5004657 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | + |
Start bp | 233558 |
End bp | 234658 |
Gene Length | 1101 bp |
Protein Length | 323 aa |
Translation table | |
GC content | 55% |
IMG OID | 640420078 |
Product | predicted protein |
Protein accession | XP_001420626 |
Protein GI | 145352595 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.769732 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.690692 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCCG CGGTGACGCG CGAGGACGCG CGCGAGAAAG TCCCCACGGT GTCGCTCGAA CTGTACGCGA AAGATTTCGA AAAGTTTGTC CACGACCTGG GAGAGGCGTA CGAGCGATAC GGATTCGTCA TCTTGGAAAA TCATGGGATC GATCTGGAAC TCATCGAACG CGCGATGGAA CGCTCGAAGG CGTTCTTTGC GTGCGATGAG GAATGTAAGA AGAAGTACAC GCTGCGCGGG TTGGGCGGCG CGCGCGGGTA CACGGCGTTC GGAGTGGAGA CGGCGAAGGG AGCGAACGCG CCTGGTGCGT GAAGGCGATG CGAAAAGTGA TACTTTTTTG TTTTGATGCG TCGCGCTATA GCGCGTGTGA TGGATTCGAC GACGCGATTC GTTTGACTGA CGCGAAGGAT AACGTCGCCT CTCGCGCGCG CAGATTTGAA AGAGTTTTGG CACCTCGGGC GAGACTTGCC GGCGGGCCAT AAGTACGAGT CGACGATGCC GCCGAATGTG GATGACGTGG TGGAAGTCGA AGGGTTTAGC GAAGTGAATA AGCAAATCTT CAACGCTCTA GATGAGCTCG GGAATTGCGT GCTCGAGGCG CTGGCGGTGC ATCTGGAACA ACCGCGCGAG TATTTCGCAG ATAAGACGAA CGAAGGCAAC AGTATTTTGC GTATCATTCA CTATCCACCA ATTCCGCCTG ATGCTGCGGG GCGCGTGCGC GCTGGTGCGC ACGAGGATAT CAATCTCATC ACGCTCCTGC TCGGCGCCGA CGAGGGTGGT TTGCAACTCT TGGGAAAGGA TGGAGAGTGG CTCGAAGTCA ACCCGCCTCC CGGATGCGTG ACGTGCAACA TCGGTGACAT GCTGCAACGC CTGTCGAATC ACAAGCTTCC TTCGACGACG CACCGTGTCG TGAATCCTCC CGCGGAACGG GCGCACATTC CTCGATATTC CATGCCATTC TTTTTACATC CAAATCCAGA CTTCGTCATC AAAACGCTGG AGTCGTGCAT TTCGGATCAG TATCCCAATC GCTATCCAGA GCCCATTAGC TCTGATGAAT ACTTGCAGCA ACGCCTTCGA GAGATTAAAC TCAAATCATG A
|
Protein sequence | MTPAVTREDA REKVPTVSLE LYAKDFEKFV HDLGEAYERY GFVILENHGI DLELIERAME RSKAFFACDE ECKKKYTLRG LGGARGYTAF GVETAKGANA PDLKEFWHLG RDLPAGHKYE STMPPNVDDV VEVEGFSEVN KQIFNALDEL GNCVLEALAV HLEQPREYFA DKTNEGNSIL RIIHYPPIPP DAAGRVRAGA HEDINLITLL LGADEGGLQL LGKDGEWLEV NPPPGCVTCN IGDMLQRLSN HKLPSTTHRV VNPPAERAHI PRYSMPFFLH PNPDFVIKTL ESCISDQYPN RYPEPISSDE YLQQRLREIK LKS
|
| |