Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31624 |
Symbol | |
ID | 5001701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 243571 |
End bp | 244730 |
Gene Length | 1160 bp |
Protein Length | 381 aa |
Translation table | |
GC content | 55% |
IMG OID | 640417122 |
Product | predicted protein |
Protein accession | XP_001417962 |
Protein GI | 145346988 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.492816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00491035 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | CGGCTTTCGA CGGCATGCCC GCCGCGCGCG AAGACGCAGC GTCCGAGTAC CCCGTCATCG ATGTCGGCCC TTTACTCAGC GCCGGCGATC CGAACGAGGA CGTTCGCTTC ATTCTTGCCC GACGACGCGA GGTCGGTCGA GCGCTTCTCG CCGCGTGCGA ACGGTTCGGC TTCTTTTCCA TCGTATGCAC CATTCAAAAG GAATCAATCC CGTGGGCCGA CGTCGTTGAC CTGTTAACGC TGAACACCCG TGATGATTTG CCCGACGACT TTCTCTGGGA GTACGCGCGG CATGACCGGT CAGAAGGGTG TCTGATCGAG GGGAAAGATT TGACCTTTGC ACACGTCGAC GCGTGGTTTT CACAGAAACA GGCTGACAAG GACGCGTACG CGATGACGAA CGGAAGAGGA TATCAACGAA TAGGGGAGAA TGTGACGAAT GGGAGACGAG ATCAGCACGA AGCCATTGAT TTTTATCGAC CGTGTGCCGT TAGTGATGGT GGTTTGCGCG CGCCGCATCC GTACATTGGT CCGAACGATT TGAACGAGCG CGTCGACAGA TACGCGAAAG GGATGACGCG CATTGGTAAA TATATTTTGC GCGCGCTATT GGGCGCGATT CGGAAAGAAT ACCTCCACTA TCGCATAGCG GACGACTTCA CAGAGGACAT ACTCGAGGAT GACATCGCGG GATGTCCGTT TTGGATCTTA CGGCTCATAA ACTCGCCCGG TTGCGATTCC GATGGCGAGA AAACTTCGTG CGGTTGGCAC ACCGACTACG GTTTGCTGAC TTTCATTCAC GCCACGCATC CAGGCTTACA AATCGAAGTA TCAGGCAAGA TTATCGATGT TCCTCACCAT CCAGAACACA TGGTCTGTAA CGTTGGCGAG ATGCTTCAGC TCTTCACGGA CGACAGTCTC AAAGCCACCC GGCATCGCGT CGTCCGAAAG CCGAGCGACG AAAACTGCGC GCGTCCTCGT ATTTCGATTG CGTTCTTTTA TGAACCCAAC TACGACGCTG TGATATCGAA CAGGCATCTC ACGGATCAAA GTGCTTTGGA ATCGGGTTAC TCATCTCCGC GCGAGGTGCG ATACGCCGAC TTCTTGCGAC AAAAGGTCGC CACGAACTTC GCAAAGGTTG ACGAGGACCC TCGGGCGTAG
|
Protein sequence | MPAAREDAAS EYPVIDVGPL LSAGDPNEDV RFILARRREV GRALLAACER FGFFSIVCTI QKESIPWADV VDLLTLNTRD DLPDDFLWEY ARHDRSEGCL IEGKDLTFAH VDAWFSQKQA DKDAYAMTNG RGYQRIGENV TNGRRDQHEA IDFYRPCAVS DGGLRAPHPY IGPNDLNERV DRYAKGMTRI GKYILRALLG AIRKEYLHYR IADDFTEDIL EDDIAGCPFW ILRLINSPGC DSDGEKTSCG WHTDYGLLTF IHATHPGLQI EVSGKIIDVP HHPEHMVCNV GEMLQLFTDD SLKATRHRVV RKPSDENCAR PRISIAFFYE PNYDAVISNR HLTDQSALES GYSSPREVRY ADFLRQKVAT NFAKVDEDPR A
|
| |