Gene Information |
Locus tag | OSTLU_31472 |
Symbol | |
ID | 5002056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 14909 |
End bp | 16519 |
Gene Length | 1611 bp |
Protein Length | 485 aa |
Translation table | |
GC content | 60% |
IMG OID | 640417477 |
Product | predicted protein |
Protein accession | XP_001417890 |
Protein GI | 145346840 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG3239] Fatty acid desaturase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGGCG CGGCGAACGC GACGCACATG CTCACGAACG CGCGCGCGAC CGCGCCCGCG GGCGCGCGAT CGGCGCGTCG CGCGCGCGTG TTCGCGCGCG CCGACGTCGC GGTGCGCGCG GCGCCGACGT CGACGGGGCG CGCGCGGGAC GGGCGCGCGC GAGGCGCGCT GGCGGTGACG CGCGAGCGCG CGAGCGCGGT CGATGGGAAG AGACGGAGAA GCGCGCAGAC GATCGGGGCG GTGGCGAATC CGCTCGCGGT GCCGACGTAC GACGCGCCGG AGGGGAGGGA TAAGAATGAA CCGATTCGCG TGAAAATCGG AGACGAGTGG TACGACTGTC GGGGGTGGGC CAAGGCGCAT CCGGGCGGCG AACGGTGGCT GTACTTTTTC GATGGACGCG ACGCCACGGA CGTGTTCTAC GCGCTGCATT CGTACGGCCC CAACGGCTCG GATTTGGCCG TGCAAAGGTT GAAGAAGCTT CCGCGCTGCG ACCCGCCGGC GGACAAGTCT CGCCTTCCGG ATGAGAAATC GTACGCGGTG AGCATGGCGT TCGGTGAATT GCGCGACAAG CTCGCGGAGG ACGGATTCTT CAAGCGCCAA CCGCTCAAAG AAGCTTGGGC GCTCTTCCAA GTCGTCGCTC TGTACGTGAG CGGAACCGCT CTCGCGTATT CGCACCCCGT CTGGGCGACT ATTTTGCTCG GACTCGGGAT GGAACAAGCG GGTTGGTTGG GTCACGACTA CGTCCACGGA CGTGGGCCGT GGTGCTCGTT GATGCGCTAC ATGCCGACTA TTTTGAATGG CCACAGTGTG GAGTGGTGGA TGCAAAAGCA CTCGATGCAC CACTCGTTTA CGAATGAAGA GCACCTCGAC AACGACGTCA TGATGGAGCC GTTCTTCTTC TTGCGCTCGC CGCAAGAGTC CGGCCGACCG GATCACCCCA TGCGCAAGTT CCAGCACATC TACGGCTACC CACTGCTATC CATTATGTTT TGGCTCTGGC GATTCCACTC GGTTCAAACC GCGTGGAAGA AGAGGGATTA CAAAGAGCTC GCCTTCATCG GGGCAAACTA TCTCTTCTTG GCGACGATGA TGCCGTGGCA GGTGGCTGTC GGCTCTATCA CGCTCAGTGG TTTCCTCGTC GGCGCTCTCG TGAGTGCGAC GCACCAGAGC GAAGAAATCA TGGAGTTTGG TGAGAATCCA GAGTACGTGG AAGGGCAATT CCGCTCGACG CGCGACGCCG AGTGCGTCTT CGGCGGCTTG GAAACGTGGA TTTGGGGCGG AATGGATACG CAGTTGGAGC ATCACTTGTT CCCCACGATG CCGCGTTACA ACTACCATAA ACTTCGCCCG CTACTGAAGG CGTGGGCCAA GGCAAACGGT GTCACGTACC GCTCGTCTCC GAGCACGGAG ATCATCGCCG ACAACTTCAA GATGCTCCAT CGCGTCGCCA CCGCGTAAAA ATTTATTCTT TACCCTGGAC GAATCCAGAC GTTGCTTAAT TGTGTGTTAC CCGTGTTTCC CTATCGTGAA TAGTTACTTT TGAGACCGCA TGCGCGTGCG TTGCGGTGTT AGTGTGTTTG TGAAACGAAG CTAAAAAACT CGTAGACAAA A
|
Protein sequence | MVGAANATHM LTNARATAPA GARSARRARV FARADVAVRA APTSTGRARD GRARGALAVT RERASAVDGK RRRSAQTIGA VANPLAVPTY DAPEGRDKNE PIRVKIGDEW YDCRGWAKAH PGGERWLYFF DGRDATDVFY ALHSYGPNGS DLAVQRLKKL PRCDPPADKS RLPDEKSYAV SMAFGELRDK LAEDGFFKRQ PLKEAWALFQ VVALYVSGTA LAYSHPVWAT ILLGLGMEQA GWLGHDYVHG RGPWCSLMRY MPTILNGHSV EWWMQKHSMH HSFTNEEHLD NDVMMEPFFF LRSPQESGRP DHPMRKFQHI YGYPLLSIMF WLWRFHSVQT AWKKRDYKEL AFIGANYLFL ATMMPWQVAV GSITLSGFLV GALVSATHQS EEIMEFGENP EYVEGQFRST RDAECVFGGL ETWIWGGMDT QLEHHLFPTM PRYNYHKLRP LLKAWAKANG VTYRSSPSTE IIADNFKMLH RVATA
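The length fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the database's own tooling: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein is followed by a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed for the protein plus one stop codon (assumed)."""
    return 3 * (protein_aa + 1)


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; spaces between blocks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the Gene Information fields above.
length = gene_length(14909, 16519)
orf = coding_length(485)
print(length, orf, length - orf)
```

Under these assumptions the coordinates reproduce the listed Gene Length of 1611 bp, while a 485 aa protein needs only 1458 bp. The remaining 153 bp appear to correspond to the part of the listed gene sequence that continues after the TAA stop codon. `gc_percent` can be applied to the Gene sequence field to compare against the listed GC content.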