Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_29534 |
Symbol | |
ID | 5006639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 407804 |
End bp | 408931 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | |
GC content | 55% |
IMG OID | 640422060 |
Product | predicted protein |
Protein accession | XP_001422581 |
Protein GI | 145356734 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0426424 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGC GAGAGTACGA GTGGAAACAC ATAAACGCGC GAGATTTTCC GAGTCTTGAA GATGGGTTCC GCGTGCTGCG GGAGATTCGA TATTTGATTT GGAAAATCGG CGACGTCGAG ATTCTGCGGG CGGCGGCGAA CGTCCGAGAG AGCCATCCCG GGACGGGCGC GCCTCTGGGG GACGGTTTGA GAAATCTGGA CATGTTCGAT CACTCAAACT TTGTGATTGC ATATTTCCAT GTTCCCGTTG AGATCTATCG ATTCTATCAC GAAGAAATGC CATCGTTGGT GCCAGATTTG GAAGATCGGA AAAAATGCAA AGTCTTAGCG ACGCAAGGCG GATTGTATCG GAAAAACATG AGCGAATGGT GGTGCGTTGT CGATTATCTT CTCGCGATAG GTTACGAGTG GCCCGTGGAA ACGATGGCGA AACTTGTTCA TTGGCACAGA AAGTTTGACG ACACCGAGAA GAAGGCGCTC GAAGATTACA AAGCGGCCGG GTGCCCGTAT CATCCCCGCG TAGTTCGCGA GGCGGCAAAG CAAGGAAGTC TGGATGCTCT CAAGTGGCTG CGTGAGAAAA ATGCGCCTTT CGACAAGTAC GCGGTGAGCT GGGCGGCGAA TTCCGGGCAA GTGGAGACGC TTAAGTATCT TGTGCAAGAA ATCAAGGTCA AACCGGATCC GATGATGTGC GCGCGAGCGG CCGAGGCGGG CGAGTTGACC GCTTTGCAGA CATTGCACGA AGCCGGCGTT CCGTGGGATA AGCGATGTAT AATCGCCGCG GCGAAGGAGA AAAAATCCAA GAAGAAGCGC AAGCGCGAGG AATGGAGTTC GTGGAATAAG CTCGGTCGCA TTCGATGTCT GCAATTCGCG CTCGCCTACG GCTGTCCGGG CGCCGAAGAC GCGCTCGCCC TCTCCGACCA AATCTCGGAG AACACCCGCA TCTACCTTCG CCAAGTCGTC CGATGCCCCG GCATCAACCA CTGGTTCTTC CGAATCAGCC GCATCCTCGG CAAAACGGTG CAAACCAAGA TGCCCGCCGC GGAGCGAACC GTCCTCGAAG TCTGCACCAA CGAGCTCCGA CTTCGACAAC TGAATTTCGA GAGCGACGCC TTCGCCGGCA TCGTTTAG
|
Protein sequence | MTEREYEWKH INARDFPSLE DGFRVLREIR YLIWKIGDVE ILRAAANVRE SHPGTGAPLG DGLRNLDMFD HSNFVIAYFH VPVEIYRFYH EEMPSLVPDL EDRKKCKVLA TQGGLYRKNM SEWWCVVDYL LAIGYEWPVE TMAKLVHWHR KFDDTEKKAL EDYKAAGCPY HPRVVREAAK QGSLDALKWL REKNAPFDKY AVSWAANSGQ VETLKYLVQE IKVKPDPMMC ARAAEAGELT ALQTLHEAGV PWDKRCIIAA AKEKKSKKKR KREEWSSWNK LGRIRCLQFA LAYGCPGAED ALALSDQISE NTRIYLRQVV RCPGINHWFF RISRILGKTV QTKMPAAERT VLEVCTNELR LRQLNFESDA FAGIV
|
| |