Gene Information |
Locus tag | OSTLU_12302 |
Symbol | |
ID | 5000621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 155931 |
End bp | 158093 |
Gene Length | 2163 bp |
Protein Length | 720 aa |
Translation table | |
GC content | 57% |
IMG OID | 640416042 |
Product | predicted protein |
Protein accession | XP_001416570 |
Protein GI | 145344088 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01787] squalene/oxidosqualene cyclases |
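The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 155931
end_bp = 158093
gene_length_bp = 2163
protein_length_aa = 720

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is read in codons of three bases.
# Assumption: the last codon is a stop codon and adds no residue.
codons, remainder = divmod(span_bp, 3)
residues = codons - 1

print(span_bp, codons, remainder, residues)  # 2163 721 0 720
```

Under these assumptions the span matches the listed gene length of 2163 bp, and 721 codons minus one stop codon matches the listed protein length of 720 aa.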
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0477917 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.996732 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGGGCGG CGTACGCGAA GACGCGGCAC GAGCGTAAAC ACAGCGAAGA CGCGTTTCTG CGCGCGCAGT ACGACGCGAG GCGAAGCGCG CGAGGCTTGG AGACGAAGAC GAAGACGCCG AGCGCGATGG AGATTTCGAG AGGGGAAAAA GAGGGGAAGG GCGTCGACGG GGAGGTGGTG CGGCGAGCGA TGCGAGCGGG GATCGAGTAT TATCGAGGGA TACAGGACGA GGACGGACAC TGGGCGAGCG ACTACGGCGG GCCGATGTTT TTGATGCCGG GGTTGATCAT CGCGGCGAAC GTCATGGAAA AATCGGAGGA GATCATCGGT GAAGCGCGAG GGCGGGAGAT GCTGAGATAC CTGGAGAATC ACTTGAACGA GGATGGGGGC GTTGGGTTGC ACATCGAAGG TCACAGTACA ATGTTTGGGA CGGTTTTGAC GTACGTGGCG ATGCGGTTGC TCGGCAAGGC GGCGGATTCG CCCGAGTGCG CGAAAACGCG CAAGTGGATC ATCGATCGCG GTGGCGCGAC GCAGGTACCG TCGTGGGGGA AGTTTTGGCT CGCCGTGCTG GGGGTGTACG AGTGGCACGG GTTGAATCCG ATTCCGCCCG AGTGTTGGTT GTTGCCGTAT TGGCTCCCCA TGCATCCGGG TCGATTCTGG TGCCACTGTC GCATGGTGTA CTTGCCGATG AGTTACCTGT ACGGCATCCG GGCCACGGGT AAACCGACGG CGTTGACCGC GGCGTTGAAA TCGGAGTTAT ACGCCGAGGC ATCGTACGAC TCAATCAACT GGAACACTGC GAGAAACGCG TGCGCGAAGG AGGACTTGTA CTACCCGCAC CCGTGGATTC AAGACGTCGT CTGGTCGACG CTCATGAAGG TCGAGCCGTT CTTGATGAAT TCTCGCGTGC GCAAGGCGGC GTGCGAAGAT GCGATGCGGC AGATTCATTA CGAGGACGAA AACACGCGCT ACGTGGACAT TGGGCCGGTG AATAAAGTCT TCAATATGTT GTCATGCTGG TTTGAAGATC CCGATTCAGA GGCGGTGAAG AAGCACATTC CACGCATCGC CGACTACCTC TGGGTCGCCG AGGACGGGAT GAAAATGCAA GGATACAACG GAAGTCAGCT GTGGGACTGC GCGTTTTCCG TTCAAGCGAT CGTAGCGACG GGTTTAGCGG ACGAATATGG CGAGTGCTTA CGTCGCGCGC ACGACTACAT CGAAAAGAGT CAAGTACGAG ACGACTGCCC TGATGTCGAA AAGTGGTATC GTCACATTTC CAAGGGTGCG TGGCCGTTCA GCACACGCGA TCACGGTTGG CCGATCTCCG ATTGCTCCTC CGAAGGCTTG AAGGCGGCTT TGACGCTGGC GTCGATGGAC GAGAAACTCG TCGGCGAGGC GATTCCCGTC GACCGACTCG CGGATTGCGT GAACGTCATC CTATCCTATC AAAATCGCGG CAGTGGTGGC TGGGCTACGT ACGAAAATAC TCGATCGACG AAGTGGGTCG AGCTCTTGAA CCCAGCGGAG ACGTTCGGGG ACATCATGAT CGATTACCCG TACGTTGAGT GCTCGAGCGC GAGTTTGCAG GCGCTGTGCA AATTTTCTGA GCGCTATCCG GACATTCGCG CCAAGGATAT CGCGCATGCC AAGAAGACTG GCCGAAAGTT TTTGAAAAGT ATTCAACGCG CCGATGGAAG CTGGTACGGA TCGTGGGCGG TGTGCTTCAC GTACGGAACG TGGTTCGGCG TCCTGGGATT GATCGCCACG GGGTCGACGT ACGAGACGTG CCCATCGCTT 
CGTAAAGCAG TCGAGTTCTT GCTCTCGAAA CAGCAAGAAA ATGGTGGATG GAGCGAGAGC TATTTATCGT GCGAGAAAAA GGCGTACCAC GAGCTTCGCG ACAAGGATGG TAAGCCGAAG CCGCACTTGG TAAACACGGG ATGGGCGATG CTTGCGCTCA TCGCCAGCGG TCAACAAAAC CGCGACGCGA CGCCATTACA TCGCGCCGCA CGATGCATGC TCTCACAACA ATATGAGAAC GGCGATTTCC CGCAGCAAAG CATCATGGGT GTCTTCAACG CCAACTGCAT GATTTCCTAC AGTTGCTACC GAAGCATTTT CACACTCTGG GCGCTCGGCG AGTACTGCAA CAAAGTTCTG TAA
Protein sequence | MRAAYAKTRH ERKHSEDAFL RAQYDARRSA RGLETKTKTP SAMEISRGEK EGKGVDGEVV RRAMRAGIEY YRGIQDEDGH WASDYGGPMF LMPGLIIAAN VMEKSEEIIG EARGREMLRY LENHLNEDGG VGLHIEGHST MFGTVLTYVA MRLLGKAADS PECAKTRKWI IDRGGATQVP SWGKFWLAVL GVYEWHGLNP IPPECWLLPY WLPMHPGRFW CHCRMVYLPM SYLYGIRATG KPTALTAALK SELYAEASYD SINWNTARNA CAKEDLYYPH PWIQDVVWST LMKVEPFLMN SRVRKAACED AMRQIHYEDE NTRYVDIGPV NKVFNMLSCW FEDPDSEAVK KHIPRIADYL WVAEDGMKMQ GYNGSQLWDC AFSVQAIVAT GLADEYGECL RRAHDYIEKS QVRDDCPDVE KWYRHISKGA WPFSTRDHGW PISDCSSEGL KAALTLASMD EKLVGEAIPV DRLADCVNVI LSYQNRGSGG WATYENTRST KWVELLNPAE TFGDIMIDYP YVECSSASLQ ALCKFSERYP DIRAKDIAHA KKTGRKFLKS IQRADGSWYG SWAVCFTYGT WFGVLGLIAT GSTYETCPSL RKAVEFLLSK QQENGGWSES YLSCEKKAYH ELRDKDGKPK PHLVNTGWAM LALIASGQQN RDATPLHRAA RCMLSQQYEN GDFPQQSIMG VFNANCMISY SCYRSIFTLW ALGEYCNKVL
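Both sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the GC content field. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the demo uses only the first two blocks of the gene sequence rather than the full record.

```python
def join_blocks(seq_text: str) -> str:
    """Remove the whitespace that separates the display blocks."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two display blocks of the gene sequence above, used as a short demo.
demo = join_blocks("ATGCGGGCGG CGTACGCGAA")
print(len(demo), gc_fraction(demo))  # 20 0.7
```

Applied to the full gene sequence, `len(join_blocks(...))` would be expected to equal the listed gene length, and the GC fraction could be compared with the reported 57%. Neither result is asserted here.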