Gene OSTLU_12302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_12302 
Symbol 
ID5000621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp155931 
End bp158093 
Gene Length2163 bp 
Protein Length720 aa 
Translation table 
GC content57% 
IMG OID640416042 
Productpredicted protein 
Protein accessionXP_001416570 
Protein GI145344088 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0477917 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.996732 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGCGG CGTACGCGAA GACGCGGCAC GAGCGTAAAC ACAGCGAAGA CGCGTTTCTG 
CGCGCGCAGT ACGACGCGAG GCGAAGCGCG CGAGGCTTGG AGACGAAGAC GAAGACGCCG
AGCGCGATGG AGATTTCGAG AGGGGAAAAA GAGGGGAAGG GCGTCGACGG GGAGGTGGTG
CGGCGAGCGA TGCGAGCGGG GATCGAGTAT TATCGAGGGA TACAGGACGA GGACGGACAC
TGGGCGAGCG ACTACGGCGG GCCGATGTTT TTGATGCCGG GGTTGATCAT CGCGGCGAAC
GTCATGGAAA AATCGGAGGA GATCATCGGT GAAGCGCGAG GGCGGGAGAT GCTGAGATAC
CTGGAGAATC ACTTGAACGA GGATGGGGGC GTTGGGTTGC ACATCGAAGG TCACAGTACA
ATGTTTGGGA CGGTTTTGAC GTACGTGGCG ATGCGGTTGC TCGGCAAGGC GGCGGATTCG
CCCGAGTGCG CGAAAACGCG CAAGTGGATC ATCGATCGCG GTGGCGCGAC GCAGGTACCG
TCGTGGGGGA AGTTTTGGCT CGCCGTGCTG GGGGTGTACG AGTGGCACGG GTTGAATCCG
ATTCCGCCCG AGTGTTGGTT GTTGCCGTAT TGGCTCCCCA TGCATCCGGG TCGATTCTGG
TGCCACTGTC GCATGGTGTA CTTGCCGATG AGTTACCTGT ACGGCATCCG GGCCACGGGT
AAACCGACGG CGTTGACCGC GGCGTTGAAA TCGGAGTTAT ACGCCGAGGC ATCGTACGAC
TCAATCAACT GGAACACTGC GAGAAACGCG TGCGCGAAGG AGGACTTGTA CTACCCGCAC
CCGTGGATTC AAGACGTCGT CTGGTCGACG CTCATGAAGG TCGAGCCGTT CTTGATGAAT
TCTCGCGTGC GCAAGGCGGC GTGCGAAGAT GCGATGCGGC AGATTCATTA CGAGGACGAA
AACACGCGCT ACGTGGACAT TGGGCCGGTG AATAAAGTCT TCAATATGTT GTCATGCTGG
TTTGAAGATC CCGATTCAGA GGCGGTGAAG AAGCACATTC CACGCATCGC CGACTACCTC
TGGGTCGCCG AGGACGGGAT GAAAATGCAA GGATACAACG GAAGTCAGCT GTGGGACTGC
GCGTTTTCCG TTCAAGCGAT CGTAGCGACG GGTTTAGCGG ACGAATATGG CGAGTGCTTA
CGTCGCGCGC ACGACTACAT CGAAAAGAGT CAAGTACGAG ACGACTGCCC TGATGTCGAA
AAGTGGTATC GTCACATTTC CAAGGGTGCG TGGCCGTTCA GCACACGCGA TCACGGTTGG
CCGATCTCCG ATTGCTCCTC CGAAGGCTTG AAGGCGGCTT TGACGCTGGC GTCGATGGAC
GAGAAACTCG TCGGCGAGGC GATTCCCGTC GACCGACTCG CGGATTGCGT GAACGTCATC
CTATCCTATC AAAATCGCGG CAGTGGTGGC TGGGCTACGT ACGAAAATAC TCGATCGACG
AAGTGGGTCG AGCTCTTGAA CCCAGCGGAG ACGTTCGGGG ACATCATGAT CGATTACCCG
TACGTTGAGT GCTCGAGCGC GAGTTTGCAG GCGCTGTGCA AATTTTCTGA GCGCTATCCG
GACATTCGCG CCAAGGATAT CGCGCATGCC AAGAAGACTG GCCGAAAGTT TTTGAAAAGT
ATTCAACGCG CCGATGGAAG CTGGTACGGA TCGTGGGCGG TGTGCTTCAC GTACGGAACG
TGGTTCGGCG TCCTGGGATT GATCGCCACG GGGTCGACGT ACGAGACGTG CCCATCGCTT
CGTAAAGCAG TCGAGTTCTT GCTCTCGAAA CAGCAAGAAA ATGGTGGATG GAGCGAGAGC
TATTTATCGT GCGAGAAAAA GGCGTACCAC GAGCTTCGCG ACAAGGATGG TAAGCCGAAG
CCGCACTTGG TAAACACGGG ATGGGCGATG CTTGCGCTCA TCGCCAGCGG TCAACAAAAC
CGCGACGCGA CGCCATTACA TCGCGCCGCA CGATGCATGC TCTCACAACA ATATGAGAAC
GGCGATTTCC CGCAGCAAAG CATCATGGGT GTCTTCAACG CCAACTGCAT GATTTCCTAC
AGTTGCTACC GAAGCATTTT CACACTCTGG GCGCTCGGCG AGTACTGCAA CAAAGTTCTG
TAA
 
Protein sequence
MRAAYAKTRH ERKHSEDAFL RAQYDARRSA RGLETKTKTP SAMEISRGEK EGKGVDGEVV 
RRAMRAGIEY YRGIQDEDGH WASDYGGPMF LMPGLIIAAN VMEKSEEIIG EARGREMLRY
LENHLNEDGG VGLHIEGHST MFGTVLTYVA MRLLGKAADS PECAKTRKWI IDRGGATQVP
SWGKFWLAVL GVYEWHGLNP IPPECWLLPY WLPMHPGRFW CHCRMVYLPM SYLYGIRATG
KPTALTAALK SELYAEASYD SINWNTARNA CAKEDLYYPH PWIQDVVWST LMKVEPFLMN
SRVRKAACED AMRQIHYEDE NTRYVDIGPV NKVFNMLSCW FEDPDSEAVK KHIPRIADYL
WVAEDGMKMQ GYNGSQLWDC AFSVQAIVAT GLADEYGECL RRAHDYIEKS QVRDDCPDVE
KWYRHISKGA WPFSTRDHGW PISDCSSEGL KAALTLASMD EKLVGEAIPV DRLADCVNVI
LSYQNRGSGG WATYENTRST KWVELLNPAE TFGDIMIDYP YVECSSASLQ ALCKFSERYP
DIRAKDIAHA KKTGRKFLKS IQRADGSWYG SWAVCFTYGT WFGVLGLIAT GSTYETCPSL
RKAVEFLLSK QQENGGWSES YLSCEKKAYH ELRDKDGKPK PHLVNTGWAM LALIASGQQN
RDATPLHRAA RCMLSQQYEN GDFPQQSIMG VFNANCMISY SCYRSIFTLW ALGEYCNKVL