Gene OSTLU_31144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31144 
Symbol 
ID5001391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp378705 
End bp380215 
Gene Length1511 bp 
Protein Length456 aa 
Translation table 
GC content58% 
IMG OID640416812 
Productpredicted protein 
Protein accessionXP_001417488 
Protein GI145346006 
COG category[I] Lipid transport and metabolism 
COG ID[COG1562] Phytoene/squalene synthetase 
TIGRFAM ID[TIGR01559] farnesyl-diphosphate farnesyltransferase 


Plasmid Coverage information

Num covering plasmid clones62 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCGACGGGTC GTCCGGCGTC GAGTCGACGC GATCGATCGC GCGCGTCGCG ACGAACGATT 
CGAACGCGGA TCGATCGTCC GACGACCCGG TCGCGGATTC GGTCGCGAGA CGCGCGACGC
GACGGATGGG GGCGCTTCGG GCGGTGCTGT CGCACCCGGG GGACGTCGCG CCGCTGGTGC
GGGTGAAGAT GATGGCGAAC GCGGCGAAGA GATTGCCGAA GGATCCGGAT CTGGCGTACT
GCTACGACGT GCTGAATAAG GTGTCGAGAT CGTTCGCGAT CGTGATTCAG CGGCTGGACG
CGGAGCTGCG AGACGCGGTG TGCGTGTTTT ATCTCGTGCT TCGAGCGCTG GACACGGTGG
AGGACGACAT GAGCATACCG ATCGAGAGGA AGGTGCCGGA TCTGCTCGCG TTTCACGAGT
ACATTTACGA CGCGAATTAT AGCGCGGATT GCGGGGAGAA GCACTACAAG GATTTGATGA
AGAATTATCC GCGCGTGACG TCGGTGTTCT TGGGATTGAA GAAGGAGTAT CGAACGGTGA
TCGCGGACAT CACCAAGCGA ATGGGGCACG GGATGGCGAA GTTTATCGAG AAGGAGGTGA
TGGACATGGC GGATTTCGAC GAGTATTGCC ACTACGTCGC GGGCTTGGTC GGCATCGGTC
TGTCCAACTT GTGGGGGGTG TCGAAGATGG AGAGCGCGGA GTTTGTGGGC GAGGAAAAGC
TCTCGAACGC CATGGGATTG TTTTTACAGA AGACGAACAT CATTCGCGAT TACTTGGAAG
ACATTCAGGA GCTTCCCGCG CCGCGAATGT TCTGGCCTAG GAGCGTGTGG AGCAAGTACG
CCGAAGAGCT CGACGATTTG CAACACGAGG AAAACCGCGA AAAGGCGGTG CAGTGCATGA
ACGAATTGAT CACCAACGCC CTGTCGCACT CGTTGGATTG CTTAAAGTAC ATGAGCCGAG
TCAAGGAGAT CTCCATCTTC CGGTTCTGCG CCATTCCTCA GGTGATGGCC ATCGCGACGT
TGGCCGAGTG CTACGGCAAC GAAAACGTCT TCAAGGGCGT GGTCAAGATT CGCCGAGGCC
TGAGCGCGCG CATCATGCTC AAGTGCAACA ACATGCTCGA GTTAGCGAGC GGCTTCAAGC
ACTTCGCCAA CGAACTCGTG CACAAGGTGG ACCCGAAGAG CGATCCGAAC GGCGAGGAGA
CCATGGCGCG CATCGAGGCT CTCGAGGAGG CGTGCGACGA AATCATCGCC AAGGAAACCG
AACGCATGCG CAAGGAGCAA ATAGAGGACG ACTCCATTCC CATGGCGACC AGAATCGTCC
TGTGGTTGTT GTGTTTCGGC TACTTTTTGT ACGCCTGGAA TCTCGAAAAC GTTCGCGAGT
CGCTCGGCGT CAACAGACAC GCCGGCGATC CTCTCGTCGA CGACGCCCAA AAAGTCTTGG
CGACGATTTG CATCATCTCC ACGACCGCGC TCGTGATGGC GGGTAAAAAG AGATGATTTC
GGTTCGCGGC G
 
Protein sequence
MGALRAVLSH PGDVAPLVRV KMMANAAKRL PKDPDLAYCY DVLNKVSRSF AIVIQRLDAE 
LRDAVCVFYL VLRALDTVED DMSIPIERKV PDLLAFHEYI YDANYSADCG EKHYKDLMKN
YPRVTSVFLG LKKEYRTVIA DITKRMGHGM AKFIEKEVMD MADFDEYCHY VAGLVGIGLS
NLWGVSKMES AEFVGEEKLS NAMGLFLQKT NIIRDYLEDI QELPAPRMFW PRSVWSKYAE
ELDDLQHEEN REKAVQCMNE LITNALSHSL DCLKYMSRVK EISIFRFCAI PQVMAIATLA
ECYGNENVFK GVVKIRRGLS ARIMLKCNNM LELASGFKHF ANELVHKVDP KSDPNGEETM
ARIEALEEAC DEIIAKETER MRKEQIEDDS IPMATRIVLW LLCFGYFLYA WNLENVRESL
GVNRHAGDPL VDDAQKVLAT ICIISTTALV MAGKKR