Gene OSTLU_31472 details

Gene Information

Locus tag: OSTLU_31472
Symbol:
ID: 5002056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 14909
End bp: 16519
Gene Length: 1611 bp
Protein Length: 485 aa
Translation table:
GC content: 60%
IMG OID: 640417477
Product: predicted protein
Protein accession: XP_001417890
Protein GI: 145346840
COG category: [I] Lipid transport and metabolism
COG ID: [COG3239] Fatty acid desaturase
TIGRFAM ID:
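
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the coding region ends with a single stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_bp = span_length(14909, 16519)   # Start bp and End bp from the record
cds_bp = coding_length(485)           # Protein Length from the record
trailing_bp = gene_bp - cds_bp        # bases not accounted for by the coding region

print(gene_bp, cds_bp, trailing_bp)
```

Under these assumptions the coordinates reproduce the listed gene length of 1611 bp, while 485 amino acids plus a stop codon account for 1458 bp, which leaves 153 bp of the listed gene sequence outside the coding region.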


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGGCG CGGCGAACGC GACGCACATG CTCACGAACG CGCGCGCGAC CGCGCCCGCG 
GGCGCGCGAT CGGCGCGTCG CGCGCGCGTG TTCGCGCGCG CCGACGTCGC GGTGCGCGCG
GCGCCGACGT CGACGGGGCG CGCGCGGGAC GGGCGCGCGC GAGGCGCGCT GGCGGTGACG
CGCGAGCGCG CGAGCGCGGT CGATGGGAAG AGACGGAGAA GCGCGCAGAC GATCGGGGCG
GTGGCGAATC CGCTCGCGGT GCCGACGTAC GACGCGCCGG AGGGGAGGGA TAAGAATGAA
CCGATTCGCG TGAAAATCGG AGACGAGTGG TACGACTGTC GGGGGTGGGC CAAGGCGCAT
CCGGGCGGCG AACGGTGGCT GTACTTTTTC GATGGACGCG ACGCCACGGA CGTGTTCTAC
GCGCTGCATT CGTACGGCCC CAACGGCTCG GATTTGGCCG TGCAAAGGTT GAAGAAGCTT
CCGCGCTGCG ACCCGCCGGC GGACAAGTCT CGCCTTCCGG ATGAGAAATC GTACGCGGTG
AGCATGGCGT TCGGTGAATT GCGCGACAAG CTCGCGGAGG ACGGATTCTT CAAGCGCCAA
CCGCTCAAAG AAGCTTGGGC GCTCTTCCAA GTCGTCGCTC TGTACGTGAG CGGAACCGCT
CTCGCGTATT CGCACCCCGT CTGGGCGACT ATTTTGCTCG GACTCGGGAT GGAACAAGCG
GGTTGGTTGG GTCACGACTA CGTCCACGGA CGTGGGCCGT GGTGCTCGTT GATGCGCTAC
ATGCCGACTA TTTTGAATGG CCACAGTGTG GAGTGGTGGA TGCAAAAGCA CTCGATGCAC
CACTCGTTTA CGAATGAAGA GCACCTCGAC AACGACGTCA TGATGGAGCC GTTCTTCTTC
TTGCGCTCGC CGCAAGAGTC CGGCCGACCG GATCACCCCA TGCGCAAGTT CCAGCACATC
TACGGCTACC CACTGCTATC CATTATGTTT TGGCTCTGGC GATTCCACTC GGTTCAAACC
GCGTGGAAGA AGAGGGATTA CAAAGAGCTC GCCTTCATCG GGGCAAACTA TCTCTTCTTG
GCGACGATGA TGCCGTGGCA GGTGGCTGTC GGCTCTATCA CGCTCAGTGG TTTCCTCGTC
GGCGCTCTCG TGAGTGCGAC GCACCAGAGC GAAGAAATCA TGGAGTTTGG TGAGAATCCA
GAGTACGTGG AAGGGCAATT CCGCTCGACG CGCGACGCCG AGTGCGTCTT CGGCGGCTTG
GAAACGTGGA TTTGGGGCGG AATGGATACG CAGTTGGAGC ATCACTTGTT CCCCACGATG
CCGCGTTACA ACTACCATAA ACTTCGCCCG CTACTGAAGG CGTGGGCCAA GGCAAACGGT
GTCACGTACC GCTCGTCTCC GAGCACGGAG ATCATCGCCG ACAACTTCAA GATGCTCCAT
CGCGTCGCCA CCGCGTAAAA ATTTATTCTT TACCCTGGAC GAATCCAGAC GTTGCTTAAT
TGTGTGTTAC CCGTGTTTCC CTATCGTGAA TAGTTACTTT TGAGACCGCA TGCGCGTGCG
TTGCGGTGTT AGTGTGTTTG TGAAACGAAG CTAAAAAACT CGTAGACAAA A
 
Protein sequence
MVGAANATHM LTNARATAPA GARSARRARV FARADVAVRA APTSTGRARD GRARGALAVT 
RERASAVDGK RRRSAQTIGA VANPLAVPTY DAPEGRDKNE PIRVKIGDEW YDCRGWAKAH
PGGERWLYFF DGRDATDVFY ALHSYGPNGS DLAVQRLKKL PRCDPPADKS RLPDEKSYAV
SMAFGELRDK LAEDGFFKRQ PLKEAWALFQ VVALYVSGTA LAYSHPVWAT ILLGLGMEQA
GWLGHDYVHG RGPWCSLMRY MPTILNGHSV EWWMQKHSMH HSFTNEEHLD NDVMMEPFFF
LRSPQESGRP DHPMRKFQHI YGYPLLSIMF WLWRFHSVQT AWKKRDYKEL AFIGANYLFL
ATMMPWQVAV GSITLSGFLV GALVSATHQS EEIMEFGENP EYVEGQFRST RDAECVFGGL
ETWIWGGMDT QLEHHLFPTM PRYNYHKLRP LLKAWAKANG VTYRSSPSTE IIADNFKMLH
RVATA
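
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch with hypothetical helper names, not the method used to produce this record. It assumes GC content means the fraction of G and C bases over the whole gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (whitespace ignored)."""
    bases = clean_sequence(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Example using only the first two blocks of the gene sequence shown above.
fragment = "ATGGTCGGCG CGGCGAACGC"
print(len(clean_sequence(fragment)), round(gc_content(fragment), 2))
```

Pasting the full gene sequence and protein sequence into `clean_sequence` and taking `len` of each result gives values that can be compared with the Gene Length and Protein Length fields, and `gc_content` of the full gene sequence can be compared with the GC content field.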