Gene OSTLU_94891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_94891 
Symbol 
ID5004040 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp431490 
End bp432818 
Gene Length1329 bp 
Protein Length434 aa 
Translation table 
GC content60% 
IMG OID640419461 
Productpredicted protein 
Protein accessionXP_001420173 
Protein GI145351632 
COG category[I] Lipid transport and metabolism 
COG ID[COG3239] Fatty acid desaturase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.700078 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACACG AGAGCGCGGA CGACGCGACG GCGACGGCGA CGGCGGCGGG CGACGATGGG 
AAGGCGAAGA AGCCGTGGGG GTTCCTGAGC CCGTTCGGCG CGAGCGCGGC GGTGCTGGAA
CAGGCGGCGG AAGCGAACGA GAAGCAGAGC GGGCGCCGGC GACGGACGGA GGAAGTGAAG
CTGAAGGTCG ATGGACAGTG GTACGATGCG ACGGGGTGGG CGCTGGCGCA TCCCGGTGGG
GCGGGGTTCG TGCGGTTGTT GAACGGACAA GACGCGACGG ATGTGTTTTA CGCGCTGCAC
TCGTACGGGC CGAACGGGAG CGACGAGGCG TTGAAACGCT TGCGCGCGTT GCCGAAGTGC
GATGCACCGT ATGATACGGA AGAGTACGAA ACGCAAAGGC TGACGACGGC GAACGCCGAG
TTCGGGGAGT TGCGAGGGAA GTTGGAGGCG GAGGGCTGGT TTAAACGCAA CGCGCTCTCC
GAACTCTCGG TGTTGGCGCA AGTTTTGGGA TGCTACGTCG TCGGACAAGC CATCGCGGCG
ACGCACCCGA TCTTAGCCGC GATCTCGATC GGGATAGGGA TGCAACAAGC CGGATGGCTC
GCGCACGACT ACGTTCACGG CCGCGGGAAG TGGTGCTCGA TGATGCGCTG CTTTGGCGCG
TTGACGAATG GGTTTTCCGC CGAATGGTGG TCGCACAAGC ACAACATGCA CCACTCGTTC
ACGAACGTCG ACGGTAAGGA CGGCGACATC AAACTCGAGC CTCTGTATTA CTTGTCGCCG
CCGGAGACCA GTGGGCGCCC GGATAGCTGG TTGCGCAAGT ACCAGCACAT CTACGGCTAT
CCGCTCTACG CGATGACGTA CGTGCTCTGG CGCCGACACA GCGTCGCGAG TGCGTGGGCG
CGCAAGGATA AGACTGAGCT CGCTTTGCTC GTCGGCCACT ACGCGTGGTT GTTCGGCACG
CTTCCTTTGG GTGTCGCCAT CGGCTCCATG CTCATCGGTG GGTTTTTGGT AGGCTCTCTC
GTCACCGCTA CGCACCAGAG CGAGGAAATC ATGTATGAAG ACGGTTCCTT CGTCGATATT
CAGTTTAGAA GCACTCGCGA AGCGGACGTG AAGAATCCAC TCGAGCGTTG GTTGTGGGGT
GGTATGGACA CGCAGCTCAT TCACCACTTA TTCCCCACCA TGCCGCGTTA CAAGCTTCAC
AAGCTTCGTC CCATCATGCA GGAATGGGCC CAGAAACACG GATACGATTT TAGAATCTCC
GATTCGCGCG ACATCCTGAA GAAGAACTAC AAACATCTTG AGGGTATCGC CGCATTGGAG
ACGATTTAA
 
Protein sequence
MGHESADDAT ATATAAGDDG KAKKPWGFLS PFGASAAVLE QAAEANEKQS GRRRRTEEVK 
LKVDGQWYDA TGWALAHPGG AGFVRLLNGQ DATDVFYALH SYGPNGSDEA LKRLRALPKC
DAPYDTEEYE TQRLTTANAE FGELRGKLEA EGWFKRNALS ELSVLAQVLG CYVVGQAIAA
THPILAAISI GIGMQQAGWL AHDYVHGRGK WCSMMRCFGA LTNGFSAEWW SHKHNMHHSF
TNVDGKDGDI KLEPLYYLSP PETSGRPDSW LRKYQHIYGY PLYAMTVASA WARKDKTELA
LLVGHYAWLF GTLPLGVAIG SMLIGGFLVG SLVTATHQSE EIMYEDGSFV DIQFRSTREA
DVKNPLERWL WGGMDTQLIH HLFPTMPRYK LHKLRPIMQE WAQKHGYDFR ISDSRDILKK
NYKHLEGIAA LETI