Gene OSTLU_4044 details

Gene Information

Locus tag: OSTLU_4044
Symbol:
ID: 5002361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 186566
End bp: 187552
Gene Length: 987 bp
Protein Length: 329 aa
Translation table:
GC content: 55%
IMG OID: 640417782
Product: predicted protein
Protein accession: XP_001418174
Protein GI: 145347440
COG category: [R] General function prediction only
COG ID: [COG0561] Predicted hydrolases of the HAD superfamily
TIGRFAM ID:
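
The start, end, and length values above agree with each other if the coordinates are read as inclusive and 1-based: 187552 - 186566 + 1 = 987 bp, and 987 / 3 = 329 codons, matching the listed protein length. A minimal Python sketch of that check follows. The function name `span_length` is hypothetical, and the inclusive-coordinate convention is an assumption inferred from the numbers, not something the record states.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values taken from the Gene Information fields above.
gene_length = span_length(186566, 187552)

# A coding sequence is read three bases at a time, so split into codons.
codons, remainder = divmod(gene_length, 3)

print(gene_length, codons, remainder)  # 987 329 0
```

A remainder of 0 means the span divides evenly into codons, with no partial codon at either end.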


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.000894209
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.644992
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GAGGTTTTCA CGAAGGTGGA GACGTGGATC GACGAGGTGC TCGAGCTGCC GAAAGAGAAG 
CGGAGGTTTA GTAAGCTTTC GAGAATGGTA CCCGCGGTTG GGTTTTTCTT CCACCGCTTG
CCTTTGTTGA AGGCGCTGCG AGAGTACGAC GAGTTTTCCT CGTTGTCCAA GCGGCGGTAC
GTGCTTCCAA ACTTTGCCGA GGTGCGGCAC ATTTTGAACA TCGCGCAAGT GCACGCGTCG
AGTAAGGACG TGCGGTTGGT GACTTTCGAT GCCGATGGAA CGTTGTACGC CGACGGTGAG
CACTTTGAGG ACGATAATAA GATGATCGAT AAGATTATGC AGCTCATGGA GTTGGGCATT
CACGTCGCCA TTGTCACCGC CGCGGGTTAT CCGGGCGAGC CGACCAAGTT TGAGGGGCGA
CTGAAGGGTT TGGTGGACGC TTTCGAGGCG CAAGCGCTGC CGAAAGAAGT GTACGAAAAG
TTTCACGTCA TGGGTGGCGA ATGTAACTAC CTCTTGCGAG TTAACGACGA GTATCGCCTG
GAGTTTGTAC CCTCGGAGGA GTGGCACAGC GAGCACATGT ACGACTGGAG AGACAACGAC
GATGTTCGCA TGTTCCTCGA CCGCGCGGAA GAATTCTTGA CCTCATATGC GAAGCACTTG
GGCGTTCAAG TGGATGTCTT GCGCAAGGAA TACGCGGTCG GAGTCTTGCC CAAGGGCGAT
ACCATTTACG AAAACTTGGA AGAAATGGCG CTCGCGAGCC AAGCCGAGCT TAGCGACGCG
AAGATTCCAT TCTGCGCCTT CAACGGAGGT AACGACGTTT TCGTGGACGT AGGTAACAAG
CACATCGGTT TGCAAGCGCT CATGAAGTAC TTAAACGTCG CTGGTTCGCA AACTTTGCAC
GTCGGCGATC GTTTCACTCT CACGGGTAAC GACGCCAAGG TGCGCGAAGC GGCGTCCATT
CTCTGGGTCG CGAGTCCGGA CGAAACC
 
Protein sequence
EVFTKVETWI DEVLELPKEK RRFSKLSRMV PAVGFFFHRL PLLKALREYD EFSSLSKRRY 
VLPNFAEVRH ILNIAQVHAS SKDVRLVTFD ADGTLYADGE HFEDDNKMID KIMQLMELGI
HVAIVTAAGY PGEPTKFEGR LKGLVDAFEA QALPKEVYEK FHVMGGECNY LLRVNDEYRL
EFVPSEEWHS EHMYDWRDND DVRMFLDRAE EFLTSYAKHL GVQVDVLRKE YAVGVLPKGD
TIYENLEEMA LASQAELSDA KIPFCAFNGG NDVFVDVGNK HIGLQALMKY LNVAGSQTLH
VGDRFTLTGN DAKVREAASI LWVASPDET
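
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below translates only the first row of the gene sequence (60 bases) and reproduces the first 20 residues of the protein. It assumes the standard genetic code, because the Translation table field in this record is blank, and it assumes the reading frame starts at the first base. `CODON_TABLE` and `translate` are illustrative names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, unknown codons become 'X'."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First row of the gene sequence listed above.
first_row = "GAGGTTTTCA CGAAGGTGGA GACGTGGATC GACGAGGTGC TCGAGCTGCC GAAAGAGAAG"

print(translate(first_row))  # EVFTKVETWIDEVLELPKEK
```

The same function can be applied to the full gene sequence by pasting all rows into one string, since spaces and line breaks are stripped before translation.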