Gene OSTLU_33438 details


Gene Information

Locus tag: OSTLU_33438
Symbol:
ID: 5003741
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 202773
End bp: 203870
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table:
GC content: 64%
IMG OID: 640419162
Product: predicted protein
Protein accession: XP_001419740
Protein GI: 145350705
COG category: [R] General function prediction only
COG ID: [COG1075] Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold
TIGRFAM ID:
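
The coordinate and length fields above are consistent with each other. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(202773, 203870)
print(length)                  # 1098
print(protein_length(length))  # 365
```

The record marks the gene as unspliced, so no intron lengths need to be subtracted before dividing by three.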


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.163153
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCAGA TGTCGAACGC GCTCGCGAGC ACGGCGGCGA GTCGAGGCGA CGTCGGATGG 
TTGGACGACG AGGCGGCGGA AACGACGATC GACGGCGAAT CGGCGTTCGA GGAAGCTTTG
GAAGGAGTTT TAGCGCGTCC TGGGAGCGTG CTCGCGACGG GTGCGGAGTA CTTTTATCTG
CTCGTGCCAG GTTTGTTTGG ATCGTACTAT CCGCGATACT ACGCCGACGT TGAGCAGGCG
TTCCGAGACC GCGGAGCGCA GTGTCGCATC TCGCGTTTGG TCGATGGCGA AGGCGCGGTC
GTAACGAACG CAAAGGCACT GGCGCGCGAG ATTGAAGATA TTCACGCCGA GACTGGGAAA
CGTGTCGTGA TCATTGGACA CTCGAAGGGC GGCGTCGACG GAGGCGCGGC GCTCGCGTTG
CACGACGACA GACTACGAAA GCTCGTGCGC GGTTTAATCG CGGTGCAAAG CCCGTTCGGA
GGGTCACCCA TCGCGACCGA TTTACTCAGC GCGCCGTTGG CGGACCCCGT CGCTTCGCTT
CTTGAAATTT TGGTGAGCGC GCCCAAAGGC GACGGCGCTC GATTGCTCGA GCCTATTCGC
GACTTGACGT ATCGCGAACG TCGCGCTTTT CTCGCCGCGC ACCCCATTCC GAGTCACTAT
CCCGTGGTGT CCTTCGCCAC GGCGACGAAA TCCGCCGCGG CCGGTTTGTT TCCATCCGCG
CGCTACATCG ACAATCGCTA CGGCGAGCCC AGCGACGGCT TGGTGTGCGT TCGCGACGCT
CAAATCCCTC GCGCCGTGTG CGTCAACGTC AAATTTGAAA ACGACCACGC CGACTGCGTG
TTCCCTTCGC GGCACCCCTC CGACATGGTG GACGCGCACG CGCGCGCGCA GGCTGAAAAT
CTCGCCCTGC GCCAGCGTCT GGGTCTGTGC GATTCCCCTC GCCGTGGTCC GCCGCTCCCG
CCGCCCGTCG GCGTCTCCGT CGTCGCCGCG CAGCGCGCGC TCGCCGACGC CCTCCCCGAG
CGCTTAAAGT CCTCCCCCGC GAGCGTCGAT TACCACGAAG CCTTGGTCGG GGTGTTACTC
GCGCGTCCGG GTCCTTAG
 
Protein sequence
MRQMSNALAS TAASRGDVGW LDDEAAETTI DGESAFEEAL EGVLARPGSV LATGAEYFYL 
LVPGLFGSYY PRYYADVEQA FRDRGAQCRI SRLVDGEGAV VTNAKALARE IEDIHAETGK
RVVIIGHSKG GVDGGAALAL HDDRLRKLVR GLIAVQSPFG GSPIATDLLS APLADPVASL
LEILVSAPKG DGARLLEPIR DLTYRERRAF LAAHPIPSHY PVVSFATATK SAAAGLFPSA
RYIDNRYGEP SDGLVCVRDA QIPRAVCVNV KFENDHADCV FPSRHPSDMV DAHARAQAEN
LALRQRLGLC DSPRRGPPLP PPVGVSVVAA QRALADALPE RLKSSPASVD YHEALVGVLL
ARPGP
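
The gene and protein sequences above can be cross-checked against each other. The sketch below assumes the standard genetic code (NCBI translation table 1), since the record leaves the translation table field empty, and assumes the sequence is shown in coding orientation. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. The spaces that split the sequence into blocks of ten are removed before processing.

```python
# Standard genetic code (NCBI table 1), laid out in TCAG order.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First six codons and last six codons of the gene sequence listed above.
print(translate("ATGCGGCAGA TGTCGAAC"))  # MRQMSN
print(translate("GCGCGTCCGG GTCCTTAG"))  # ARPGP
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the protein sequence and the GC content reported in the record.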