Gene OSTLU_41114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_41114 
Symbol 
ID5002577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp234832 
End bp235752 
Gene Length921 bp 
Protein Length306 aa 
Translation table 
GC content53% 
IMG OID640417998 
Productpredicted protein 
Protein accessionXP_001418189 
Protein GI145347473 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.830125 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGCTTGGTGA AGCCTTTGCA AGATTTCGGC TTTGGGCGCA CGCGCTTGTG GGAAGGAGGC 
GTTGGGTTAT TTATGATAAC TGGTGTCGCG CTTTCCTTCG TGATTTGGGG ATGGATCCAA
GGCTTGCTGA GCTTTGCGCG CAAAAACTCG TATCAGGCAT TTATCGAGTT TCCGGTGGCG
TGCGGGATCC AAGTCGGCAC GAATGTTCGA ATTCGTGGCG TCAAGGCTGG TTCCGTGCTG
AGCGTGCAAC CGAGCTTGGA GAAGGTCGAA GTGCTTGTGG AGATGGACGA CAAGAACGTT
CCCATACCTC GCAACTCTCT CATAGAGGCA AACCAAAGCG GTTTGATCGC AGAAACAATC
ATTGACATCA CTCCCGCCAT CCCGATTCCA GTGGCTCAGT GGGGGCCTTT GGATTCTGGA
TGTGAAGGTG AGGGCGTGAT CGTGTGTGAT CGGGGTAAGA TTAAGGGTCT GCCGGGGGTG
AGCATGGACG AACTCGTCGG TATTTGTACG AAGCTCGCGA GAGAGATGGA AAGGCAAGAC
GGCATGAACA AGATGTTCGA TACGACCGAC ACGGCTAGAC GACTGATGAC GACTTTGCAA
CCGCTTCTTC GTGAGGCGGC GCAAATCGCC CAAGAGCTCC GACCGATGAT GCAAGGAGTG
AACGAACAAG GCACTTTGGA CACGCTCGAA TCGCTCGCCG GTCAGACATC AGCCACCGTG
GAAGACATCA GAAAGCTCAA GGATGCAATT TTGACCGAGG AGAATCAAGA ACTTCTTCGA
CAATCCATTT CGACGCTCAC GAAGACGCTG CAACACGTCG AAAAGGTGAG CGGGGATATT
AGTAGTGTGT CCGGTGATCC GAGCACTCGC GCAAACTTGC GACACTTGAT TCAATCGCTG
TCGCGATTGG TTGACGCATA G
 
Protein sequence
RLVKPLQDFG FGRTRLWEGG VGLFMITGVA LSFVIWGWIQ GLLSFARKNS YQAFIEFPVA 
CGIQVGTNVR IRGVKAGSVL SVQPSLEKVE VLVEMDDKNV PIPRNSLIEA NQSGLIAETI
IDITPAIPIP VAQWGPLDSG CEGEGVIVCD RGKIKGLPGV SMDELVGICT KLAREMERQD
GMNKMFDTTD TARRLMTTLQ PLLREAAQIA QELRPMMQGV NEQGTLDTLE SLAGQTSATV
EDIRKLKDAI LTEENQELLR QSISTLTKTL QHVEKVSGDI SSVSGDPSTR ANLRHLIQSL
SRLVDA