Gene OSTLU_39442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_39442 
Symbol 
ID5004756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp260014 
End bp261144 
Gene Length1131 bp 
Protein Length376 aa 
Translation table 
GC content55% 
IMG OID640420177 
Productpredicted protein 
Protein accessionXP_001420801 
Protein GI145352960 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.512984 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACG CGGTGTTGGG GAAGGTAGAA AAGTTAGAAA AGGAGAATGC AGAGCTTGTG 
GAGCGACTGA TGGAGATGAA GACGAAGGAG GCGGAGAAGA TGAATGAAAT TAACGATTTG
TACGCTGATT TGTTGCGACA AAAGAAGAGC GTAGAATTGA ATGCGAGGGC GGAGAGTCTC
GCTGAGGCCT CGGCGGCGAG TATGAAGGCG CTTTCGATGT CTACCGTCAT GAGCAACGTC
GTGCCTTCGA AGAAGAGACA TATTTTACAA AGTAACAAGG GTGGGACGCA CCGTGTCGCC
CTCTCGCACG ATGGTTTCAC CGTAGCAAGC GCCGGAGAGG ACAAGGTAAT CGCTATGTTC
GACACGAATA CGGGCGCACG AACCAGCGAA TTGACTGGTT TGCTCGGTGC AGCATTAGAT
GTTACATTTA GCGCCGACGA CTCGCTCGTC CTCGGGACCT CGACCGATTG CTCGCTCCAG
CTTTGGGACG CGCTCACCGG TCGCGTACGG CACCGACTCA CTGGGCACGC GCAAAAGGTG
ACGTCGGCGC GAATCAGCCA GATTGATGCT AAGCGCGCGA TTTCGTGCTC TCAGGACCGG
AACGTGAAGC TTTGGGATCT CAATCGTGGA CACGTAACGT CATCCATGTT GACTTCAAGC
GGCGTGTACT CGGTCGTCTT CGACGCAAAC GAACAGCAAG CGTATTCCGG TCACTTTGAT
GGCGCGATTC GCGCGTGGGA TCTTCGCGCG GGGAACGTAG CGCGCGAAAC GAAGGTGCAT
AATGGTTTAA TCACCGCCGT CTTCGATACG CCAAATCAAA ACGAAATTCT GACAAACAGT
CGCGACAACA CGTTGAAACT CGTCGATATT CGAACAATGG ACGTCGTGCA AACGTTCTCC
GCGCCAAAAT ATCGCGTCGG CACTGATTGG AGTAATCCTT GCGTGTCACC GGATGGACAA
CATATCGCAT CTGGCGGGGC AGACGGGGCG TTATTCATCT GGCGTGTACA GGGCGGACGC
TTGATGACGA CGTTGCACGG TCACGACGCC GTCGTCGCGA CGTGCGCGTG GAACGCGGCG
GGCGTGCTCG CGTCGGCGTG CAAAAATGGC GTGTGTCTGC TGTGGGAATA G
 
Protein sequence
MKNAVLGKVE KLEKENAELV ERLMEMKTKE AEKMNEINDL YADLLRQKKS VELNARAESL 
AEASAASMKA LSMSTVMSNV VPSKKRHILQ SNKGGTHRVA LSHDGFTVAS AGEDKVIAMF
DTNTGARTSE LTGLLGAALD VTFSADDSLV LGTSTDCSLQ LWDALTGRVR HRLTGHAQKV
TSARISQIDA KRAISCSQDR NVKLWDLNRG HVTSSMLTSS GVYSVVFDAN EQQAYSGHFD
GAIRAWDLRA GNVARETKVH NGLITAVFDT PNQNEILTNS RDNTLKLVDI RTMDVVQTFS
APKYRVGTDW SNPCVSPDGQ HIASGGADGA LFIWRVQGGR LMTTLHGHDA VVATCAWNAA
GVLASACKNG VCLLWE