Gene OSTLU_49403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_49403 
Symbol 
ID5001409 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp300339 
End bp301769 
Gene Length1431 bp 
Protein Length332 aa 
Translation table 
GC content57% 
IMG OID640416830 
Productpredicted protein 
Protein accessionXP_001417471 
Protein GI145345971 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.220793 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGGTCGACGC GCTCGACGCG GTCGACGCGA GCGAGGCGCG CGCTTCGAGC GACGCGAAGG 
CGTCGGACGC GAACGCGAAG GCGTCGAGCG CGAACGCGGG ACCCGACGGG CGGTGAAGGC
GCGCGACGAA AGACGAGGAA GGCGCGCGCG AACGATGAAC GCGGAGAAGA GGGCGGAAAT
ATACACCTAT GAGGCGCCGT GGATGATCTA CGCGTGCAAT TGGAGCGTGC GTGGCGAGCG
AGGCGATGGA TTGGGGGCGA GCGCGGGAGA ATTGAATCGC GAGGGGCGAC GGAGGAGACG
CGACGGAGGA GACTCGGGGA CGCGCGCGAA CGGTCGATCG GAGATTAAAA ATGGAGACGC
GCGAGTGAAG ACGCGAATGG CGTGGACTGA CGACGTCGAA TTGAACGCGA CAGGTTCGAC
AAGATAAACG CTTCCGCCTC GCCTTGGGTT CGTTCGTGGA GGAGTATAGC AACAAGGTTG
AGATCATCAC CTTGGACGAG GAAACCGGGG AGTTTCCGAA GGAGGCGCAG TGTTCGTTCA
CGCATCCGTA TCCTTGCACG AAAATTTTGT TCATTCCGGA CAAGGAGTGC ACGAAGGAGG
ATTTGTTAGC GACGACGGGG GACTACTTGC GAATCTGGCA AGTGCAGGAT GATAACACGG
TGCAGATGAA ATCTTTACTG AATAATAACA AGAGCAGCGA ATTTTGCGCA CCGCTGACGA
GCTTTGATTG GAACGAGACC AAGCTTCAGC GAGTGGGGAC GTCGTCGATC GACACGACGT
GTACGATTTG GGACATCGAG CGCGAGTGCG TGGACACGCA GCTCATCGCG CATGATAAGG
AGGTGTACGA CATCGCGTGG GGTGGTCCAG AGGTTTTCGC TAGCGTAAGT GCGGATGGAA
GTGTGCGAGT TTTCGACTTG AGAGACAAGG ATCACAGTAC GATCATTTAC GAGAGTCAAA
CTCCAGACAC GCCGCTGCTG CGTTTGGGGT GGAACAAGCA GGATCCGAGA TACATGGCCA
CCATTTGCAT GGATAGTCCG GTGATCATTC TCGATATTCG CTTCCCGACG TTGCCGGTCG
CAGAACTTCA GAGTCACAGA GCGAGCGTGA ATACATTGGC GTGGGCGCCA CACAGCTCAA
GCCACATGTG CACGGCGGGC GACGACAGTC AGGCGTTGAT TTGGGATTTG TCGTCCATGA
ATCAACCACC CGAAGGCGGT CTCGACCCTA TTCTCGCTTA CTCTGCTGGA GCAGAAATCA
ATCAGTTACA GTGGAGCGCG TCGCAACCGG ATTGGATCTC GATAGCTTTC CGAAACAGCC
TCCAGATCCT CCGAGTTTAG TCAACGCGCT GTCAGGTCTG CGCCGACGCC ACTGTATATT
ACCCGAATTT CCGGATACGC GACACACGAC ACACGACACG CACGCACGTA G
 
Protein sequence
MNAEKRAEIY TYEAPWMIYA CNWSVRQDKR FRLALGSFVE EYSNKVEIIT LDEETGEFPK 
EAQCSFTHPY PCTKILFIPD KECTKEDLLA TTGDYLRIWQ VQDDNTVQMK SLLNNNKSSE
FCAPLTSFDW NETKLQRVGT SSIDTTCTIW DIERECVDTQ LIAHDKEVYD IAWGGPEVFA
SVSADGSVRV FDLRDKDHST IIYESQTPDT PLLRLGWNKQ DPRYMATICM DSPVIILDIR
FPTLPVAELQ SHRASVNTLA WAPHSSSHMC TAGDDSQALI WDLSSMNQPP EGGLDPILAY
SAGAEINQLQ WSASQPDWIS IAFRNSLQIL RV