Gene OSTLU_50433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_50433 
Symbol 
ID5003654 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp251748 
End bp252855 
Gene Length1108 bp 
Protein Length331 aa 
Translation table 
GC content61% 
IMG OID640419075 
Productpredicted protein 
Protein accessionXP_001419536 
Protein GI145350271 
COG category[R] General function prediction only 
COG ID[COG5273] Uncharacterized protein containing DHHC-type Zn finger 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.120441 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGCGCGGTCG CGTCGCGCGC GCGTCGCCGC GGTCGCTCGG ACTCGATTCC GACGACGCGC 
GCGCGCGATG GCGTCGCCCG CGCGACGGAC GCGCGAGACG GACGACGCGA CGGTGAACGC
CATGGGTGGC GATCGAAACG TGTTCGCGTC GCGCGCGTGC GACGCGTGCC GCGCGCTCGG
GTCGTTCATG GTGCTCGTCG TGCTGGCGAT CGTCGGGCTG ACGTACTACG CCACGGTGGT
CGTCGTGTAC GGACCGTTGG CGGCGGAGGG GGGGGAGGAC GCGGGCGTGG CGACGGGGGC
GCTGTGCGCG TATCACGTCT TCGCGTTCAT GCTGCTGTGG TCGTACTTTG CGTGCGTGCT
GACGGCGCCG GGAGACGTGC CGAGGGGGTG GACGCCGGCG CCGGAGGATC CCGAGGAGGC
GGCGTCGGAG GCGAAGAAGT CGAACAGCGA AAAGAGACGG CGGTTTTGTA AAAAGTGCGC
GGCGTGGAAG CCGACGCGGA CGCACCACTG CTCGGTGTGC AAACGATGCG TGTTGAAGAT
GGATCATCAC TGCGTGTGGG TCGCGAATTG CGTGGGGGCG TATAACTATA AATTTTTTCT
GCAGTTTTTG GCGTACACGT TCTTGGCGAC GGTGCTGGAT GCGATTTTAC TGTTGAGCAA
TTTTATAGAT TTCTTCAAAG ACGTCGAGGA GAGTCAGGCT GCGGGAAGCC AAGGGGCGGA
CGCGAAGGTC GATCCGGCGG AAGGAACGGA GTTAGCGGTG GTGTTTGTGA CGTTTATAGT
CAACGTGGCG TTCTCGGCGT CGTTACTGGG CTTTTTAGTG ATGCACGGTA ACTTGATCCT
GAGCAACATG ACGACGATCG AAATGTACGA AAAGAAAAAG ACGCTTCCGT GGAAGTACGA
CTTGGGAAGG TTCAGAAACT TCAAGGAAGT GTTTGGAGAG AACGTTTTCA TGTGGTTCCT
CCCCGTGCAT TCGAGCTCGC ACTTGGAAAA GATGCGCGTG AACACGGGGA TTTCAGACGG
GGAATGTTTA GAAGGCGCCG CGTACGCCAG GGCGTGCGAA AGCGCGCAAC GAGAGGCGAC
GATCGGGAAT AGAAAAGGTC GAGCGTAG
 
Protein sequence
MASPARRTRE TDDATVNAMG GDRNVFASRA CDACRALGSF MVLVVLAIVG LTYYATVVVV 
YGPLAAEGGE DAGVATGALC AYHVFAFMLL WSYFACVLTA PGDVPRGWTP APEDPEEAAS
EAKKSNSEKR RRFCKKCAAW KPTRTHHCSV CKRCVLKMDH HCVWVANCVG AYNYKFFLQF
LAYTFLATVL DAILLLSNFI DFFKDVDPAE GTELAVVFVT FIVNVAFSAS LLGFLVMHGN
LILSNMTTIE MYEKKKTLPW KYDLGRFRNF KEVFGENVFM WFLPVHSSSH LEKMRVNTGI
SDGECLEGAA YARACESAQR EATIGNRKGR A