Gene OSTLU_18155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18155 
Symbol 
ID5005281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp524900 
End bp526234 
Gene Length1335 bp 
Protein Length444 aa 
Translation table 
GC content63% 
IMG OID640420702 
Productpredicted protein 
Protein accessionXP_001421489 
Protein GI145354433 
COG category[R] General function prediction only 
COG ID[COG5141] PHD zinc finger-containing protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.143202 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCGG CACCCGCGCT GCCCGCTGCC GACGTGTGCG CTGTCTGTTT GGCCATCCCC 
GAGCAAAGAG GGAGGTTAGA CTCGTGCTGT CATCTCTTCT GCGTCCCGTG CATCGTGCGC
TGGGCTTCGA TTGAGACGAA ATGTCCCCTG TGCAAGGAAA GGTTCACAAA GATGACGCCC
GAGGACGCGT CGACGAGCGC GCGCGCGGGG CCGGTGATGG AGTTTCGAGA GACGAATCAA
GGCGACGAAC GCCCGGACGA GGCGGAGGAA GAATCGGAGG ACGAGGCGGA GCGGTACTTT
TGCGACGTGT GCCGACGCGG CGACGACGAG GCGTCGCTGT TGCTGTGCGA CGCGTGCGAC
ATCGGGGCGC ACACGTTTTG CGTCGGACTC GAGTCCGTGC CGCGTGGAAG ATGGTTTTGC
GAACTATGCC GAGGAATGGA GGGAGAGTTC GCGGGAGCGC GCGATGGTGG GTCGAGTCGA
CGCCGACGAC GAGCCGAGGC GGAGTCGCGC GCTCGGGACA TTTTGTTACG GTCGGATGGC
GGCGTCGAAG CCGTGCGGAC GAGAGAGCGC AGGGCGGATC GCGCGCGTCG CGCGGTGAGA
GCGCGTCAAC GTGGGGAAAT ACGTCGACGC GGGGGCGGCG GCGAGCGATC GGTGATGCGC
AGTCGCTCGG GCGGCGGCGA CGACGCTCGA GTTGAGCAGA TTTCGAGAGT TCACGTACTG
AGAGAAGCCT GGGAAATGTT ACAAAGCGGG GAAGTCGAGT TTCCGGGCTG GACGCGGATA
CAAAACACAC CTTCGTCGCC TCGCTCGTCG GTGCCGTCGG CACCGCGTCG GGTAGGATCG
AACACGGATG AAGACGGACC GAAGGACGTC GTGGACGAAG CGTGGGACGT TCTTGAAAAG
GCGATGCACG CAGACGACAA GAAAAAGAAG CGGGCGTCAA CATCGGACAA AGCATCAAAC
GCAGTCGCTA GCACCAGTGC ACCGAAACTC AAACGTCCAT CCATTCGAAG CGAGCCTCCC
GGTTGGAGCA TGTCGGCTAC GGCGTGGAAA CCGCGCGAGG AGACGGCGCG AGCGACGCCC
GTTCCGCAAC GGGCCGCACC GTCGCACTCT CACACCTCTC GGCCACTGCC CGCCGCATCA
TCACCGGATA AATCACTCAA ATTCGCCATC GCCGATCGCG TCAAAGCCGT CTTGCGTCCG
CTCTACGCCG CCGGAGACGT CACGAAGGAT GAATATCGGC GTATTTGCAA ATGCGCCACG
AGCGAAGCTC TCGCTTCGAA CGCCACCGAC GATGCGTCCA TAGCACACAT AGTCCATCGC
TTACGCAAGC GGTGA
 
Protein sequence
MTPAPALPAA DVCAVCLAIP EQRGRLDSCC HLFCVPCIVR WASIETKCPL CKERFTKMTP 
EDASTSARAG PVMEFRETNQ GDERPDEAEE ESEDEAERYF CDVCRRGDDE ASLLLCDACD
IGAHTFCVGL ESVPRGRWFC ELCRGMEGEF AGARDGGSSR RRRRAEAESR ARDILLRSDG
GVEAVRTRER RADRARRAVR ARQRGEIRRR GGGGERSVMR SRSGGGDDAR VEQISRVHVL
REAWEMLQSG EVEFPGWTRI QNTPSSPRSS VPSAPRRVGS NTDEDGPKDV VDEAWDVLEK
AMHADDKKKK RASTSDKASN AVASTSAPKL KRPSIRSEPP GWSMSATAWK PREETARATP
VPQRAAPSHS HTSRPLPAAS SPDKSLKFAI ADRVKAVLRP LYAAGDVTKD EYRRICKCAT
SEALASNATD DASIAHIVHR LRKR