Gene OSTLU_16584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16584 
SymbolSDG3504 
ID5003491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp645079 
End bp646566 
Gene Length1488 bp 
Protein Length495 aa 
Translation table 
GC content59% 
IMG OID640418912 
Productpredicted protein 
Protein accessionXP_001419228 
Protein GI145349623 
COG category[R] General function prediction only 
COG ID[COG5141] PHD zinc finger-containing protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAAG TCGTGGGATA CGTGCGACCG ACGGTGCGAA ATAGGTATCG GATGAGCGAG 
GCTGAGGTGG TCGAACGGCG CCGGCGTGAG GCCATCGAAA ACGCCGGCGC CGACGAGACG
CCGAGCATCT TCAGGACGCC CAGCGTCGAA AAGTGCGATG TGTGTGATTC TGTTCGAGAG
TTTGACCAAG ATGTGCTCGT GCAGTGCGAT GAGTGCATGA TTTTGGTTCA CATGGGTTGC
TACGGCGTCA CGACTGCGCC GACGGGCGGG CGTTGGCTGT GTCGAGCGTG CGAACTCGGC
TTGCGCACGC CGCCGCGGTG CGCGCTGTGT CCGAACGTGG GTGGTGCGAT GAAACCAACG
TTGTGTGGGA GGTGGTGCCA CGTCGTATGC GCGTTGTGGG CGGAATGCAC GTTCGCGCAT
CCTGATGGCG TCGCGGAGCC CATCGAAGGC GTCAACATGG TTCCAGCGGA GAGTTTGAAA
GCAACGTGCG CGGTGTGCGA GCAAAGTTAC GGCGCGTGCG CGCAGTGCAT GGGTACGAAA
AAGTGTCAAA AAGCGTTTCA CGTGTACTGC GCGAGAGACG CGGAGTGTGG ATACATCGCG
CACTCGCGCA CGGTGGCGCA GCTGAAACAG GCGGGCATTC GCAAATTCAT CGTGGGTTAC
GAACAGCCTC TGCGAAACAC CGACACACTT TTGTTTCCGA GTTGTCCCGC GTGCGCAAAC
TGGCGAGGTC GCAAGCGCAA ACGGCGCGCG TCGACACCGA AGAAGCGAAC TCAGACACCG
AAGACAAGGC CAACTGTGGA TTCGCGCGAA GTCGAAGACA AAGACGAAGA CGCGAAACCA
CTCCAGTGCG CCAAGTTTGA CCCTTTAGGC GCGTACGCTC GCGCGTTGAC GGTGTCGCCA
AAGGATTCGA TACCATATCT CGTCACGGGC GCGCGAACGA GTCGCTTGGA ATCGTTCAGT
CTTCGAGCCG TCGCACTTGC CGATCCGCCG CGAAATCTGA ACGAGCGTTT CGAGCGCATG
AAGGCGACGA TTTCAGATCG CTTGACGCTG GGGAAATCGT ATATTCACGG CTATGGTTTA
TTCGCAAAAC GCGCGCACGC GCGAGGCGAG ATGATCATCG ATTACGTCGG CGAAATCGTG
CGTCCAGTCG TTGCCGATAT TCGCGAGCGC GACGTGTACG ACACCTGTTT CGGCAACGGG
ACGTACATCT TCGCGCTAGG CGGCGACGAT CAACCCGTGC GCTTAGACGC CACGTGTGCA
GGAAATCTCG CAAACTTGGC CAACCATTCG TGCGCACCGA ACGCGCATTC GAGACAAGTG
TACGCCGCGA ACGACAACCA CATTTGCTTA TTCGCGTCGC GAAACATCCA GCCCGGCGAG
GAAATTTTGT ACGAGTATAG ACTCGGCGCC GATCAGACGT TACGATGCAA CTGCGGCGCC
GCAAACTGTC GCGGCGTCGT CAACTTTACC GCCGAGCACC CGGCGTAG
 
Protein sequence
MDKVVGYVRP TVRNRYRMSE AEVVERRRRE AIENAGADET PSIFRTPSVE KCDVCDSVRE 
FDQDVLVQCD ECMILVHMGC YGVTTAPTGG RWLCRACELG LRTPPRCALC PNVGGAMKPT
LCGRWCHVVC ALWAECTFAH PDGVAEPIEG VNMVPAESLK ATCAVCEQSY GACAQCMGTK
KCQKAFHVYC ARDAECGYIA HSRTVAQLKQ AGIRKFIVGY EQPLRNTDTL LFPSCPACAN
WRGRKRKRRA STPKKRTQTP KTRPTVDSRE VEDKDEDAKP LQCAKFDPLG AYARALTVSP
KDSIPYLVTG ARTSRLESFS LRAVALADPP RNLNERFERM KATISDRLTL GKSYIHGYGL
FAKRAHARGE MIIDYVGEIV RPVVADIRER DVYDTCFGNG TYIFALGGDD QPVRLDATCA
GNLANLANHS CAPNAHSRQV YAANDNHICL FASRNIQPGE EILYEYRLGA DQTLRCNCGA
ANCRGVVNFT AEHPA