Gene OSTLU_34647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_34647 
SymbolHDA3509 
ID5003712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp26139 
End bp27338 
Gene Length1200 bp 
Protein Length399 aa 
Translation table 
GC content62% 
IMG OID640419133 
Productpredicted protein 
Protein accessionXP_001419469 
Protein GI145350128 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGTC CTCGCGTCCA TCTCACTCGC GTCGCGCCCG CGTCACCGCG CGCGTCGCGT 
CGCGCCGAGC GTCACCCGTC TCGCGCGCGA CGAATCGCGT CCGCGTCCGC GTCCGCGTCC
GCGTCCTCGT CCCCGTCGCT GCTCTTTGCC ACCTCCGCCG CGCTGGAGCA CGTGCAGTTG
GGGCACCCGG AGAGCAACGC TCGCGTGCCC GCCATCTTAG ACGCCCTCGA GGCGGGCGCT
CTGACCCCCG CCGCGAGACC GGGTGAGGTG TTGGAGATCA CAGACGTCGT GCCGGCGACG
AAAAAGGCGC TCGAGCGTGT GCACGCGAAG AATTACTGCA ACGGGCTCGA GTTGTTGTGC
GCCACGCGAG CGCCGACGAA TTTAGACACC GCACCGACGT ACTGCACGCC GTCGAGCTTT
CAAGACGTGA TGCTAGGCGT CGGTGCGGCG ACGCGATTGG TGGATGAGGT GATCGATCGC
GCGAAGGAGA CGAAAGAAAA GGCGCCGAGC GCGTTTGGTT TGATACGCCC ACCGGGACAC
CACGCGGTGC CGCGAGGGGC GATGGGATTT TGTTTGGTCG GCACCGCGGC GGCGGCGGCG
AGACACGCGC AGCTTAGAGG GCATAAAAAA GTGCTCATTT TCGATTACGA CGTGCACCAC
GGTAACGGTA CGAATGATAT TTTCCGAGAC GACGATTCCG TGCTGTTCAT CTCCACGCAC
GAGGACGGGA GCTATCCGGG GACGGGGAAG ATTACGGATA TGGGCGAAGG CGATGGTCTG
GGTGCGACTA TTAATATACC TTTGCCTCCG GGAAGCGGTG ATAAGGCGGT GTTGAGCGCT
TTGGAAGAAA TCGTCGTTCC GGCGGCGGCG CGGTTCCAAC CCGACTTCAT CATCGTCAGT
GCCGGCTACG ACGCGCACTG GCGAGATCCT CTCGCTGGGT TGACGTTTCG CACGGGAACG
TACCATCGTC TTTGCACAAA GTTGAAAGAA CTGGCGAATG AGATGTGCGG CGGGAAAATC
GTGTTCTTAT TAGAGGGCGG GTATGATTTG GTGGGTTTGA GCGAAGGCGT CGCGGATTCT
TTCCGCGCGC TCTTGGGCGA CGCCTCAACG GACGTCGGCG AGATTCCCGG GCTGAGAGAC
GAGCCCGATG ATAAAGTGCG AAATGTTCTC ACCGAAGTCA AGGCGATGCA TCAAGTTTAG
 
Protein sequence
MARPRVHLTR VAPASPRASR RAERHPSRAR RIASASASAS ASSSPSLLFA TSAALEHVQL 
GHPESNARVP AILDALEAGA LTPAARPGEV LEITDVVPAT KKALERVHAK NYCNGLELLC
ATRAPTNLDT APTYCTPSSF QDVMLGVGAA TRLVDEVIDR AKETKEKAPS AFGLIRPPGH
HAVPRGAMGF CLVGTAAAAA RHAQLRGHKK VLIFDYDVHH GNGTNDIFRD DDSVLFISTH
EDGSYPGTGK ITDMGEGDGL GATINIPLPP GSGDKAVLSA LEEIVVPAAA RFQPDFIIVS
AGYDAHWRDP LAGLTFRTGT YHRLCTKLKE LANEMCGGKI VFLLEGGYDL VGLSEGVADS
FRALLGDAST DVGEIPGLRD EPDDKVRNVL TEVKAMHQV