Gene OSTLU_51202 details

Gene Information

Locus tag: OSTLU_51202
Symbol: HXA3501
ID: 5004953
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009367
Strand:
Start bp: 248139
End bp: 249954
Gene Length: 1816 bp
Protein Length: 515 aa
Translation table:
GC content: 55%
IMG OID: 640420374
Product: predicted protein
Protein accession: XP_001420946
Protein GI: 145353279
COG category: [B] Chromatin structure and dynamics
COG ID: [COG5114] Histone acetyltransferase complex SAGA/ADA, subunit ADA2
TIGRFAM ID:
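
The start and end coordinates listed above can be checked against the reported gene length. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the values in this record; the function name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates.

    Assumption: both coordinates count the first base as position 1 and
    the end position is included in the feature.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(248139, 249954))  # 1816
```

A 515 aa protein needs 515 × 3 = 1545 coding bases (1548 with a stop codon), which is fewer than the 1816 bp gene length. That difference is consistent with the gene being flagged as spliced.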


Plasmid Coverage information

Num covering plasmid clones: 59
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0345215
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGACGACGAC GACGACGACG ACGACGACGA CGACGACGAC GGGCGCGAGT CGCGCGTGAA 
CGCACGCGAG TGAGCGCGCG CACGACCGTC GAGGACGCGC GCGACGGGAA CGCGGACGCG
GAGGTAACGC GACGCATCGG CATCGGCGTT CGGCATCGCA TTCGGCGCCA TGGCGAGCGC
GCTCGTGCCG AAACGGCGAC GGGTGGCGAC GGAAAACGCG ATGACGAAGC TGAGTGGGAA
CGGGGAGTCG TGCGCACTGT TTAACTGTAA CTATTGCCAA AAGGACATCT CGAACGTGGT
GCGCGTACGG TGCGCGGAGT GCGCAAACGT GGATCTGTGC ACGGAGTGCT TCGCGGTCGG
CGTGGAGCCG CACCCGCACA AGGCGTATCA TCAGTATCAC GTCATCGACA ACATGTCGTT
TCCGCTGTTC ACGCGAGATT GGGGGGCTGA CGAAGAGTTG TTATTGCTGG AGGCAGTGGA
GATGTTCGGG TTGGGGAACT GGACCGAGGT GAGCGAACAC GTCGGGACGA AGACGCGCGC
GCAGTGTCAC GCGCACTATT TTGAAGTCTA CGTCAAGTCT CCTTGCGCGC CGTTACCGGA
TATGTCGAAG ATTTTAGGAA AAGGCGTCGC GCGTATGACA TCAGACGAGC TCAAAGCGGA
GGCGGAGCAA AAGGCGAACG AAAATAAGGA TGTGGAGGAG GAGGAGAAGC TTCTCGAATC
GCTTGCTAAC CCGAACGCAG TGAAGACGGA GGGCAACGTG CAGGAACTCA CAGGTTACAA
CATCAAGCGC AATGAGTTCG ATCCCGAATA CGACATGGAT GCCGAACTTC CCCTGGCGGA
GATGGAATTT CGCGAAAACG ACACCGAAGA AGACGTCCAG ATGAAGCTGC GAATGATTGA
AATCTACAAC AGCCGGCTTC AAGAACGAGC GAGAAGAAAA CAATTCATTC TCGAACGCAA
TCTGCTGAAC GTGAAAAAGC AACAAAACGT GGAAAAGAAG CGTTCACAAT ACGAGCGCGA
CTTACACGGC ACCATGCGTA TATTTGCACG CTTTCTCACG AGTACCGAGT ACGACGTCTT
GCTCGAGGGT CTCGCCGCGG AGCACCGAAT CCGAACCCGC ATCACCGAAC TGAAAGAGTA
CAGACGCAAT GGTATTCATA CCATCGCAGA GGGCGAGGAT TACGATTTGG AGAAGCGTCG
TCGTGAGACG GAGTTCGCTC GTCTACACGC GATCGAGCAT CCAACTAGCA AGAACATAGC
CAGAGCGAAC AAGTTCATCG TGCGAGATGC CACACAAATC AATGAGCAGT TGACTCGCAT
GAACGACGAA GACAAGACGG TATCCGTGAT CCCGACGCCT CGTACGTCGA GCTTAGGTCC
TCGCCGTCGA ATGTACTTGT CACTTGATCT CGCCGATCTT CCAGGCGTAG ACCTTTTGAA
CGACGACGAA AAGGAGTTGT GCAGGAGCTG TCGCTTATTG CCTGTGCAGT ATCTCTCGAT
GAAGGTGGAG TTGATGCGAG AGGGTCTCAA GTCCGAAAAG CCGCTCAACA GAAATCACGT
TCGGAATATG TTCAAAGTAG ACCCACTCAA GGCTATTCGT GTGTATGAGT TACTCCTACA
GCACGGCTGG GTGTTGGAAG ACGGCTTCGT GAACCCAGGT GAGGATGAAG ACTCCGAACC
TGCGCCGAAA AAGTCAGCCA GCGCAGACGA GGAGGAAGAC GAGGAGGACG ATGAAGTAGA
TTACGAAACC GACGATAACG ACGAAGACGA GGACGAGGAA GACGACGAGG AAGAGGATAG
CGAGGAAGAC GATTAG
 
Protein sequence
MASALVPKRR RVATENAMTK LSGNGESCAL FNCNYCQKDI SNVVRVRCAE CANVDLCTEC 
FAVGVEPHPH KAYHQYHVID NMSFPLFTRD WGADEELLLL EAVEMFGLGN WTEVSEHVGT
KTRAQCHAHY FEVYVKSPCA PLPDMSKILG KGVARMTSDE LKAEAEQKAN ENKDVEEEEK
LLESLANPNA VKTEGNVQEL TGYNIKRNEF DPEYDMDAEL PLAEMEFREN DTEEDVQMKL
RMIEIYNSRL QERARRKQFI LERNLLNVKK QQNVEKKRSQ YERDLHGTMR IFARFLTSTE
YDVLLEGLAA EHRIRTRITE LKEYRRNGIH TIAEGEDYDL EKRRRETEFA RLHAIEHPTS
KNIARANNLG PRRRMYLSLD LADLPGVDLL NDDEKELCRS CRLLPVQYLS MKVELMREGL
KSEKPLNRNH VRNMFKVDPL KAIRVYELLL QHGWVLEDGF VNPGEDEDSE PAPKKSASAD
EEEDEEDDEV DYETDDNDED EDEEDDEEED SEEDD
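
The sequences above are displayed in groups of ten characters across several lines, so the spacing has to be removed before the text can be used for length or composition checks. The following is a minimal sketch of that cleanup together with a simple GC fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the final line of the gene sequence as shown in this record.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # so spaces between groups and line breaks are both removed.
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Final line of the gene sequence as displayed in the record.
last_line = "CGAGGAAGAC GATTAG"
seq = clean_sequence(last_line)
print(len(seq))          # 16
print(gc_fraction(seq))  # 0.5
```

Applied to the complete gene sequence block, the resulting length and GC fraction can be compared with the Gene Length and GC content fields of the record.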