Gene OSTLU_19271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_19271 
SymbolHXA3502 
ID5006916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009375 
Strand
Start bp278515 
End bp280161 
Gene Length1647 bp 
Protein Length548 aa 
Translation table 
GC content54% 
IMG OID640422337 
Productpredicted protein 
Protein accessionXP_001422948 
Protein GI145357484 
COG category[B] Chromatin structure and dynamics 
COG ID[COG5114] Histone acetyltransferase complex SAGA/ADA, subunit ADA2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0114099 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGCG CGCTCGTGCC GAAACGGCGA CGGGTGGCGA CGGAAAACGC GATGACGAAG 
CTGAGTGGGA ACGGGGAGTC GTGCGCACTG TTTAACTGTA ACTATTGCCA AAAGGACATC
TCGAACGTGG TGCGCGTACG GTGCGCGGAG TGCGCAAACG TGGATCTGTG CACGGAGTGC
TTCGCGGTCG GCGTGGAGCC GCACCCGCAC AAGGCGTATC ATCAGTATCA CGTCATCGAC
AACATGTCGT TTCCGCTGTT CACGCGAGAT TGGGGGGCTG ACGAAGAGTT GTTATTGCTG
GAGGCAGTGG AGATGTTCGG GTTGGGGAAC TGGACCGAGG TGAGCGAACA CGTCGGGACG
AAGACGCGCG CGCAGTGTCA CGCGCACTAT TTTGAAGTCT ACGTCAAGTC TCCTTGCGCG
CCGTTACCGG ATATGTCGAA GATTTTAGGA AAAGGCGTCG CGCGTATGAC ATCAGACGAG
CTCAAAGCGG AGGCGGAGCA AAAGGCGAAC GAAAATAAGG ATGTGGAGGA GGAGGAGAAG
CTTCTCGAAT CGCTTGCTAA CCCGAACGCA GTGAAGACGG AGGGCAACGT GCAGGAACTC
ACAGGTTACA ACATCAAGCG CAATGAGTTC GATCCCGAAT ACGACATGGA TGCCGAACTT
CCCCTGGCGG AGATGGAATT TCGCGAAAAC GACACCGAAG AAGACGTCCA GATGAAGCTG
CGAATGATTG AAATCTACAA CAGCCGGCTT CAAGAACGAG CGAGAAGAAA ACAATTCATT
CTCGAACGCA ATCTGCTGAA CGTGAAAAAG CAACAAAACG TGGAAAAGAA GCGTTCACAA
TACGAGCGCG ACTTACACGG CACCATGCGT ATATTTGCAC GCTTTCTCAC GAGTACCGAG
TACGACGTCT TGCTCGAGGG TCTCGCCGCG GAGCACCGAA TCCGAACCCG CATCACCGAA
CTGAAAGAGT ACAGACGCAA TGGTATTCAT ACCATCGCAG AGGGCGAGGA TTACGATTTG
GAGAAGCGTC GTCGTGAGAC GGAGTTCGCT CGTCTACACG CGATCGAGCA TCCAACTAGC
AAGAACATAG CCAGAGCGAA CAAGTTCATC GTGCGAGATG CCACACAAAT CAATGAGCAG
TTGACTCGCA TGAACGACGA AGACAAGACG GTATCCGTGA TCCCGACGCC TCGTACGTCG
AGCTTAGGTC CTCGCCGTCG AATGTACTTG TCACTTGATC TCGCCGATCT TCCAGGCGTA
GACCTTTTGA ACGACGACGA AAAGGAGTTG TGCAGGAGCT GTCGCTTATT GCCTGTGCAG
TATCTCTCGA TGAAGGTGGA GTTGATGCGA GAGGGTCTCA AGTCCGAAAA GCCGCTCAAC
AGAAATCACG TTCGGAATAT GTTCAAAGTA GACCCACTCA AGGCTATTCG TGTGTATGAG
TTACTCCTAC AGCACGGCTG GGTGTTGGAA GACGGCTTCG TGAACCCAGG TGAGGATGAA
GACTCCGAAC CTGCGCCGAA AAAGTCAGCC AGCGCAGACG AGGAGGAAGA CGAGGAGGAC
GATGAAGTAG ATTACGAAAC CGACGATAAC GACGAAGACG AGGACGAGGA AGACGACGAG
GAAGAGGATA GCGAGGAAGA CGATTAG
 
Protein sequence
MASALVPKRR RVATENAMTK LSGNGESCAL FNCNYCQKDI SNVVRVRCAE CANVDLCTEC 
FAVGVEPHPH KAYHQYHVID NMSFPLFTRD WGADEELLLL EAVEMFGLGN WTEVSEHVGT
KTRAQCHAHY FEVYVKSPCA PLPDMSKILG KGVARMTSDE LKAEAEQKAN ENKDVEEEEK
LLESLANPNA VKTEGNVQEL TGYNIKRNEF DPEYDMDAEL PLAEMEFREN DTEEDVQMKL
RMIEIYNSRL QERARRKQFI LERNLLNVKK QQNVEKKRSQ YERDLHGTMR IFARFLTSTE
YDVLLEGLAA EHRIRTRITE LKEYRRNGIH TIAEGEDYDL EKRRRETEFA RLHAIEHPTS
KNIARANKFI VRDATQINEQ LTRMNDEDKT VSVIPTPRTS SLGPRRRMYL SLDLADLPGV
DLLNDDEKEL CRSCRLLPVQ YLSMKVELMR EGLKSEKPLN RNHVRNMFKV DPLKAIRVYE
LLLQHGWVLE DGFVNPGEDE DSEPAPKKSA SADEEEDEED DEVDYETDDN DEDEDEEDDE
EEDSEEDD