Gene Information |
Locus tag | OSTLU_19271 |
Symbol | HXA3502 |
ID | 5006916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009375 |
Strand | - |
Start bp | 278515 |
End bp | 280161 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | |
GC content | 54% |
IMG OID | 640422337 |
Product | predicted protein |
Protein accession | XP_001422948 |
Protein GI | 145357484 |
COG category | [B] Chromatin structure and dynamics |
COG ID | [COG5114] Histone acetyltransferase complex SAGA/ADA, subunit ADA2 |
TIGRFAM ID | |
|
|
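The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration under stated assumptions: coordinates are 1-based and inclusive, the gene is unspliced (as the record states), and the gene length includes one trailing stop codon. The helper names `span_length` and `coding_codons` are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 278515
end_bp = 280161
gene_length_bp = 1647
protein_length_aa = 548


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp):
    """Amino-acid count for an unspliced CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for the values in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coding_codons(gene_length_bp) == protein_length_aa
```

If either assertion failed for another record, the likely explanations would be a spliced gene, a partial CDS, or a different coordinate convention.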
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0114099 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAGCG CGCTCGTGCC GAAACGGCGA CGGGTGGCGA CGGAAAACGC GATGACGAAG CTGAGTGGGA ACGGGGAGTC GTGCGCACTG TTTAACTGTA ACTATTGCCA AAAGGACATC TCGAACGTGG TGCGCGTACG GTGCGCGGAG TGCGCAAACG TGGATCTGTG CACGGAGTGC TTCGCGGTCG GCGTGGAGCC GCACCCGCAC AAGGCGTATC ATCAGTATCA CGTCATCGAC AACATGTCGT TTCCGCTGTT CACGCGAGAT TGGGGGGCTG ACGAAGAGTT GTTATTGCTG GAGGCAGTGG AGATGTTCGG GTTGGGGAAC TGGACCGAGG TGAGCGAACA CGTCGGGACG AAGACGCGCG CGCAGTGTCA CGCGCACTAT TTTGAAGTCT ACGTCAAGTC TCCTTGCGCG CCGTTACCGG ATATGTCGAA GATTTTAGGA AAAGGCGTCG CGCGTATGAC ATCAGACGAG CTCAAAGCGG AGGCGGAGCA AAAGGCGAAC GAAAATAAGG ATGTGGAGGA GGAGGAGAAG CTTCTCGAAT CGCTTGCTAA CCCGAACGCA GTGAAGACGG AGGGCAACGT GCAGGAACTC ACAGGTTACA ACATCAAGCG CAATGAGTTC GATCCCGAAT ACGACATGGA TGCCGAACTT CCCCTGGCGG AGATGGAATT TCGCGAAAAC GACACCGAAG AAGACGTCCA GATGAAGCTG CGAATGATTG AAATCTACAA CAGCCGGCTT CAAGAACGAG CGAGAAGAAA ACAATTCATT CTCGAACGCA ATCTGCTGAA CGTGAAAAAG CAACAAAACG TGGAAAAGAA GCGTTCACAA TACGAGCGCG ACTTACACGG CACCATGCGT ATATTTGCAC GCTTTCTCAC GAGTACCGAG TACGACGTCT TGCTCGAGGG TCTCGCCGCG GAGCACCGAA TCCGAACCCG CATCACCGAA CTGAAAGAGT ACAGACGCAA TGGTATTCAT ACCATCGCAG AGGGCGAGGA TTACGATTTG GAGAAGCGTC GTCGTGAGAC GGAGTTCGCT CGTCTACACG CGATCGAGCA TCCAACTAGC AAGAACATAG CCAGAGCGAA CAAGTTCATC GTGCGAGATG CCACACAAAT CAATGAGCAG TTGACTCGCA TGAACGACGA AGACAAGACG GTATCCGTGA TCCCGACGCC TCGTACGTCG AGCTTAGGTC CTCGCCGTCG AATGTACTTG TCACTTGATC TCGCCGATCT TCCAGGCGTA GACCTTTTGA ACGACGACGA AAAGGAGTTG TGCAGGAGCT GTCGCTTATT GCCTGTGCAG TATCTCTCGA TGAAGGTGGA GTTGATGCGA GAGGGTCTCA AGTCCGAAAA GCCGCTCAAC AGAAATCACG TTCGGAATAT GTTCAAAGTA GACCCACTCA AGGCTATTCG TGTGTATGAG TTACTCCTAC AGCACGGCTG GGTGTTGGAA GACGGCTTCG TGAACCCAGG TGAGGATGAA GACTCCGAAC CTGCGCCGAA AAAGTCAGCC AGCGCAGACG AGGAGGAAGA CGAGGAGGAC GATGAAGTAG ATTACGAAAC CGACGATAAC GACGAAGACG AGGACGAGGA AGACGACGAG GAAGAGGATA GCGAGGAAGA CGATTAG
|
Protein sequence | MASALVPKRR RVATENAMTK LSGNGESCAL FNCNYCQKDI SNVVRVRCAE CANVDLCTEC FAVGVEPHPH KAYHQYHVID NMSFPLFTRD WGADEELLLL EAVEMFGLGN WTEVSEHVGT KTRAQCHAHY FEVYVKSPCA PLPDMSKILG KGVARMTSDE LKAEAEQKAN ENKDVEEEEK LLESLANPNA VKTEGNVQEL TGYNIKRNEF DPEYDMDAEL PLAEMEFREN DTEEDVQMKL RMIEIYNSRL QERARRKQFI LERNLLNVKK QQNVEKKRSQ YERDLHGTMR IFARFLTSTE YDVLLEGLAA EHRIRTRITE LKEYRRNGIH TIAEGEDYDL EKRRRETEFA RLHAIEHPTS KNIARANKFI VRDATQINEQ LTRMNDEDKT VSVIPTPRTS SLGPRRRMYL SLDLADLPGV DLLNDDEKEL CRSCRLLPVQ YLSMKVELMR EGLKSEKPLN RNHVRNMFKV DPLKAIRVYE LLLQHGWVLE DGFVNPGEDE DSEPAPKKSA SADEEEDEED DEVDYETDDN DEDEDEEDDE EEDSEEDD
|
| |
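The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as shown here, in whitespace-separated blocks, and that GC content is simply the percentage of G and C bases. The record does not state how its 54% value was computed or rounded, so a small difference would not necessarily indicate an error. The function names are hypothetical.

```python
def clean_sequence(raw):
    """Remove all whitespace (block separators, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw):
    """Percentage of G and C bases in a nucleotide sequence (assumed definition)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full "Gene sequence" field in place of this short prefix.
prefix = "ATGGCGAGCG CGCTCGTGCC"
print(round(gc_percent(prefix), 1))
```

Applying `len(clean_sequence(...))` to the full gene sequence field gives a base count that can be compared with the Gene Length field in the same way.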