Gene Information |
Locus tag | OSTLU_17735 |
Symbol | CHB3501 |
ID | 5005065 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | + |
Start bp | 39424 |
End bp | 42413 |
Gene Length | 2990 bp |
Protein Length | 902 aa |
Translation table | |
GC content | 60% |
IMG OID | 640420486 |
Product | predicted protein |
Protein accession | XP_001420888 |
Protein GI | 145353152 |
COG category | [B] Chromatin structure and dynamics [K] Transcription |
COG ID | [COG5259] RSC chromatin remodeling complex subunit RSC8 |
TIGRFAM ID | |
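
The Gene Length field is consistent with the Start bp and End bp fields if both coordinates are read as 1-based and inclusive. That coordinate convention is an assumption here (the record does not state it), and the function name `span_length` is hypothetical. A minimal sketch of the arithmetic:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are both 1-based and inclusive.

    The inclusive convention is an assumption; the record does not state it.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the Start bp and End bp fields of the record.
gene_length = span_length(39424, 42413)
print(gene_length)  # 2990, the value in the Gene Length field
```

For comparison, a 902 aa protein needs 902 codons plus a stop codon, that is 2709 bases, which is shorter than the 2990 bp span and fits the record's statement that the gene is spliced.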
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.31577 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.344998 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGCGA CGTTGGCGGC GTCCGCGGAT GAGCCAGGGG TGACGCACGT CCTACGCACG GATGATTCGG TGAATGACGA CGAGATCGAG GACGGCGCGT TGTGGCGAAT CACGGAGACG AGAGTGGCGA CGTCGAGTGG AAAGACGGAG CAAAGGTTAC ACTTTTGGTT TTATCCGGAT TCGTACGACG CCTGGTACGC GGATTCGTCG ATTAGTGGTA AGGGGAAAGC GCCGTGGGCG CCGGAGACGA GCGTGATTTT GAGAAATAAA GCGGAGAAGA GAAACGACCT GCCGCACAAC GTGCGCGCGC GATGGTTGCG CGACAGTCAC ACGTTTAACG AGTGGATGAA CGAGCTCGAT TACGAGTACG ACGTGACGCG CGAAATGTTG AGCGCGCCGC GGCACGCGTG GGATCCATCG AACGCGCCAC AAAACAAACG CGCGCGTGCG CCCGAAAACT TTGAGGGTGA TGCCGAAATT CTCGCTCCAG GAGAAGAGGA ACGCGTGCCG TGGGGGTTTG CCGTCACGCG TCGTCGCGTC GTTCGCGCGC ATCCAGTCGT CGCGGCGGCG GCGAGTCAAG ACGACGGCGC ACTCGTCTTG CAAGACGACG AAGAACTAAT CACGAAAGAG TTCATGATTG CGAAAACGCT ACGCATGCAG AATATCTCGG TGGATCAGTT GCCGTGGAAT GCGACGCCGT CAATGCGCAA AGCCGAGCGT AAAGCCGAGC CTTTAACGGA GTACAGAGTG CCGACGCACA GCGCTTGGTT CAAATGGGGC GAAGTTCACG CCATCGAAAG ACGTGCGTTG CCAGAATTCT TCGACGATGA TGACACGTGT CAAAAGTACA TCGCGTGTCG AAACGAAATC ATGAATCAGT TTCGCTTCAA AGGCCAAGAG GTTACGTTGC ATGAAGTGTC TTCGTCAAGA ACGACAAAAA ATATTGTCGA CGCCGCCGCG CATCAGAGAA TTTTCTCATT TCTCGAGCAG TGGGGATTGA TTAATTGGCA ATTCACATCC GGACGTGATG TGATTGACTT GAAACAAAAA CCTCTCGCCG CGTGGCGTCG CATCGTCACT GGCGAGGATG GCGCGGCGCG TGTCGAGAAG ACGGATCCCT TAGCCGCCTT CAAAGGGACG TTGTTCGAGT TTTCGAAATG TCGTGCGACG ACTGCGAGTG GTTTACACCC GCTCGAACCG CAGTCGAGAT ATGCGCCGTC TTCGGAAACG CAACTCGAGC GTCAATCTTT GGATGCGTTG TTTGCCTCTC ACGACGCGCT GTCAAAGCGT GGGGTCGACG TCAAGTTTGC GTGCAACGCG TGCGGCGCTG ATTTAAAGAG CACTGGCGTT TTTTACCACG CGTTTCTCAC GCGTGATTTT GATTTGTGCC CATCGTGCTT TTCCAAAGGC GTGTACCCGC ACGGCCAAGC GAGCGGCGAC TTTGTCAAGG CAATGTACCC AGACTTTCAC GCCGAAGCCG TCTCGGCGGA CGAAATCGTC GACGACGCCG AGTGGACGCC GCAGGAGGTC GCCGCCCTGC TCGATGCAAT TTCGCAGTCG AATGAGTTAA ATTGGAACGA TATTGCTTCT GCGGTCGGGA CAAAGAGCGA GGATGAGTGC TTGAAGCACT TCGCGCGCAT GCCCATCGAA GACGCCGCGA TTGAAAACAT AGAGCGCGAG TTACTTGTGC CGCGCGGCGC CATCATCGAT GATGAGGGAG CCAAGATCCT CGATCCTGTG CCTTTCTCAT TCGCCCCAAA CCCCACGATG GCTCAGCTCG AGTTTTTGGT GAGCATGATC 
TCCCCTCGCG TCGCCGCTGC GTCGGCGAAA GCCGCGCTGA CGAAAATCGC GCTCGGCGGG TCGCTCGACG CCGCCGACCT CAACGTCGAC GGTCTCGCCG CTGCCGCCAT TCAAGCCAAG ATCCTGGCCC AAGACGAAGA ACACGAAGTT CATCGCATCA TCGCCAGTGC TCTGGACGTC TTGCTGAAAA AGCTCGAAAT TAAGCTCAGA TTCCTCGGCC GACTGGTCGA CGACGAGCCG GAGACGGCGA GCCGTCTCGC CAAGCTTCGA GAGGAGTCCG CGCGCAATCG AACGAACGAT CTGTACACGC GCGACGTGCA ATCCGCGCGA CACAAGGAAC ACATAGCCAC GATTCATCGT TTACGACAGC AGCTCGCCGG TCTGTCGTCT TAGTCGCCTC GCGCTTGCAG CAACAAAGCC GTAAGCCGTC ACAACTCGCC TCGCCTCGAA ACATGACCAC CCGGCAGTTG ACGATCCAAA TCGTCTCCGA CGTCGTGTGA CCTTGGTGTT ACGTCGGCGT GAAGAACCTC GACCGCGCGC GCGCCGCGCT TCGTCCCGAC GTCGCGTCCT CCCGGGCCGT TTGGCGACCA TTTCAGCTCG TGAGTTCTCA TGATTGGCGG CGAATGGGCG CCGACGTCGC GCGCGCGGGC GTGAACAAGC GCTCGTGGTA CAACGAACGA TTCGGCGCCG ATACGGTGGC GACGTTCGAG CCCAGGCTCG CGAGCGCGTT CGCGAAGGCG GGGATCGAGG GCGCGTACAC GCTCGACGGC AACACCGGCG ACACGAGACC TGCGCACCGC GTCGCGGCTT ACGCCGAGGA AACGCACGGC CCGGCGGCGC AGGACGCCTT CATGCGCGCC ATGTTCCACA GATACTTCAT CGAAGCGCTC GCGCCGTGCG ACGAAGCCGT GATGAGAGAC GCGGCGAGCG CCGCGGGTTT GGACGAAGCG GCGGTTTCCA AAGTGCTCGC CGACGGCGAG GCGTCGCCGT TCGAGACGGT CGTGGAGGAG CAAATGTCGG CGACGCGCGC GCGCGTTCGC GGCGTGCCGC ACTTCATCAT CACGTGCGAC GGCGACGGTG CGTCGCGAAA GATTGAGATC GGCGGCGCGC AACCGCCCGA GGCGTTTTTG GACGCGTTCG CCGAGCTTTT GGATTTGGAC GCCGACGACG TCGCAGCGAC GAAGTCCTAA
|
Protein sequence | MRATLAASAD EPGVTHVLRT DDSVNDDEIE DGALWRITET RVATSSGKTE QRLHFWFYPD SYDAWYADSS ISGKGKAPWA PETSVILRNK AEKRNDLPHN VRARWLRDSH TFNEWMNELD YEYDVTREML SAPRHAWDPS NAPQNKRARA PENFEGDAEI LAPGEEERVP WGFAVTRRRV VRAHPVVAAA ASQDDGALVL QDDEELITKE FMIAKTLRMQ NISVDQLPWN ATPSMRKAER KAEPLTEYRV PTHSAWFKWG EVHAIERRAL PEFFDDDDTC QKYIACRNEI MNQFRFKGQE VTLHEVSSSR TTKNIVDAAA HQRIFSFLEQ WGLINWQFTS GRDVIDLKQK PLAAWRRIVT GEDGAARVEK TDPLAAFKGT LFEFSKCRAT TASGLHPLEP QSRYAPSSET QLERQSLDAL FASHDALSKR GVDVKFACNA CGADLKSTGV FYHAFLTRDF DLCPSCFSKG VYPHGQASGD FVKAMYPDFH AEAVSADEIV DDAEWTPQEV AALLDAISQS NELNWNDIAS AVGTKSEDEC LKHFARMPIE DAAIENIERE LLVPRGAIID DEGAKILDPV PFSFAPNPTM AQLEFLVSMI SPRVAAASAK AALTKIALGG SLDAADLNVD GLAAAAIQAK ILAQDEEHEV HRIIASALDV LLKKLEIKLR FLGRLVDDEP ETASRLAKLR EESARNRTND LYTRDLVSSH DWRRMGADVA RAGVNKRSWY NERFGADTVA TFEPRLASAF AKAGIEGAYT LDGNTGDTRP AHRVAAYAEE THGPAAQDAF MRAMFHRYFI EALAPCDEAV MRDAASAAGL DEAAVSKVLA DGEASPFETV VEEQMSATRA RVRGVPHFII TCDGDGASRK IEIGGAQPPE AFLDAFAELL DLDADDVAAT KS
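
The GC content field (60%) could in principle be rechecked from the Gene sequence cell, which is printed in space-separated blocks of 10 bases. The record does not say exactly how the database computes the value, so the following is only a sketch under the assumption that it is the fraction of G and C bases over the whole gene sequence. The function name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide string.

    Whitespace between the 10-base blocks is ignored. Treating GC content
    as (G + C) / total length is an assumption about the database's method.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Illustration only: the first three blocks of the gene sequence above.
example = "ATGCGGGCGA CGTTGGCGGC GTCCGCGGAT"
print(f"{gc_content(example):.1%}")
```

To check the full value, paste the complete Gene sequence cell (both lines of it) into `gc_content` in place of `example`.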