Gene Information |
Locus tag | OSTLU_93625 |
Symbol | |
ID | 5005186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | - |
Start bp | 344382 |
End bp | 346211 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | |
GC content | 56% |
IMG OID | 640420607 |
Product | predicted protein |
Protein accession | XP_001421121 |
Protein GI | 145353653 |
COG category | [B] Chromatin structure and dynamics |
COG ID | [COG5034] Chromatin remodeling protein, contains PhD zinc finger |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00774306 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGTTGGAT GTGATGATTG CGGTGACTGG TATCACTTAA AATGTATCAA TGTGACGCCT ACTATGGCAA AGACGATGCA CAATTACATC TGTCCGCCAT GCATCGCGAA GAGTGGCAAG GCGAGCGCAC TCTCGTTGGA TACGTATCGC TCGGTGCATC GCACCAATCG CCCAAACGTC ATGCTTTTGC GAGAACTCCT CGGTGAAGCG CAGCGATTTC CTGGAGAAGT GCCAGAAGAA GCCGTGTTGA ATCAAGTCAT CAATACTCAC GACGCGTGGC GATTGGAAGT ACGAAAGACT TTGCATAGAA AGAGTATGAA GGAAGCACAC GACAACGCCA TCAGTCAAGC TTCGAAGATT GCCCAAGAAG CAGCGATGAG AGAATCTGAG GAACGCACAC GACGAATCGC CGAAGCGTCG CTCGCGGCGA CTGCAGAGAA CCCCAATCCT CCAGCTTCTC TCACGCCATT ACAGCAAGTC ATGCTGATGG GTCAAGCTGT CACGTTCGCC GCAACGCAGC GCACGCACGC GTTGAACATT TCGCCGTGTT CGCAGGAGAT TTCAGATGTC GTCACAAAGC AGCTCGGGCA ACTTCAACAG CTCGCCATGC AACTCGCGAT GTGCCAAGCT CAAATTCACA TGGCTCCAGT TCGGTCTAGG CCCCACATCC TCCACGTCTT GATGATGCAG CAACAGCATC TAAAGATGAC AAACGAGCAC GAGGAAGAGC TGGCAAATGT TGGCGCTAGC AACTGGCTCA TGGAAGCGAC TCTGCAACTA TCTGGAATGT CTGGCATGAT GCCGCCGCCG ACGGCGCAAG ACACCGGCAA AGAACCTTCA ACATCCACGG GTCATGGTAG TCCAAGCGAT ATGTCTGCTC AGTTGAAGAC GAACGAACCG ATGATGGTTC CGCAGCAGTT CATGTTGCAA CCGGCCATGC CATCGCTTGT CGGCGACGCG TCTGGAATGC CCAAGCAAGA GATGGATTAC ACGTATGACA TTCCTCGAAA AGATAATGTC GATCCAGCGC TGCAAGTGTA CGCGGGCATG AAGGGTGCTC TGGCGATGGA GCTGGAACCT GTTGAGGAGA CGCTCGCGTT GATGCGCGAA GTGTGCGGTC CGGCGTGGAG AGAACAAGCC ACGCGGCTCA TGACTGGTGC GCCGTTCCCG AGGTTGATCA AGCTTCACGA GCTCAAGGAA TCTGCCATGG CTGCTGGTCT TTGCCCTGGT GCCGGCATTG ATCCTTTGGC CGATCGCGCG CACGCGTTGG AGGTTGCTGG ACAAATCTGG CTCGAACGCG CCGCCGCCGT CGTGCAAGAC AAAACGATTC CTATCGAGGC GGCGCAGTTG CTGCTTCAAG AGGGTCGATC TTTGCCTTTA TACTTGAAGG AGGAGCTCGA GGAGTTGGGA GAGCGATGCG AACTGTATTG CGTTTGCCGA AGCGCGTACG ACGCTCTCAG GCCGATGATT TGCTGCGATC GATGCGATGG TTGGTTTCAT TACGAGTGCA TCGGCATGCA GTCGCCGGCG CCGGGCGAGG AAGACGAAAA CGCCGAAAAC GTCAAGTTTG CCTGCCCAGA GTGCTGCGCG GCGCAAGGTA TTCCGTACGT TCCGTTCCGT CCAGCGCCGA AGGACACGGA CAAAGCGCCG GAGCGAGCCG CGCCTCCGCC GGCTGAAGAA GCTCCGGAGC CGCCAAAGAC GGAAGAAGAA GCGAAGCCGC CCGCTAAACC AGAACCGGAA GTCGCGGACA ACAAGAAGAA ACGAAAAACG GTGGAGCCGA CTCCGCCTGC TAATAAAAGT 
ACCTCTAGTC GCAGCAGAAG ACGAAAGTAG
|
Protein sequence | MVGCDDCGDW YHLKCINVTP TMAKTMHNYI CPPCIAKSGK ASALSLDTYR SVHRTNRPNV MLLRELLGEA QRFPGEVPEE AVLNQVINTH DAWRLEVRKT LHRKSMKEAH DNAISQASKI AQEAAMRESE ERTRRIAEAS LAATAENPNP PASLTPLQQV MLMGQAVTFA ATQRTHALNI SPCSQEISDV VTKQLGQLQQ LAMQLAMCQA QIHMAPVRSR PHILHVLMMQ QQHLKMTNEH EEELANVGAS NWLMEATLQL SGMSGMMPPP TAQDTGKEPS TSTGHGSPSD MSAQLKTNEP MMVPQQFMLQ PAMPSLVGDA SGMPKQEMDY TYDIPRKDNV DPALQVYAGM KGALAMELEP VEETLALMRE VCGPAWREQA TRLMTGAPFP RLIKLHELKE SAMAAGLCPG AGIDPLADRA HALEVAGQIW LERAAAVVQD KTIPIEAAQL LLQEGRSLPL YLKEELEELG ERCELYCVCR SAYDALRPMI CCDRCDGWFH YECIGMQSPA PGEEDENAEN VKFACPECCA AQGIPYVPFR PAPKDTDKAP ERAAPPPAEE APEPPKTEEE AKPPAKPEPE VADNKKKRKT VEPTPPANKS TSSRSRRRK
|
| |
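The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon (the gene sequence above ends in TAG); neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record for OSTLU_93625.
length_bp = gene_length(344382, 346211)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the reported Gene Length (1830 bp) and Protein Length (609 aa). The check applies directly only because the record lists the gene as unspliced; a spliced gene would need its exon lengths summed first.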