Gene Information |
Locus tag | OSTLU_17648 |
Symbol | GTE3501 |
ID | 5004694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | - |
Start bp | 377611 |
End bp | 380325 |
Gene Length | 2715 bp |
Protein Length | 904 aa |
Translation table | |
GC content | 54% |
IMG OID | 640420115 |
Product | predicted protein |
Protein accession | XP_001420833 |
Protein GI | 145353027 |
COG category | [B] Chromatin structure and dynamics; [K] Transcription |
COG ID | [COG5076] Transcription factor involved in chromatin remodeling, contains bromodomain |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.571817 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0825581 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTCGATC CCAAGAATAA TGTGGGCATC ATCGTTCAAG ACGACGGCGT GGGCATGGAT CGTCGGCGAC TGGTCGGCAT GCTGTCGTTT GGTTTTAGCG ACAAGGAACA CAAGGCTGGG AACGTCGGGC GGTTTGGTAT CGGTTTTAAG TCTGGATCGA TGCGATTGGC GCGAGACGCG TTGATATTGA CGAAAAGAGA TGGGTACGCC CACGTCGCGT TCCTGTCACA GACGTTTTTA GATGACGCCG AGCTGGACGA CATTCTGATT CCGATGTTTT CGTGGAGGAT GGAGCGAGAC GCGACGACTG GCGGCAGGGT TTCATACGTC GCGAGCGAAC CAGCGAATAC GAAAAAATGG GATGAACACA TGTCTGTGAT CTTGCGATAT TCATTCGTAC CCTCGGAGCC ACAGTTGATG CGAGAGTTGG ACAAGATAAG AGGTTCACAC GGGACGCGCA TCGTACTTTT CAACTTGCGC GATCCGCCCG AGTTGGATTT CACGAGCTAC AAGGATGACA TTCGTCTGGT GGGGGCTATC CCAGATGACG AGCGAGCGGT GCGCGGACCG ATTTTCCAAC AATCGCGCGA AGGCCAGCAG GCGTCGATTG ATGTGCAGGA AGACTATAGT TTACGTGCTT ACATGGAAAT TTTGTACCTG AAACCGCGGT GCGAATTCAC GCTTCGCGGG CGACCAGTCG TGCCGCGAGA CCCGATCGCG CATCTCGCGC GAGAATATTA TGTGTTTCCG GAGTACAAGC CGCGAGGCTT GGACGCGGGA ATCACGATTC ACATCGGATA CGCCGCGGAT GAAACTTCGA AGAAGTGTGG GTTCCACATT TACAACAAGA ACCGATTGAT TCGCATGCAC CAACGCTTCG GTTCGCAGCT GCAGGCGAAC ACAATGATGA AGGATATGAT CGGTGTCATC GAGGCGGACT CCTTGGAACC GACACACAAC AAACAAGCAT TCAAGGAGGC CGACATCACC TATCAAAAGT TCAAGCGTCA TCTAGTGCAA TGCATGCAGG ATTATTATTT CGGTATTCAA AGGTATCGTT TAGCCGGCGG CGGCGGCCGC GGCGCGGGAA CGACGTCTTT GAAGCAAACG GCGAAGCGAC GAAAACGTTT GCACAAGTCT AGCTCGTTCG ACGAAGACGA TGAAGATGGA GGGAGCGACG GTGCGAATAA GGCTCCAGCG CCGCGCGGTA GGCCGAAAGG CGGACGTGTC GCGACCCCAC TTACCCGCAT CAAGTCGATT CATCGCTCTC TCATGGTACA CAAGAACGCG TACATTTTCT TGCGTCCAGT CGATCCGGTG TACTGGGAGA TTCCAGACTA CTTCGAAGTC ATTAAAAATC CGATGGATCT CGGTACGATT AAAGAACGCA TCGACGCGGG GTATTACGAC GAGAAAAATG TCGAGGCGTA CGCCGCCGAC GTTCGACTGG TTTGGTCCAA TGCGATGACT TACAACAAAG ACGACACGCC AGTGTTCAAG ATGGCGCGCA TCATGTCAAG AGAGTTTGAG TATCAATGGC AAACTAGAAT TGAAGACGAG GAATTCGTGG TTCCTGCGCT AGCGTCGCAC GCCGCAGCGG ACGCACGCGG GCCGACAAGA ACGGAAACGG ACACTGGCGA CGCGTTCCAC TCCGCACCGA CGTCCGCACC GACTGCGGTG TCGCGGCGCG AGAATTTAAT GCGAGAGGCT CAAGTGCGCA CTGCGGTCGA TGATCAAACG GTCGAGCCTT CGCCTGCTAT TGCCGTTCCT CCACTTTTGG TCAATAAAGT CGCGGCAGCG 
GCAGATGCAA AGAAAACAGT CACTGAGAAA GAAGAAAGCA ATATGGAGCG TCCTGATGTC AAAGCAGAGC CAAACCAGAT GGTTCGAGTT CCCAAAGCGC TCTCAACCGT CATCGATGAG AGCTTCTTCA AGATTCTCGA GGAAAAGTTC GTGCAGCTCG AGACAGAACT CGCGGAGGAA CGCGCTGCAA ATGCGCGTTA TATTGATCAG AATATTAAGA GGGGCATCAA CGGCTCTTCC ACCGTGAGCG TAGAGGAAAT GAAATCATAC GCAGGAGAGC TCGAAAGATT ACGAGTACGC GTGCAAACTC GTGAGTTCCA GCTCGACAGT TCGCGCGCGA AAGTACGCGC TCTGCAAGAG GAGAATGTGC GACTTAGAGA TCGGTTGGAG TTGAAGAAGC CGAAAACGAC CGCCGCGAAG CCAGTCAGCG AGGTTCGCGA GACGAAATTC GCCGTCAGAA ACGCGACAGC GAGCGAACTT GAGAAATTGT ACGCGTATGT CGAAAAGCAC GCGCCCAATA TCGACATTCG CGGTCGCGGC TGGCGTGTAG ACGTGTTTGA GCGAAAAGGT GGACGTCTTG CGGGGACTTC GTACAAAGAA TTCCTCTCAC CAAGCGGCCA CAAATTACGT TCGATGAAAG AAGTGCTCGC TTACGTGGCG ATGATCGATG ACTATCGAGC GACTCATTTT GTCGACATGA AATACTCGAA AGGCGACGCC AAACCCACGC AAGAGCAGGG GGGTGCACAC CCGCTGCAAA CCAAAACGGT TCCGGGCGTA CCTGTGGCCG CGCCCGCGCC GATGGACACT GATGACGTGG ATGCACCGGC ACACGAAGTG CACGCGACTA CTTCCATCGA AGGAACCGTC GATGATGCGC ATGCACCGGC GCCTGAAGCG CCCGCGACTA CGTAG
|
Protein sequence | MVDPKNNVGI IVQDDGVGMD RRRLVGMLSF GFSDKEHKAG NVGRFGIGFK SGSMRLARDA LILTKRDGYA HVAFLSQTFL DDAELDDILI PMFSWRMERD ATTGGRVSYV ASEPANTKKW DEHMSVILRY SFVPSEPQLM RELDKIRGSH GTRIVLFNLR DPPELDFTSY KDDIRLVGAI PDDERAVRGP IFQQSREGQQ ASIDVQEDYS LRAYMEILYL KPRCEFTLRG RPVVPRDPIA HLAREYYVFP EYKPRGLDAG ITIHIGYAAD ETSKKCGFHI YNKNRLIRMH QRFGSQLQAN TMMKDMIGVI EADSLEPTHN KQAFKEADIT YQKFKRHLVQ CMQDYYFGIQ RYRLAGGGGR GAGTTSLKQT AKRRKRLHKS SSFDEDDEDG GSDGANKAPA PRGRPKGGRV ATPLTRIKSI HRSLMVHKNA YIFLRPVDPV YWEIPDYFEV IKNPMDLGTI KERIDAGYYD EKNVEAYAAD VRLVWSNAMT YNKDDTPVFK MARIMSREFE YQWQTRIEDE EFVVPALASH AAADARGPTR TETDTGDAFH SAPTSAPTAV SRRENLMREA QVRTAVDDQT VEPSPAIAVP PLLVNKVAAA ADAKKTVTEK EESNMERPDV KAEPNQMVRV PKALSTVIDE SFFKILEEKF VQLETELAEE RAANARYIDQ NIKRGINGSS TVSVEEMKSY AGELERLRVR VQTREFQLDS SRAKVRALQE ENVRLRDRLE LKKPKTTAAK PVSEVRETKF AVRNATASEL EKLYAYVEKH APNIDIRGRG WRVDVFERKG GRLAGTSYKE FLSPSGHKLR SMKEVLAYVA MIDDYRATHF VDMKYSKGDA KPTQEQGGAH PLQTKTVPGV PVAAPAPMDT DDVDAPAHEV HATTSIEGTV DDAHAPAPEA PATT
|
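The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the unspliced CDS ends in a single stop codon (the gene sequence above ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the record above.
START_BP = 377611
END_BP = 380325
GENE_LENGTH_BP = 2715
PROTEIN_LENGTH_AA = 904


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, length == GENE_LENGTH_BP)          # 2715 True
aa = protein_length(length)
print(aa, aa == PROTEIN_LENGTH_AA)               # 904 True
```

Under these assumptions the reported Gene Length (2715 bp) and Protein Length (904 aa) agree with the Start bp and End bp values.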