Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31894 |
Symbol | CHR3523 |
ID | 5002085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 724032 |
End bp | 728816 |
Gene Length | 4785 bp |
Protein Length | 1594 aa |
Translation table | |
GC content | 58% |
IMG OID | 640417506 |
Product | predicted protein |
Protein accession | XP_001418094 |
Protein GI | 145347265 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.25404 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGCGG CGGCGAGTGA TGAGCGACCG TTTTGTGGAG TTTTAGGATG CCAGACGCAC GTGGGCGATC GTCGAGCGAC GTTGAGCGCG GTGTTGGAGG AGTTGCGGCC GGAGATGCGG CGGGAGCACG CGACGATTAA GGCGGCGACG CCGGGATATT TGTGCGCGGG CGATTTGCCA GAGGACGTCG TGGAGCGAAT TTTAGGATTT TTAGACCCGA AGTGCGCGAG CACGCTCGCG TGTACGTGCA GAGGATTTCG TTCGCGCGTG GAAGAGACGT CACCGGGATT GGCGTTGACT TTACATCCGC ACCAACGCGC GGCGCTCGGA TGGATGCGAC GGCGCGAACG CGCTCGTCTA CCGCCGATTC TCGATCCACT TTGGAAGCCA ATCGATTGCG AGGACGGTCG TGTGCTGTGG CTTCACACTT TAAAGGGGGA GTTAAGCGAT CACCATCCCG ACGTATACGA AGACTCGCAA GGAGGAATGC TGTGCGACGA GCCCGGATTG GGGAAGACGG TGACTGCGCT CGCGCTCGTG CTGGCGCGAC GAGGTTGGCG ACCTTCACCA CCGATCGGTT ACACGGCGCG TAAGATGGAA TCATGTTGGT ACTACGAAGA CACAAACTCC GGTGGTGCGG GCGGATACGG TTCACCTGCG GTGATGGCTG CCGTACCGAC ACTCGACGCC GAAACGCCCG CAATTACTCC GATGAAGACA TCTGCCGCGT TCGCGGGAAG TTCTTCGAGA GGTAACTCTT CCGGCCTGAG GCGAAGCAAA CGCTCAGCCG GTAACACGCC AGTGGGGTAT TTTGCTTCGG TGTGCAAGAA TGGGTTGGAT GGCACGATGA GTAGGAGCGC AAAGCGTTCG AAATTAGAAG AGAAACAAGC GTACGAGGCG TTAGCCGCTG CGACGACGGT GGAAGTTCCA GATTGGCAGA CACGTTTGGA GTGCTCGGCA CACAGCGCCC CCATTGGTTT CATCGCGCGC ACGGGGTTGA AGGAAGATTC AGCGCGTTTG GCGCGAAACA CGTCTTACTT TGAAAGCATT CTTCGAAACT GCGTCGTGTA CGTTGGACGC GCCGCGACGT GCGACGTCGT CGATTACATT TTAGAAAACA CCGAGGAAGT GTCGCACGCG ATGCCGCTTT CGATGCCACT CTTTCGCAGA AATTCGGACC AGCCTCCGAT TTTTCGCGCA CTCGGGTTGG AACCGTACAC TGGAGTCGGC AAGTTGGAAA ACGCGATTCC GTACGCGCCG CCCGGGTCGG CGACGGAGGA CCTCATCTTA GACAAGCAAG CACTTCACGA CGCCATACGC GTCGTCGATA ACAGAGGGAG TAGCAGCGAG CCGATGCGTG TCTGGTTATC GAGCGCGACG TTAATCATCA TTCCGAGTGT CCTCATCGAG CATTGGTTGC AGCAAATATC GTTTTGCACT TGGAACATCA CCGGTGAGGG TGTCCCGCGC GTCGCGGTGC TGGACAAACC ACCGAAAGTG GACGGCTTCA GCTCAGTTTA CAAGACCGAA GGTAGCTTGG AACCCGTGCA CTTGAATGAT CTGGCGCATG TCGATGCGAA AGATTTAGCA AACGACTTTG ACATTGTCAT CATGCCCATC AATCGACTTT CCACAGAGTT TTCAAAAGTT GATACGCCGA TTTTACGCAT TCTTTGGCAG CGAGTGATTT TGGATGAAGG CCATCAGCTC GGGGCGTCTT TAGCTGTCAC GGCCAAGCTT TCTGTGGCGT GTGCACTGAA GGCACATGCA CGTTGGCTCA TGACAGGAAC ACCGACGCCG ACGACGCTCA AAGGCGCGGG GACGGCACAT TTGCAACCGC TGCTCGGATT TCTACGTCAG CCTCCTTACG GGACGAGTGC GGGGCTTTGG ACTACTGCGG TGCAGCGTCC ACTTGAAGGT AGAGACCGTT TCGCTCAGGC GGATGCCGTT GTGCGTCTCG GTGACGTTCT GCGGCGATGC ATGGTGAGAA CGTGCAAGTC GCACATCGAG CTGCCGCCGC TCAAGCGCAC GACGTCAATG CTGCAATTCA GCGATGTACA CGCCGATTCA TACAATGATC TCGTCGCGTT CGTCAAACGT AATTTACTTT TGGCAGATTG GGGCGACCCC AACCACACAG AGTCTTTGCT GAATCCCAAA AACGTTCGAG AAGCGGCAGC GACGGTGACC AACCTCCGCG AAGCCGCGTG TGTCGTCGGC CAGATGCCGA TCACGTGCTC TTCTGATGAA TTCGATGAGA CGATTCTCGC TTTGATGCAA CTTCTAAAAG AGCGAGGTTT TGGCGATGAA GAACGCAAGG CACGCGTGAG ACGCGTCTCG CCCATGTTGA TGCAGTGCAA AGGGACATGC GATTTGTGCT TGCACAAAGT CTTGATGCCG CTCGTCACTC CTTGCGCGCA CATCTTGTGC TGCGGTTGCG TCATGCGCGG TCCTCCGAAA GGCAAAACGT TGAATGGCAA GGACGTCGAA GATGAGAGAG TGAACGAACC AGAGCTCCCA CCGGGGGTGC ACCGTGCGCC GCGAGGATGT CCTGTGTGTG GATCGGCCTA CTTCATGGAG GAGGACCATG ACAACAACTT GAATCCAGTG CAAGAAGTTC CGAAGGATTT GATCGAGCTT CAGCCAAAGC TCGAACAACA CTCGTGGAAA GTCGACGAGA GTCGGGGCGG TGACGGTGTG TACGCGCAGG GTGAAAGTTC TAAAGTGGAT CATTTGCTCG CCGAACTGCG AAAGATAGGC GCCGCCGTGA GCGCGGAAAT CGTCAAGCAA CAAGACGATC GTGACGCGTT CTTCGCTCGT GAAGCGAACG GGATGTCGGA AAACACGCCG GGTTGGAGTG GTCGATTAAA CGCAAACGCA CAGGCGCGCG AAGGAACGAC GCAACAAGGT AGACGCCGCA AACGTATGCT GGACGCAGCG ACGCCGCTCA GGCTGCGAGA CTTCACCCCA ACCGCGCCGC CACCGAAAAA GTGCATCATC TATAGCGATT TTCGTCCACA TTTGGACACG ATTGATCTAG CACTTTATGG TGCGCGCGTT CCTCACGAAA GCATAACGAG AATCGGGCAA AGTCGATACG ATCGCGAGCA AGCGCTCAAG AATTTCAAAA ACGATCCCGA CTGCGCCGTT TTGCTTCTAA ACCGCGCCGC CGCTGAAGGC TTGGATCTAT CGTTTGTTTC GTACGTATTC CTCATGGAGC CGCTTTCGAA TATGTCGCTC GAACAGCAGG TCATTTCGCG AGCGCACCGC ATGGGCCAAA AAGACACCGT CCGCGTGAAA GTTTTCGCCA TGGCGAACAC CGCGGAGGAA ATCATGTTGG ACGTACAATC GGAGCTCGCG CGCAATGGCA CAAAGATGTC TCTAGATTCG CTCGACGCGA CGACGAGACA CGACGTCACA GAAGATTCTG AAGCTGACGT GGCGCCAACA GTGGTGGCAG AATCGCTTTC GCGTCGACGC ATTCTCGAAC GTCTCGAGCT CGTCCCGTCG AAGAGCTCGG ACGAAGCTGC GGCGAGAAAA GCCAAGGCGG TGACTGATCA CTTCGAGCGC ATTCGAAACG GCGGCGACGG GCGCGTGCTT GGAAATGGAA AGAAGAAAGG CGAAGAAGAA GAGGCGCGCT GGATGCGCGA GGCACTTGAG GCGAGCGCGC GAGAATTGGA AGCGCGCGCT AGTGACGGGG GCGTTCTCGG CATCGGGAGT CCCCAATCGG GAGTCCCCAA TCGGGAAGAA ATCGCTGAAG ACGATGAATC CGAGGCGTGG ACCATGCGCG TTCGCGACCC TAACGATGGA GGCGCTGTGA AAGACTTTAC ATTGCCGAAT GCGGGCAAGA CGACGGTGAG CGCGATGCGC GCGCGCATTT CTTCAAGGAT TGGCGCGAGT ATTGATGCCT TCGTCGTTCG GTGCGGTTTC CCGCCGCGCG CGCTCACCGA CCGTGATCTC TGCACGCCAA TCGCGCTGCT TGGGATTCAA GCGAATGATT TGATCAATCT TCAGGCGGAG AACGCTCCGG ATATGCATCA AGCCGACGCG ACGATCGTGA CGACCAGGGC GAAACCAAAG TTGACGAAAG CAGCTCGCGC GAAGGCGTTG GATACGTCTG TTTTGGATGC CAAGGTGCAG AGACGGCTCA AAACCGTACT CAAGCAATCC GAGACGAGTA CGGAGATGAT GTTACAAGAT CCAGACGAGG ATCCTCGAGG CGCTGGCGCC GGTGGTGGCG CCGCCGGCGC GATGTCCATC GATCTCCTTC GCGCCGCCGA GGGCGGTAAG CGCGCCGTGA ACGACGCTTC GATTCGTTCG CTTCAGACAG CTTTCCAAGC CATGGTGGAG GAACGCGCTA AGGAGTCCGA AGGTAACATG AAATGTGGTG CCGCGCGTTC CGGTGCCGTC GAATACTCGA CGCTCGCCGA CGGTCGATTG GTGGTAAAAT ACGAAGCCGT CGACATCTCC ACCGGACGTC GCCAGAAGAA AACTGACTGC GTTCAGGACA TCCCGGCCGC CATGCTCCCC ATCGTTCTCG CCCTCGTCGC CGCCGACGCC GACACCAACG CCGCCGCGCG CGCCAATCTC TCTCCTCCCT CCATGGCCGT CGCCTCCCCT CGCGTGTTTT GGGCCGTCGT CCGCCACGGC GGCGTCGGAC CGCATCTCTC GTTTCCGGAC GCTTTGAAGA AAATCGCGCC GAAGCTCAAA TTCGATTGGC ACGCCATCGA CGCCCGCGAG CGTCGCGCCA ATCCACGATA CGCCGATTAC GACACGTCTC ATTAG
|
Protein sequence | MEAAASDERP FCGVLGCQTH VGDRRATLSA VLEELRPEMR REHATIKAAT PGYLCAGDLP EDVVERILGF LDPKCASTLA CTCRGFRSRV EETSPGLALT LHPHQRAALG WMRRRERARL PPILDPLWKP IDCEDGRVLW LHTLKGELSD HHPDVYEDSQ GGMLCDEPGL GKTVTALALV LARRGWRPSP PIGYTARKME SCWYYEDTNS GGAGGYGSPA VMAAVPTLDA ETPAITPMKT SAAFAGSSSR GNSSGLRRSK RSAGNTPVGY FASVCKNGLD GTMSRSAKRS KLEEKQAYEA LAAATTVEVP DWQTRLECSA HSAPIGFIAR TGLKEDSARL ARNTSYFESI LRNCVVYVGR AATCDVVDYI LENTEEVSHA MPLSMPLFRR NSDQPPIFRA LGLEPYTGVG KLENAIPYAP PGSATEDLIL DKQALHDAIR VVDNRGSSSE PMRVWLSSAT LIIIPSVLIE HWLQQISFCT WNITGEGVPR VAVLDKPPKV DGFSSVYKTE GSLEPVHLND LAHVDAKDLA NDFDIVIMPI NRLSTEFSKV DTPILRILWQ RVILDEGHQL GASLAVTAKL SVACALKAHA RWLMTGTPTP TTLKGAGTAH LQPLLGFLRQ PPYGTSAGLW TTAVQRPLEG RDRFAQADAV VRLGDVLRRC MVRTCKSHIE LPPLKRTTSM LQFSDVHADS YNDLVAFVKR NLLLADWGDP NHTESLLNPK NVREAAATVT NLREAACVVG QMPITCSSDE FDETILALMQ LLKERGFGDE ERKARVRRVS PMLMQCKGTC DLCLHKVLMP LVTPCAHILC CGCVMRGPPK GKTLNGKDVE DERVNEPELP PGVHRAPRGC PVCGSAYFME EDHDNNLNPV QEVPKDLIEL QPKLEQHSWK VDESRGGDGV YAQGESSKVD HLLAELRKIG AAVSAEIVKQ QDDRDAFFAR EANGMSENTP GWSGRLNANA QAREGTTQQG RRRKRMLDAA TPLRLRDFTP TAPPPKKCII YSDFRPHLDT IDLALYGARV PHESITRIGQ SRYDREQALK NFKNDPDCAV LLLNRAAAEG LDLSFVSYVF LMEPLSNMSL EQQVISRAHR MGQKDTVRVK VFAMANTAEE IMLDVQSELA RNGTKMSLDS LDATTRHDVT EDSEADVAPT VVAESLSRRR ILERLELVPS KSSDEAAARK AKAVTDHFER IRNGGDGRVL GNGKKKGEEE EARWMREALE ASARELEARA SDGGVLGIGS PQSGVPNREE IAEDDESEAW TMRVRDPNDG GAVKDFTLPN AGKTTVSAMR ARISSRIGAS IDAFVVRCGF PPRALTDRDL CTPIALLGIQ ANDLINLQAE NAPDMHQADA TIVTTRAKPK LTKAARAKAL DTSVLDAKVQ RRLKTVLKQS ETSTEMMLQD PDEDPRGAGA GGGAAGAMSI DLLRAAEGGK RAVNDASIRS LQTAFQAMVE ERAKESEGNM KCGAARSGAV EYSTLADGRL VVKYEAVDIS TGRRQKKTDC VQDIPAAMLP IVLALVAADA DTNAAARANL SPPSMAVASP RVFWAVVRHG GVGPHLSFPD ALKKIAPKLK FDWHAIDARE RRANPRYADY DTSH
|
| |