Gene Information |
Locus tag | OSTLU_42332 |
Symbol | |
ID | 5003299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 336743 |
End bp | 342046 |
Gene Length | 5304 bp |
Protein Length | 1767 aa |
Translation table | |
GC content | 50% |
IMG OID | 640418720 |
Product | predicted protein |
Protein accession | XP_001419134 |
Protein GI | 145349424 |
COG category | [R] General function prediction only |
COG ID | [COG1204] Superfamily II helicase |
TIGRFAM ID | |
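The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the record itself does not state either convention); the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 336743
end_bp = 342046
protein_length_aa = 1767

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# For an unspliced CDS, every codon encodes one residue
# except the terminal stop codon (assumed to be included in the span).
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)                                 # length implied by the coordinates
print(gene_length_bp % 3)                             # 0 if the span is a whole number of codons
print(expected_protein_length == protein_length_aa)  # consistency with the reported protein length
```

Under these assumptions the reported gene length (5304 bp) and protein length (1767 aa) agree with the start and end positions.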
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGATGGTG GGCGCAAAGC GTTGCCGCCA GGGACCACGC GCATCGTTCA TCCAGAAGGG TACGAAGAGA TCTCTGTACC GGCGCGAGAG CCAGATCCCG TGGCCGCTGG TGAGCGTTCT GTAGCGATTG AAGAGCTCGA TGAGTGGGCG CAGCCCGCGT TTCAAGGCAT CAGAATGCTC AATAGGATCC AAAGCAAGAT TTTCCCGCAG GCGTACCACA CCAACGAAAA CTTACTCGTG TGCGCGCCGA CGGGAGCCGG GAAGACGAAC ATCGCCATGC TAACCGTCTT ACACGAAATC GGTCTTCACA TAGATGAGAA CGGCGACTAC CTTCCAGAGG ATTTCAAGAT TGTCTACGTG GCACCCATGA AAGCGCTCGC GGCTGAAGTT ACGGATGCAT TTAGCAGAAG ACTTGCACCT CTTGACATAG TCGTTGCCGA GCTCACGGGC GATACGCAAA TGAGTAAACG CGAGCTCGAG GAGACACAAA TGATCGTGAC GACACCCGAG AAATGGGATG TCATCACACG AAAAGGCGGA GAAGTTTCTG TGGCGAGTAC TTTACGTTTA CTCATCATTG ATGAGGTGCA CTTGTTGAAT GACGAGCGGG GGCCGGTGAT TGAAACGCTC GTCGCCCGTA CTCTTCGTCA AGTCGAACAA ACTCAGAGTA TGATACGGAT TGTCGGTCTT TCTGCGACGC TGCCGAATCC CGTAGATGTG GCTCGATTCT TAGGCGTCAA CAATGACGCC GGTTTGTTCG TTTTTGATCA GTCGTACCGG CCGATCCCAT TGACGCAAAA GTTTATCGGG GTCACAGAGA AAAACTCGAT GAAACGGCAA ACACTCATGG CGCAGATTGC GTACAACAAA GCGTGTGAAG CGTTGAGAAA TGGCAAACAA GCCATGGTCT TTGTACATAG CCGGAAGGAT ACAGTCAAGA CGGCGAAGCA ACTCGCTGAG TTTGCCGCTG CGCAGGATGG TATGGAACTC TTCTCCAACA ACCAACACGA GCGCAAGGCT GAGTTCGCGC AACAAGTGTC ACGAAGTCGG AATAACGAGT TGAAAGATCT GTTTCTCAAA GGACTAGGCT GTCACAACGC AGGTATGCTT AGAGCGGACC GTTCATTGAC GGAAAAACTT TTCGCCGCCG GACTCATTAA AGTTCTAGTG TGTACGGCAA CATTGGCGTG GGGTGTGAAC TTGCCCGCGC ACATCGTCGT CATCAAAGGC ACGCAGCTTT ACGACCCTCA ACGCGGTGGC TTCAGGAATC TCGGTGTTTT AGATGTGCAA CAAATTTTCG GTCGTGCAGG GCGACCAGGT TTTGACACTT CGGGCGAGGG TGTCATCGTC ACCGAGCACA AAAATTTGGC GCATTACGTG TCCATGCTCA CGCATTCGAC ACCGATTGAA TCACAATTTG TGAGCAACTT GGCTGATAAT TTGAACGCCG AAGTGACTCT CGGTACAGTT ACGAACGTGC GCGAAGGCGC GCAGTGGTTG GGATACTCTT ACCTACACAC GCGTATGGAG AAGAACCCAC TCGCCTATGG TCTGACGTGG GACGACATTC GTCTTGATCC AGGCTTACTA GACCACCGCA GAAAACTCAT CAAAGAAGCT GCAAGGGTTT TAGATCGTGC GAAAATGATT CGATTCGACG AGCGGAGTGG ACAGCTTTAT CAAACCGAAG CTGGCCGTAC GGCGAGTCAC TTTTACATAC GAGTCAATTC TATGGAAGTT TTCGATGGTT TGATGCACAG ACACATGACG CTGCCGGACA TCTTTCACAT GATTTCCCAT 
TCGAGCGAGT TCGAAAACAT AGTTCCCAGA GAAGATGAAA TTCCTGAGTT GGAGACATTG CGACGCAACC GCCGCGTGGT GCCGATTGAC ATCAAAGCCT CGCTCACAGA CAAAGTAGGC AAAGTCAATT TACTCTTGCA AGTCTACATC TCTCGTGCGA GCATGCAATC TTTTTCTCTT ATTGCCGATA GCATGTACAT TTCGCAGAAT GCGAGCAGAA TTTGTCGCGC ACTGTTTGAG TTATGCTTAC GACGTGGCTG GCCGTCGCTC GCGGAACAAT TGCTCACGGT GAGTAAATCG TGTGATTTGA GAATATGGCC GCATCAACAC GAGCTTCGAC AATTTGAAAA GTCGCTCAAG CCCGAAGTTT TGTTTAAGCT CGAAGAGAAG AAAGCCACTT TGGATCGTTT ATGGGATATG AGTGCGAGTG AGATTGGCAG TATGTTGCGA CTGAACACTC AAATTGGTGG ACAAGTCAAG TCGTGCATGC GCGCGATGCC GCATTTGAAC ATGACGGCTG TCGTGCAGCC CATCACGCGC TCCGTGTTAC GAGTCTCTGT GACACTTACT CCTGAATTCG AATGGAGAGA TGCAGTGCAT GGTGCTCTTC AACGATGGCT GATCTGGGTC GAAGATCCGG TAAACGAGCA CATATATCAC TCGGAAACTT TCAACTTGAG TAAAAAACAA AGCCGCGACG GGGCGCAATA CTTGGCATTC ACCATTCCCA TTTTTGAGCC AGTGCCGCCG CAATATTTCT TGCGTGCCAT GTCGGAAACC TGGCTCGGAT GTGAATCTTT CGTCGAACTA AACTTCCAGC ACCTCATCTT ACCTGAAGAA CATCCACCGC ACACAGAATT ACTCGATCTT GATCCTCTCC CGCGATCAGC TTTGAACAAT CCAGTGTATG AGTCGATGTA CGAGGGGAAG TTTACGCACT TCAATGCCAT TCAAACGCAA GCGTTTCACA CGCTCTATCA TACCGATACG AACGTTCTCC TTGGTGCGCC GACAGGCTCC GGTAAGACGA TTTCGGCCGA GCTATCGATG ATGAAAGTTT TCCGTGATTC TCCTGGTTCA AAAGTCGTCT ACATTGCACC GCTCAAGGCT TTGGTTCGAG AACGCATCAA GGACTGGAGA AAGAATTTGT GCCCGACGCT CGGCTTGCGC ATGGTCGAGC TCACAGGTGA TTACACGCCA GATCTTCGAG CGCTTCTTCA GGCCGATATC ATCGTTAGTA CTCCGGAAAA GTGGGATGGC ATCAGTCGTA ACTGGCAAAG TCGCGCGTAC GTCACCAAGG TTGCGCTCGT CGTTATTGAT GAGATTCACT TGTTGGCGAG CGACAGAGGT CCTATTTTGG AAGTCATCGT GTCTCGTATG CGATACATCT CTGCGCGAAC GGGTTCAAAT GTTCGTATCG TTGGTTTGTC AACAGCGTTG GCGAACGCGC GTGATCTCGG AGATTGGCTC GGCATTGACA AGGAAGGTTT GTTCAACTTT CGACCGTCGG TGCGCCCGGT ACCGCTCGAA TGTCATATTC AAGGCTTTCC GGGTAAATTT TACTGCCCGC GCATGATGAC GATGAATAAG CCAACGTACG CGGCTATTCG CACGCATAGT CCTGAAAAGC CGACGCTTGT TTTCGTGTCG AGTCGACGAC AAACGCGCCT GACAGCGTTG GATTTGATCG CGTACGCTGC CGCGGACGAG CGCCCGGATG GCTTCGTGCA CATGAGCGAC GACGAGTTAA CGATGCACTT GAGTAAAGTC AAAGATCCAG CGTTGAAGCA CACCTTACAA TTTGGCATTG 
GTCTACACCA CGCCGGTTTA ACGCCAGAAG ACCGAGAGTT ATGCGAAGAG CTTTTCGCAC AATGCAAGAT ACAAGTGTTG GTGACGACGT CAACGCTCGC GTGGGGCGTC AACTTGCCTG CGCACTTGGT CGTAATCAAG GGCACTGAAT ATTTTGACGG CAAAACGAAG CGTTACCAAG ACTTTCCCAT TACAGATGTT CTTCAAATGA TGGGTCGAGC GGGGCGACCG CAGTTCGATA AGTCTGGGTG CTGCGTCATT CTAGTACACG AGCCTAAAAA GACGTTTTAC AAGAAATTTC TCTACGAGCC TTTTCCAGTT GAATCGAGCT TGGCAGAAAA CTTATGCGAC CATTTCAACG CCGAAATCGT GAGTGGCACA ATAAAGACTA AGCAGGACGC AGTCGATTAC TTGACGTGGA CTTACTTTTT CCGTCGCTTG CTCAAAAATC CCACTTACTA CAACTTGGAT ACCATTCAAA CAGATAACTT GAATGAATAC CTCAGCGATT TGGTCGAGAA CGCTTTGGAG TTATTGGAGG ACGCTCGTTG CATTGCAATC GATGAAGAAG ACGATGGGTT GGAGCCTTTG ATGTTAGGAC GTGTCGCATC GTATTACTAC CTACAGTACC CGTCCGTCGC ACTTTTCGCG TCGAATATCA AGGCGAATAG TTCACTTGAA TCATTGCTGG AAACGTTGTG CGGTGTGGCA GAATACGATG AACTACCCGT TCGTCACAAC GAAGACAAGC TCAACACGGA GTTGGCAGAA GTTGTGGCTG ATGCCGGTGG TTTCCAAGTC GACATTCGTC TGGCTGAGGA CCCGCACGTG AAGACATCTT TGCTTTTCCA ATGCCACTTC TTGCGGCTAC CGCTACCTCT GAGTGATTAT TACACGGACA CAAAGAGCGT CCTCGATCAG GCGATTCGTA TTTTACAAGC GATGATTGAC GTCACTTCCG ATGCTGGTTG GCTACACACC GCACTGAGCA CGATGAATTT GATGCAGATG ATTATGCAGG GTAGAATGAT TACCGATTCT TCGTTGTTGA CGCTTCCGCA CATCGAACGC AGACACTTGA GAAACCTTGA GAAACACGGC TTGTCTATTT TACCTCAGTT GATGGATCTA TGCTCTTCCA ACAAGCAACA GGCGCGTCGA GTTCTCTCAG AATGTGGTAT CAACGGTCGT AAAATTGATC AAGTCGTCGA TTTATGTCTG AGACTTCCGG TGATTGACGC CAAGGCGACG ACGGAGACGA CCAAAGGCAT CAATGGTGAG AAGACGGTAC ACGTAAAGCT TCGACGTATC GGCAAGAAAT GCGGTTCGAA GGCACCGACG TCGTATACAC CGCGCTTTCC GAAAATAAAA GAAGAAGGTT GGTGGATAGT CGTTGGAGAC ACCGCGAATG ATGAACTGTT GGCGCTGAGA CGAATATCGT TTGGTGATGC TGCTAACGTC AAGCTGAAAT GTCCATCCGG CTCGTCTTCA CGCGCACGCC CCGATTTGGT GGTGTTTTTG ATGTCTGACT CATACATCGG ACTTGATCAA GAAGTCAAGA TTGACTCAAA CACGATGGTT GATGAAGATT CGAGCGATGA ATTTGCTGAA GACGACGATA CATTTTGGGC TTTGCCCCCT GACTCGACTG AGCCTTTCTG GCTAGGTGAA GGTGAAAACA CGCTACTAAC GTGA
Protein sequence | MDGGRKALPP GTTRIVHPEG YEEISVPARE PDPVAAGERS VAIEELDEWA QPAFQGIRML NRIQSKIFPQ AYHTNENLLV CAPTGAGKTN IAMLTVLHEI GLHIDENGDY LPEDFKIVYV APMKALAAEV TDAFSRRLAP LDIVVAELTG DTQMSKRELE ETQMIVTTPE KWDVITRKGG EVSVASTLRL LIIDEVHLLN DERGPVIETL VARTLRQVEQ TQSMIRIVGL SATLPNPVDV ARFLGVNNDA GLFVFDQSYR PIPLTQKFIG VTEKNSMKRQ TLMAQIAYNK ACEALRNGKQ AMVFVHSRKD TVKTAKQLAE FAAAQDGMEL FSNNQHERKA EFAQQVSRSR NNELKDLFLK GLGCHNAGML RADRSLTEKL FAAGLIKVLV CTATLAWGVN LPAHIVVIKG TQLYDPQRGG FRNLGVLDVQ QIFGRAGRPG FDTSGEGVIV TEHKNLAHYV SMLTHSTPIE SQFVSNLADN LNAEVTLGTV TNVREGAQWL GYSYLHTRME KNPLAYGLTW DDIRLDPGLL DHRRKLIKEA ARVLDRAKMI RFDERSGQLY QTEAGRTASH FYIRVNSMEV FDGLMHRHMT LPDIFHMISH SSEFENIVPR EDEIPELETL RRNRRVVPID IKASLTDKVG KVNLLLQVYI SRASMQSFSL IADSMYISQN ASRICRALFE LCLRRGWPSL AEQLLTVSKS CDLRIWPHQH ELRQFEKSLK PEVLFKLEEK KATLDRLWDM SASEIGSMLR LNTQIGGQVK SCMRAMPHLN MTAVVQPITR SVLRVSVTLT PEFEWRDAVH GALQRWLIWV EDPVNEHIYH SETFNLSKKQ SRDGAQYLAF TIPIFEPVPP QYFLRAMSET WLGCESFVEL NFQHLILPEE HPPHTELLDL DPLPRSALNN PVYESMYEGK FTHFNAIQTQ AFHTLYHTDT NVLLGAPTGS GKTISAELSM MKVFRDSPGS KVVYIAPLKA LVRERIKDWR KNLCPTLGLR MVELTGDYTP DLRALLQADI IVSTPEKWDG ISRNWQSRAY VTKVALVVID EIHLLASDRG PILEVIVSRM RYISARTGSN VRIVGLSTAL ANARDLGDWL GIDKEGLFNF RPSVRPVPLE CHIQGFPGKF YCPRMMTMNK PTYAAIRTHS PEKPTLVFVS SRRQTRLTAL DLIAYAAADE RPDGFVHMSD DELTMHLSKV KDPALKHTLQ FGIGLHHAGL TPEDRELCEE LFAQCKIQVL VTTSTLAWGV NLPAHLVVIK GTEYFDGKTK RYQDFPITDV LQMMGRAGRP QFDKSGCCVI LVHEPKKTFY KKFLYEPFPV ESSLAENLCD HFNAEIVSGT IKTKQDAVDY LTWTYFFRRL LKNPTYYNLD TIQTDNLNEY LSDLVENALE LLEDARCIAI DEEDDGLEPL MLGRVASYYY LQYPSVALFA SNIKANSSLE SLLETLCGVA EYDELPVRHN EDKLNTELAE VVADAGGFQV DIRLAEDPHV KTSLLFQCHF LRLPLPLSDY YTDTKSVLDQ AIRILQAMID VTSDAGWLHT ALSTMNLMQM IMQGRMITDS SLLTLPHIER RHLRNLEKHG LSILPQLMDL CSSNKQQARR VLSECGINGR KIDQVVDLCL RLPVIDAKAT TETTKGINGE KTVHVKLRRI GKKCGSKAPT SYTPRFPKIK EEGWWIVVGD TANDELLALR RISFGDAANV KLKCPSGSSS RARPDLVVFL MSDSYIGLDQ EVKIDSNTMV DEDSSDEFAE DDDTFWALPP DSTEPFWLGE GENTLLT
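The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. The Translation table field in this record is blank, so the code below assumes the standard genetic code (NCBI translation table 1); the `translate` helper and the variable names are hypothetical and not part of the source database. Only the first and last blocks of the gene sequence are used, copied from the record.

```python
from itertools import product

# Standard genetic code (an assumption; the record leaves "Translation table" blank).
# Codons are enumerated in TCAG order for each position; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 24 bases of the gene sequence, as listed in the record.
gene_prefix = "ATGGATGGTG GGCGCAAAGC GTTGCCGCCA"
gene_suffix = "GGTGAAAACA CGCTACTAAC GTGA"

print(translate(gene_prefix))
print(translate(gene_suffix))
```

The suffix is 24 bases long, so it starts on a codon boundary of the 5304 bp gene and can be translated on its own. Under the standard-code assumption, the two fragments correspond to the first ten residues and the last seven residues of the listed protein sequence.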