Gene Information |
Locus tag | OSTLU_17878 |
Symbol | |
ID | 5004978 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | - |
Start bp | 346245 |
End bp | 351087 |
Gene Length | 4843 bp |
Protein Length | 1544 aa |
Translation table | |
GC content | 58% |
IMG OID | 640420399 |
Product | predicted protein |
Protein accession | XP_001421122 |
Protein GI | 145353655 |
COG category | |
COG ID | |
TIGRFAM ID | |
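
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the convention under which the listed gene length matches) and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally plus one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values taken from the record above.
span = gene_length(346245, 351087)   # genomic span of the gene
cds = coding_length(1544)            # coding nucleotides including the stop codon
non_coding = span - cds              # bases in the span that are not coding

print(span, cds, non_coding)
```

Under these assumptions the span is 4843 bp, which agrees with the listed gene length, and the coding portion is 4635 bp, leaving 208 bp unaccounted for by the protein. That difference is consistent with the "Is gene spliced | Yes" field, although the record itself does not state intron positions or sizes.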
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0363694 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGCCC CGTCGCGCGC GCCGTGCGCG TCCGCGCGCC CGGAGGCCGG CGTCGTCGAC GCCCTCGATG ATCTCGCCGC CGTCGACGCC CGCGCGTGCG GCATCGGCAA GCTCACGATC GCGCACGACG ACGACGACGA CGACGCGCGA GGTCTCCGCC ATCGCGACCG CGGCGCGTTC GCCCTCGATT TGCGCGCGTG GTCGTTCACC GCGCGCACGC AGCCGGTCCG CGCGCTCGGG GCGTGCGCGC GAGACGCGTC CGTCGACATG CGCGTGCCGT TCAGGGACGA ATACGAGCGA CACAGGGCGT CGCGACCGGA CGAACGCGCG CCGCGAGGGG CGAGTTGGCG CATCCATCGA GTGCGAGGAT GGCGCTTCGA CCTGCGCGCG CTGTACGAGG AGGTGAAAAG ACGCGGTGGC GCGGACGCGA TCGATGGAAA TCAAGGCTGG CGCGAGGTCG CGGAGGCGAT CGGGGCGCAC GGACGCGGAT GCGCGGCGGG ACACGCGGCG AGAGCGCTGC ACGGGGCGTG GCTGGAGGCG TTCGCGCGCG ACGCCAAGCG AAGAGAAGAC GCGGTGAAGA GCGCGCAGTT CGACGCGGTG AAGGGAGAGA TCGAACAAGA TGACGCGCTG GCGATCGATG CGCTGTGTGG GCTCGAATTT GGGGCGTCGG GCGCGCCCGA ACGGCGAGTC AAGGTGGAAA ATGTGCGTAA TTCGTGCCAT ATCTTCTTCG CGCGGGACCG GGATGACGGC GAATCGCACG CGTTCGCGGC GCGTGCGACG CGCGCGCGCG ATGATGGGCG TCGCGATGAC GTGCGACTGA CTTTTACGGC GCGTGTTTTC GCGACGCAGG GCCTCGATCT CGTCGCCTCG CAGCTCGCGG ACGCGCCCGA GGGCGACGCC GACTGCGACG TGTGCGGCGC GAGCGGGAAC GAAGACGCGA TGATCCTGTG CGATGGGTGC GATCGAGGGT GCGTATACGA CAATGCCACG CGAGAACGCG CGCGTGAAAA TTATCTCGAT CGTTCGGACG CGCGTTCGGG GAGAGAGAGC GCTCGCGCGG CGCGCGAGGG CGCGGCGAAC GCGCGCGTGG GCGACGGTTA GACGTGGGGT TTAGAGGCGC GCGGGGGAGA AGGAGCCGCG AAGGAGCGAC GAGACTGACG AATGATGCGC GCGTAGGTCA CATATGTACT GCTTAACGCC GAAGATGACT GAAGTGCCGA GTGGAGAGTG GTTTTGTGGA CGATGCGAAG AGATTGACGC GGAGGTGGAG CGATTGAGCG CGGATGAAGG GACGCAGTTT ACGCTCGGCG ATTTCAACGA GGCGTGCATA GAATTTGATA CGGCGTTCTT TGGCGAGGAC GCCAAGCGAA CTGGGATCGA TATGCAGGTG ATTGAGGAGT GTTTCTGGCG AATGGTTGAA GATGCGAGCT CGGTGGATGA TGTTTGCGAG GTCAAATGCG GCACGGCGAT TGACACGACA AAGTACGGAA GCGGGTTCCC GAGACACGGC GAAGCGCTTC AGGTAAAGAT TGACGGCGTG AGCCCGGAAT CTATCAAGCG CTGGTCAGAG TCGAAATGGA ATCTCAACAA CGTCGCGCGC GCGAGTGGGG AAAAGTCGTC GCTACTGGGC GCGTTGAAGG ATGACGTGGC GGGCGTGACG ACGCCATTCT TAGAAGTCGG AAGCACGTTT AGTTCCACGA CTTGGCGTCG AGAAGAGCAT AACATGTACA GTATCACTTA CAATCACTGG GGTGCCGCGA AACTGTGGTA CTGCGTGCCG GCGAGTGCAG CGGACAAGTT 
GGAAGAATGC TTCCAGAAGG TGATGCCCGA CGTCTACGAG GCGCACGTCA ACGATTTGGG AAGCGTCTTT ACGATGCTCT CGCCGAGCTT CTTGATGAGC GCGGGAGTGC CCGTGCACAC GTTGGAGCAA TTCCCAGGAG AATACGTGGT GACGTATCCC GGGGCGTACT ACGCGTCGTT CAACTGTGGA TTGAACTGCA CGGAGAGCGT CAACTTTGTG CCCGCTGACT GGCTCCCAGA GGGCTCGGCG AGCGTGGAGA GAAACCGATC GTACGCCAAG CGTTCATTGT TTAGTCACGA CGAGTTGGTT TGCAGAGTAG CAAACAATCC TAGTTCGAGC ATAGCGCCGC ACTTGTGGCC CGAGATTGCG CGCTTGTACG CTGAAGAGGC AAACGGTCGC GCCGAGCTTT TCGCGTCTGG CGTGACGCGC AGCGCGCAAA TGACGTCAGC AGACGACGAC GACGACGACG ATGGCTGCGA AAAGCCTCGT AAAGTACGAT CTAGGTTTGA CGATGCATCG AACTCTGGAT CCGATGAGTG CGTTGTTTGT CGACACATTT TGTACTCGTC TGGCGTCGGG TGCTCGTGCG ACGAGACGCG CAAGGCGTGC TTGCGACACG TCAACGACTT GTGCAAATGT GCGATGTCGA AGAAGACGAT GTTTTACAGA GAAACAGTGG CTGATTTAGA AAGTCTGGTG AAGAAGACGG AGAAGGCGCT TTCTCAGAAG GAACTAGCCA GCTTAAAATC CAAGCACAGC GACTTGGACT CAGTCACAGT GAACAAGAAT CTCGTCAAGA AGGCTCAGGC TTGGGTGAAA CGAGTGGGCG AAGAGCTCGT CAAACCACCT TTGCCGCCGC TGGACAAGAT GAGAAATTTA CTCGCTGCGG GTGAAGAGTT TATTTGGGGT GGTGCTGATA TGAAGGCGGC TCGAGAAGCG TACACTCGCG TGACGAACGC CGTCGCGTGG CAGACGAGTC TTGTGGCGCT GAAGCAAAGG CTTGGATCCA GTGCTGGTGC CGAGGCCGCG CACGATGACG CTGGTGAAGC GCGATTGAGA TTGAACAGAC TAAAAGAGCT CCTTGACAAC CCACCGGTAC CGATGCCTAA GGCTGATACT CAACCTTTCC GCGATTTGCT CGCGGCGGGC TTGAAATTGG AGGAACGCAT CAAGGCGGCT CTCGCCGAGG TACCGAATCC ATCTCCTCGC GCTTGTACGA CGCTTCAAAC TGAAGCGAAT AAGTTTGGTG TGGAAGTCCC TTCATATAAA AAGCTGAAGG ACGTCATCGT TAGAGCAGGC GCGTGGTCCA CGAAGGTGCG CGGCGCGCTT CCAGGACGAC GACAACTCCC GCCGCGCGAG GAACTAGCGA ATGCTCGAGA GATCGAGGCG TTATACGAAG AGGCGCATGG ATTACCCGTT CAGCAGAGCG AACTCTTGAC TTTGCGCAAA TCTTTAGAAG AGTTGAATTT TTGGCGCGCA AAGTCTGAGT CGCTGTTCGT CGCCAAGGTT GACGTGGAAG AAGCCGAGGC GCTTCTCAAG GAGGGCATGG CGTTGTCAAC AAAACTCGAC GAGGTCGACA GACTAGCCGA TCAAATCAAA GCTGTCAAAG TGTGGGCCGA TCATGCTCGA GCGAGTGATT ATCCGGGAGC AAGAGTGACG GATTTGCACA TGCTTCTCGT CGAGGGTGAA AAGTTTAGCG TGCGAGTGGA CGAAGTCGAG TGGTTGCGAA ACCGTATCGT CGTCCGGGAG CTGGCAGAGA AATTGAAGGA CATGGTTTCA TCGAAGAAAT ATCCGCTCGC CGAAGTCGAA 
GCGGCCGTGC GGGCGGGTAA CGAGTTCCTC GACTCCGAAG ATAAAGAGGT TGCCCCAGAT GAAGAAGCGC TGCTGGCGCA GTGCGAGAGC CACATCAACG CGGCGAAGAA GTGGAACGAG CGCGCGGCGG TGATGCTCAA ATCTCTCGAT AGTAAGGACC GACCGTCACT CGAAGACGCC GCGTCGCTCA TCCGCGAAGG AAGCTCGATT CCAATCTTTC TGAATGGATT CGACGTGCTT TCAGAAGCGG TCAATGTAGC CAAGTCTTGG TTGGATCGTG CACAACCTTG TTTGAAGGGA AAGCAGCTCA CTCGTCGCGG TGTGTCAAAT CCGATTCCGC CGCTTTCCGA GGCGCAAGAG CTCATGAAAG AGTCTTCGAA CTTGAAATTG TTTGTCAAGG AAGTTGAAGC TTTGGAAGAA CGCGTCGAAG CCGCAGAAGA GTGGGACGTG GACGCGAAGG ACGCCATCGA GCGCTGGCGA GAAGACGGCG CCGAGGTGAC GTTGACCGAG CTTGAACTTT CGCACGAAGA CTTTGGATTA GAATTACCAG CCATGGAGAC AGTTCGCATT CGTTTAAAGT CGCTCAAGTG GGAAGAACGC GTCGCTAAGA TCATTGCACC CAAGGCGAAG CTCGTCGAGG ACACCGTTTT GGACGAGCTC AGGGAAGAAA TCGACGTGCT TCAAGACTTG AAAGAGGATC TTGTCGCTGA AATCATGAAA CGGTACACGA TTGTGGACGA ATGGCGCAAG AAAGCGGATC GTTTGCTCGA TCCTCCTCTC CTTGAAGACG GTCGATTGGC TCCGAGTGCA TCACCGGAGG AAATAGACGC CCTCATCGCC GAAGGCAAGG CACTTCCCGC TGACGTCTCG AAGGTAGAAG ACCTCGAGGC GTCGTTGGCG GATCACGCAC AGTGGGTCGA CACCGTTCGT AAGTGCTTGA ACAGTGTCGC TGAAGGTCGA TCTCGCCCTT CGATTGACGA ATTATACGAT CTTCTAGCGG AGGTGGAAGA TTTGACCTTC AAGTGCTCAG AGAGACAAGC GCTCACAAAT GCGTGCAACG CCGCCACGGC GTGGACGGAG AGGCTCAACG CGTTGCTCTG GGCAAACGAT CAGGGTGAAG TCAAACAAGA AAAAACGCTT ATCGAAATGT TGAGCAATGT TCTTGATTCC ATCAAAGCTG GTGTGGAAGA CATCACCGGG ACTGGCGAAC CGCCGGAGAC GGAGGAGGGT TAG
|
Protein sequence | MRAPSRAPCA SARPEAGVVD ALDDLAAVDA RACGIGKLTI AHDDDDDDAR GLRHRDRGAF ALDLRAWSFT ARTQPVRALG ACARDASVDM RVPFRDEYER HRASRPDERA PRGASWRIHR VRGWRFDLRA LYEEVKRRGG ADAIDGNQGW REVAEAIGAH GRGCAAGHAA RALHGAWLEA FARDAKRRED AVKSAQFDAV KGEIEQDDAL AIDALCGLEF GASGAPERRV KVENVRNSCH IFFARDRDDG ESHAFAARAT RARDDGRRDD VRLTFTARVF ATQGLDLVAS QLADAPEGDA DCDVCGASGN EDAMILCDGC DRGSHMYCLT PKMTEVPSGE WFCGRCEEID AEVERLSADE GTQFTLGDFN EACIEFDTAF FGEDAKRTGI DMQVIEECFW RMVEDASSVD DVCEVKCGTA IDTTKYGSGF PRHGEALQVK IDGVSPESIK RWSESKWNLN NVARASGEKS SLLGALKDDV AGVTTPFLEV GSTFSSTTWR REEHNMYSIT YNHWGAAKLW YCVPASAADK LEECFQKVMP DVYEAHVNDL GSVFTMLSPS FLMSAGVPVH TLEQFPGEYV VTYPGAYYAS FNCGLNCTES VNFVPADWLP EGSASVERNR SYAKRSLFSH DELVCRVANN PSSSIAPHLW PEIARLYAEE ANGRAELFAS GVTRSAQMTS ADDDDDDDGC EKPRKVRSRF DDASNSGSDE CVVCRHILYS SGVGCSCDET RKACLRHVND LCKCAMSKKT MFYRETVADL ESLVKKTEKA LSQKELASLK SKHSDLDSVT VNKNLVKKAQ AWVKRVGEEL VKPPLPPLDK MRNLLAAGEE FIWGGADMKA AREAYTRVTN AVAWQTSLVA LKQRLGSSAG AEAAHDDAGE ARLRLNRLKE LLDNPPVPMP KADTQPFRDL LAAGLKLEER IKAALAEVPN PSPRACTTLQ TEANKFGVEV PSYKKLKDVI VRAGAWSTKV RGALPGRRQL PPREELANAR EIEALYEEAH GLPVQQSELL TLRKSLEELN FWRAKSESLF VAKVDVEEAE ALLKEGMALS TKLDEVDRLA DQIKAVKVWA DHARASDYPG ARVTDLHMLL VEGEKFSVRV DEVEWLRNRI VVRELAEKLK DMVSSKKYPL AEVEAAVRAG NEFLDSEDKE VAPDEEALLA QCESHINAAK KWNERAAVML KSLDSKDRPS LEDAASLIRE GSSIPIFLNG FDVLSEAVNV AKSWLDRAQP CLKGKQLTRR GVSNPIPPLS EAQELMKESS NLKLFVKEVE ALEERVEAAE EWDVDAKDAI ERWREDGAEV TLTELELSHE DFGLELPAME TVRIRLKSLK WEERVAKIIA PKAKLVEDTV LDELREEIDV LQDLKEDLVA EIMKRYTIVD EWRKKADRLL DPPLLEDGRL APSASPEEID ALIAEGKALP ADVSKVEDLE ASLADHAQWV DTVRKCLNSV AEGRSRPSID ELYDLLAEVE DLTFKCSERQ ALTNACNAAT AWTERLNALL WANDQGEVKQ EKTLIEMLSN VLDSIKAGVE DITGTGEPPE TEEG
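
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the whitespace and compute a GC fraction comparable to the "GC content" field. The helper names are hypothetical, and the sketch assumes the sequence contains only unambiguous bases (A, C, G, T).

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence above, used only as a small example.
example = "ATGCGGGCCC CGTCGCGCGC"
print(len(clean_sequence(example)), round(gc_fraction(example), 2))
```

Applying `gc_fraction` to the full gene sequence (all three lines joined) should give a value that can be compared against the 58% listed in the record; the exact figure depends on whether the database computed GC content over the gene span or over some other region, which the record does not specify.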
|
| |