Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_39745 |
Symbol | |
ID | 4999994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 565157 |
End bp | 568345 |
Gene Length | 3189 bp |
Protein Length | 1062 aa |
Translation table | |
GC content | 58% |
IMG OID | 640415415 |
Product | predicted protein |
Protein accession | XP_001415537 |
Protein GI | 145340863 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases |
TIGRFAM ID | [TIGR02103] alpha-1,6-glucosidases, pullulanase-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGTCGA ACGTCGCGCT CCGAGCGTCC ACGACGTCCG CGCACCGATG GTCTCGCTTT TCGTCGTCGC CGTCGTCCAC GAGACGTCGC CAACATCAGC GCCAACATCA CCATCTTGAT GTCCAACACC GTCCTTCTCG TCCCGTTCGC GTCGCCGCCG CGCCCGCGGA GAAGCCGACG GTCGAAGAAG ACACCGCGCG CGTGCTTCGC GTGCGATATC ACCGCAAAGA TGGCGCGTAC GCGCGATGGG GCGCACACGT TTGGGGACGC GGCGCGGCGT CGCCGACGCC ATGGGACGCG CCGCTTGCGG CGACGACGGA TGACGCGTCG GAATGGGTGA CCTTTGACGT CGAGCTCTCG GATATCTCTC GCGGTGATGG ATCGGTGAGC GTGCTGATTC ATAAGGGCGA GGCGCAAGAC TGTCGAGTAG AGGATTTCGA CGCGACGGCG GCGCCAGAGG TGTTTTTGGT GAGCGGATAT TCGAGCGCGT TCGAGACGGA GCCCGATTTG ACGTCGTTGC CGAAAGGAGA CATAGAAAAG TATCGCGCGA TCTGGGTCGC CGAGCGCGTG ATCGCGGTGC CGGGGGATTT CGCGAACGAT GGGGACTCGT TCACGCTCGT GAGCTCGTCC ACGGCGGATT TGAAAGTCAC GGGCGAAGGC GTCGTCGGCG GCGACGACGT CGTCACCGTG GTTCGTTCGG GTGAGTTACC TTCGAGCGTG TGCGCAAAGT TTCCACACAT CAAAGCTGCG GGCTATCGCG CGCTCGAAGT TCCTTCGAGC GTCAACGTGC GAGACGCGTT GAAGCGCCAA ATCGCCGTAG CCGCGGTGGA CGCCGCGGGT AAGCCGACGG ATGCCACGGG CGTGCAGTTG CAAGGCGCGA TCGATGATTT ATTCGCTTAC GATGGACCTC TCGGAGCGGA ATTTGGAGTG AACGACAAAG TCACGCTACG CGTGTGGGCG CCCACGGCCT TGAACGTCGC GTTGGCGTTG TTCGACGAAC CCAGAGGAGA GGAGACGAGA CGCGTCGTCG CGATGACGCG GGACGAGACG TCGGGGGTGT GGAGTGCGAC CGGTGACGAT TTCAAGGATA AATATTACAA TTTCGAAGTC ACCGTATTCA ACCCGACGAC TGGGAAAGTG TCGACGAACG TCGCGTCGGA TCCGTACGCT CGGAGCCTCG CCGCCGACGG TCGCCGAGCG CACGTGTGCG ACATTTCGCG AGACGACCTC AAACCCAAGG GATGGGAGAC GTTTGAGAAG CCAAAGTTCA CGCATCCCGT GGATTGTTCA ATCTACGAGC TACACGTGCG AGATTTCAGC GCGTTGGATG AAACCGTGAG CGCTTCTGCG CGAGGCAAGT ATTTAGCGTT CTGCGAAGAA TCGAGCGTGT GCGTGTCCCA TCTCAAAAAG CTCGCCGACG CCGGTCTCAC GCACGTGCAC CTGTTGCCGT CTTACGACTT TGGTTCCGTG CCCGAGCTCC CAGAAAATCA GCTGTCGGTA GACTTCAAAG AGTTGGCCAA ATTACCACCG AATTCTCGAA AGCAACAAGA AGAAATCAGT AAAATTGCAT GGTCTGATTG CTTCAACTGG GGATACGACC CCGTGCACTA CGGCGTGCCC GAAGGCAGTT ACGCCACCAA TCCAGACGGT CCCCGGCGCA TTTTCGAGTA CCGCCAGATG GTGCATGCGC TCGCCTCGAA CGGATTACGC GTCATATGTG ATGTGGTGTA TAATCACACC TTGTCGTCCG GACCGAGTGA CGTCAACAGC GTCTTGGATA AAATTGTCCC GGGGTACTAC CATCGACGCA ACTTTGACGG ATTCATCGAG GCGAGCACGT GTTGCAACAA CACGGCCAGC GAGCATTACA TGATGGATCG TCTCATTGTG GACGATCTCG TGCACTGGGC GAAAGATTAT AAGGTTGATG GATTCCGATT TGATTTAATG GGGCACTTGA TGCTGTCCAC GATGCTTCGA GCGAAGGACG CGCTCCAATC GTTGACGCTC GAGAAGGATG GTGTCGATGG TAAATCACTG TACCTGTACG GAGAAGGATG GGATTACGCC GAGGTGGAAA AAGGTCGCGT CGGCAAGAAT GCATCGCAGC TAAATTTGGC TCACACTGGT ATCGGAAGCT TTAACGACCG CGTCCGCGAG GGTTGCATCG GAGGCAGTCC CTTTGGCGAT CCTCGCATGC AAGGTTTCCT CACGGGCTTG TACTACACCC CCAACGGCGC CGTCGATCAA GGAGACCAAG ACTCCCAACG CTATAGAATG ATGGAGGACG GCGAGAAGAT AATCGCCGCG CTCGCTGGAA ACGTCCGTGA TTTTGTCTTT GTCAATCGCC ACGGCGTCGA GGTTCCGTCG AGTTCGGCGG CTTGGCCAGA CTCAAACGTT GCCTACGCGG GCGAACCGGA AGAAACGGTG AACTACGTCA GCGCGCACGA TAACGAAACC CTATTCGATT GCATCATGTT AAGAGCGGCG GCGTCTGTGT CTCTGGAACA AAGGTGCCGA ATCAATCATT TAGCGACGGC GATTGTGGCG TTGTCGCAAG GCGTGCCATT CTTCCACGCG GGCGACGAAA TCTTACGGAG TAAATCTTTA GACCGAGACT CGTACTCCAG CGGTGACTGG TTCAATAGGC TGGACTACTC TGGGGACACG CACAACTTCG GCGTCGGGTT GCCCGGGGAG CAAAAGAACG GCGATAGATA CGACTTCATC ACGCCCATGC TCGCCGATAC GTCGATGCGC CCCTCGAAAG AGTTCATCGA AGAGGCGACG AGAAACTTTT GCGAACTTCT GAGCATCCGT CAATCGACGC CACTGTTGCG TCTTCAAACC ACGCGAGACA TTCAGCGTCG CATGAAGTTT TATAACCGCG GACCGGCGCA AACGCCTGGG CTCATCATCG CTAGTATCAA CGACGGCGAC GCGTCCACGC CTGGGTTACC ATCGCTCGAC GCGAACTACA AGCGCGTCGT GCTCGCGTTC AACGCCACTC CGAACGAAAT TTCACATCAC GAGGCTGGGC TTAAAGTTGA TTTCGCTGGC GTCGATCTCG AATTACATCC GCTCGTCGGC GGCGTCACCG CGGACGCCGT CGCGATGAGG AGCGTCTTCA TCGAAGGCGT TCCCACGATT CCTCCGTACA CGTGGACCGT GTTCGTACAG CATAGATAA
|
Protein sequence | MRSNVALRAS TTSAHRWSRF SSSPSSTRRR QHQRQHHHLD VQHRPSRPVR VAAAPAEKPT VEEDTARVLR VRYHRKDGAY ARWGAHVWGR GAASPTPWDA PLAATTDDAS EWVTFDVELS DISRGDGSVS VLIHKGEAQD CRVEDFDATA APEVFLVSGY SSAFETEPDL TSLPKGDIEK YRAIWVAERV IAVPGDFAND GDSFTLVSSS TADLKVTGEG VVGGDDVVTV VRSGELPSSV CAKFPHIKAA GYRALEVPSS VNVRDALKRQ IAVAAVDAAG KPTDATGVQL QGAIDDLFAY DGPLGAEFGV NDKVTLRVWA PTALNVALAL FDEPRGEETR RVVAMTRDET SGVWSATGDD FKDKYYNFEV TVFNPTTGKV STNVASDPYA RSLAADGRRA HVCDISRDDL KPKGWETFEK PKFTHPVDCS IYELHVRDFS ALDETVSASA RGKYLAFCEE SSVCVSHLKK LADAGLTHVH LLPSYDFGSV PELPENQLSV DFKELAKLPP NSRKQQEEIS KIAWSDCFNW GYDPVHYGVP EGSYATNPDG PRRIFEYRQM VHALASNGLR VICDVVYNHT LSSGPSDVNS VLDKIVPGYY HRRNFDGFIE ASTCCNNTAS EHYMMDRLIV DDLVHWAKDY KVDGFRFDLM GHLMLSTMLR AKDALQSLTL EKDGVDGKSL YLYGEGWDYA EVEKGRVGKN ASQLNLAHTG IGSFNDRVRE GCIGGSPFGD PRMQGFLTGL YYTPNGAVDQ GDQDSQRYRM MEDGEKIIAA LAGNVRDFVF VNRHGVEVPS SSAAWPDSNV AYAGEPEETV NYVSAHDNET LFDCIMLRAA ASVSLEQRCR INHLATAIVA LSQGVPFFHA GDEILRSKSL DRDSYSSGDW FNRLDYSGDT HNFGVGLPGE QKNGDRYDFI TPMLADTSMR PSKEFIEEAT RNFCELLSIR QSTPLLRLQT TRDIQRRMKF YNRGPAQTPG LIIASINDGD ASTPGLPSLD ANYKRVVLAF NATPNEISHH EAGLKVDFAG VDLELHPLVG GVTADAVAMR SVFIEGVPTI PPYTWTVFVQ HR
|
| |