Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54681 |
Symbol | |
ID | 7202246 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 184668 |
End bp | 187993 |
Gene Length | 3326 bp |
Protein Length | 1028 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | endo-1,3-beta-glucosidase |
Protein accession | XP_002181321 |
Protein GI | 219121954 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAAGTGAAG CAGTCCAATA GTTCTTGGGA GGACTGTCCA TGTCGATTCG TCTCTTCTCT ACCGCCTTAC TAGCTGCTTG CTTAGCAAAG GCAACTGCCC AAACTTGCCC CACTCTTATC TGGAGTGACG AGTTTGACGG TTCCAATCTC AACGCCGACA ACTGGACTCC GCAAATTGGT GATGGTTGCG ATATCTCCGT AGACCTCTGT GGTTGGGGAA ATAACGAAGC TCAGTTTTAC CGTAGCCAAA ATATTGCAGT GGCGGATGGT TCACTCAAAA TCAACGCCTT GCGACAAACT TTCGGGGAGA AAGAGTTTAC GTCCGCACGC ATCCGTTCCT TCGAGAAGTT TGACATTGAT TTGACCCAGC CTTACGTTCG CATCGAAGCG CGAATCAAAG TCCCCCTTGG TCAAGGCTTA TGGGGTGCAT TCTGGATGAT GCCTTCTCCG GAAGTGCTGT GGCCCAAAGG TGGAGAGATT GGTACGTATT TATCACTATG TTGTGTGTAG CTGATAAGAT TCGGCCACAC TTATAGTTTC TTCTGTATTC TGATGCAGAT ATTATGGAGT TTATTGGCAG AGAACCCAAT TCAGTAAGTA CCGAAGACTA GAACACGACC ACAACGATTC GCTTTTTCAT AGCCTAACCC ATAACGTCCT TGGACATAAA ATGCAGGCTT ATGGAGTCAT TCACTACGGT AACTTGTTTC GAGACAAGTC TGAGTTGGGA GGTCCACTGA AAATGCCTAC CAGTCCTAGC GAAAATTTTC ATATTTACAC AATCGAGAAG ACACCAAACC GTATCGCCTG GTTGGTAGAT GGCTTTGAGT ATCAGTCGTA TACAAATTCC GACATTCAGC CCAAGTACAG CTGGCCATTC GAGCAGACTT ACCATTTGAT ACTCAATTTG GCTGTCGGAG GAAATTGGCC AGGATATCCG GATCGCGATC AAGCTGTGGA TGTGATAATG GAGGTGGATT ACGTTCGTGT CTACGATATG TCAGGAGGAT CTATCGGAAG CATCACCGGA GAGTCCCTGG TACAGATCAA TGGAGCCGAT GGGCTCTATT GTATTGACGA TAGTGACGTT TTGTTCACCG ATATTGCTTG GACAGTCCCG ACAGGCTCGA GTTTTACACC CTCCTTGGAT GATGGAAACT GCATCATTGT CGCCTTCGGT TCGGTCTCAG GGTATATTCA AGCTGTCGGC CAGACCGCTG ACTGTGGTCC CCTAAGCTAC AGTATGCCAG TGGAAGTTCA GCCCCTCTAC CAGAAAGAAT TCGCCGTGGT GCCAATTGAG GAAGGCAGCA CTACTATCGG TGCTTCAACA GGAAGTCAGA GCTTCCTCCT GATCGACGGT ATCCCAACAT TTCAATACGA TCGGTTAGCT TCTGACTTGT ACGATAACAT CATCATCGGT ACCTCTGATG TCTCCGACGC CAGTCTCTAT GTGTCAGAGC AGAAGAAGTT TTACATGGAC ATATCATCGC CAACGGCTGC CACTTGCACT CGAGTTATAA TTCAGTTTGA AGATACCACC ACAGCATTAC CGGACAACTA TCCAATCGGG CGACATAGTC GATACGTTGG ATACCTAGAC GATTCAGAGA GCTTCCAGCG CGTCGAGTTT ACATACTATG ATCGCCCGGA CACAAGCGTG GCTGACAACG CCGTCAGTCA GCTGGCCGTT TTGATTGATC CATTTCTGTT CCGCGGTGAC CGATATCTGA TCCGCAACTT TGACAGCTCT TCTGCAGGCT GCTCCAGCAA TTGCGAACCC CTTTCGACCA ATGCCTGCCG TACATACGCC AAGTCCGAAC AAGGAATGTG TACAGATGGC GAAAACAACG ATCGTGAGGG ATACAATGGC GACGATCAAA TAGACTGTGA GGATGCGGAT TGCTACGGAC TAGACCCTGC CTGTCCTCTG AGAGAGGGAC GGGCAACTGA AGTGCCTACG ATGCTGAGCC CTACATTGGC GCCCACTGGG TTGCCCACGG GCAACGCCTC TGACCCTCCC TCAATAGTGC CTTCGAGTGA AACTGCTTTC CCGACCGCCA CTGGGACTGT TAGAGTGACT AGTCGCTCAT CGGACATCCC GTCCTCGAAA CCAAATATCA TGCCCATAAC CTCCGGGCCT ACGAGCAGCG CTGCTCCGTC TTATTCCAAC GACGAAGCAG AGTGTGACGC GCAGCCGCGA TGCGCTGCGT TGGATCTTGC GGGAGTGTGT TGCCCTACGA TTGACGCAGT CTATCTTGAT TGCTGCGACA ACCGGCCGAT CGATCCAAAC TCAGAAGCAG GTTTCTGCAA CGACGGAATT GACAACGACA ATGATGGTTT GTTTGACTGC GAAGATCCTG ACTGTGCGAA CGACGAGATG TGTCGCGCCG ATTGCGTCGC CATTGGAACT TGCGCTGGGC TTTCAGGTCA GTGCTGTCCA ACGATTGACA GCATTTTTTT GGATTGCTGC GATGCAGCCG TTCTTTCGGC GGGATACTCT GTTGATTTTA GCCTGATCGA TCCGGCTACA ATCGAGGAAC AGGCTGCATA CTGGCTTTCG ACTTCACCCT ACACTGTTCC AGGTGTTGCT CCGGACGGCT CGACGCCGGT AGTGGATTAT GCACGCGACG AAGCTACTTT GTACGATATG ATTTACTACC AAACAGGTTC TATTGAAAAA GCATCGGAAT ATGTTATTGG TCGTCGAAAG ATATACATGG ATGTTTGGAC GGACGCGCCT GAGTGCACTC AGATAATTCT TCAGTTTGAT AGCTTGCCGA CCGCAGTGGC CGACAATTAT CCAACGGGCC GCCACAGTCG ATACGTGGCA TTTACCACTA GGAGAAACGA ATGGGAACGT ATAGCATTGG ACTTTTGGGA CCGCCCAGAT GGTGATCTGG ATGACACGGT TATCAATACC ATTGCTCTCT TCTTCGACCC AGGCACTCTA ACCTCTCACC AATTTTACTT CCGAAACATG GACAGTACCC TCTCAGGCTG CCGAGAGAAC TGTGAGGTTG CCGCAATAGA TGACTACTGT TTGGCTCTGA GCTCGGGGGA GAGTGGATTC TGCGGGGATA GTCTCGATCA GGATGGGGAC GGACTGATTG ACTGCGTGGA TCCAGACTGC ACACTGGACG AGGCCTGCTC CGCACAACTA TCCATCAGCT ATTCAGCCTT GCAAGACAAT TCATTCCGCA CCGCTGCCTC CAGTAGTTCT GCACAAGTTA TCTTGAACGG TGGTTTGATT GTGTCTTTTC TGGCGATCCT CCATACCATA GCTTAATCGG ACGTACCTTC ACGTAGAAAA AAATATTTTT GGTTTG
|
Protein sequence | MSIRLFSTAL LAACLAKATA QTCPTLIWSD EFDGSNLNAD NWTPQIGDGC DISVDLCGWG NNEAQFYRSQ NIAVADGSLK INALRQTFGE KEFTSARIRS FEKFDIDLTQ PYVRIEARIK VPLGQGLWGA FWMMPSPEVL WPKGGEIDIM EFIGREPNSA YGVIHYGNLF RDKSELGGPL KMPTSPSENF HIYTIEKTPN RIAWLVDGFE YQSYTNSDIQ PKYSWPFEQT YHLILNLAVG GNWPGYPDRD QAVDVIMEVD YVRVYDMSGG SIGSITGESL VQINGADGLY CIDDSDVLFT DIAWTVPTGS SFTPSLDDGN CIIVAFGSVS GYIQAVGQTA DCGPLSYSMP VEVQPLYQKE FAVVPIEEGS TTIGASTGSQ SFLLIDGIPT FQYDRLASDL YDNIIIGTSD VSDASLYVSE QKKFYMDISS PTAATCTRVI IQFEDTTTAL PDNYPIGRHS RYVGYLDDSE SFQRVEFTYY DRPDTSVADN AVSQLAVLID PFLFRGDRYL IRNFDSSSAG CSSNCEPLST NACRTYAKSE QGMCTDGENN DREGYNGDDQ IDCEDADCYG LDPACPLREG RATEVPTMLS PTLAPTGLPT GNASDPPSIV PSSETAFPTA TGTVRVTSRS SDIPSSKPNI MPITSGPTSS AAPSYSNDEA ECDAQPRCAA LDLAGVCCPT IDAVYLDCCD NRPIDPNSEA GFCNDGIDND NDGLFDCEDP DCANDEMCRA DCVAIGTCAG LSGQCCPTID SIFLDCCDAA VLSAGYSVDF SLIDPATIEE QAAYWLSTSP YTVPGVAPDG STPVVDYARD EATLYDMIYY QTGSIEKASE YVIGRRKIYM DVWTDAPECT QIILQFDSLP TAVADNYPTG RHSRYVAFTT RRNEWERIAL DFWDRPDGDL DDTVINTIAL FFDPGTLTSH QFYFRNMDST LSGCRENCEV AAIDDYCLAL SSGESGFCGD SLDQDGDGLI DCVDPDCTLD EACSAQLSIS YSALQDNSFR TAASSSSAQV ILNGGLIVSF LAILHTIA
|
| |