Gene PHATRDRAFT_54681 details

Gene Information

Locus tag: PHATRDRAFT_54681
Symbol:
ID: 7202246
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 184668
End bp: 187993
Gene Length: 3326 bp
Protein Length: 1028 aa
Translation table:
GC content: 50%
IMG OID:
Product: endo-1,3-beta-glucosidase
Protein accession: XP_002181321
Protein GI: 219121954
COG category:
COG ID:
TIGRFAM ID:
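
The reported gene length can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the values in this record); the function name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive start/end coordinates.

    Assumption: both endpoints are counted, so a feature that starts and
    ends on the same base has length 1.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Gene Information fields of this record.
length = gene_length(184668, 187993)
print(length)  # 3326, matching the reported Gene Length
```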


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAAAGTGAAG CAGTCCAATA GTTCTTGGGA GGACTGTCCA TGTCGATTCG TCTCTTCTCT 
ACCGCCTTAC TAGCTGCTTG CTTAGCAAAG GCAACTGCCC AAACTTGCCC CACTCTTATC
TGGAGTGACG AGTTTGACGG TTCCAATCTC AACGCCGACA ACTGGACTCC GCAAATTGGT
GATGGTTGCG ATATCTCCGT AGACCTCTGT GGTTGGGGAA ATAACGAAGC TCAGTTTTAC
CGTAGCCAAA ATATTGCAGT GGCGGATGGT TCACTCAAAA TCAACGCCTT GCGACAAACT
TTCGGGGAGA AAGAGTTTAC GTCCGCACGC ATCCGTTCCT TCGAGAAGTT TGACATTGAT
TTGACCCAGC CTTACGTTCG CATCGAAGCG CGAATCAAAG TCCCCCTTGG TCAAGGCTTA
TGGGGTGCAT TCTGGATGAT GCCTTCTCCG GAAGTGCTGT GGCCCAAAGG TGGAGAGATT
GGTACGTATT TATCACTATG TTGTGTGTAG CTGATAAGAT TCGGCCACAC TTATAGTTTC
TTCTGTATTC TGATGCAGAT ATTATGGAGT TTATTGGCAG AGAACCCAAT TCAGTAAGTA
CCGAAGACTA GAACACGACC ACAACGATTC GCTTTTTCAT AGCCTAACCC ATAACGTCCT
TGGACATAAA ATGCAGGCTT ATGGAGTCAT TCACTACGGT AACTTGTTTC GAGACAAGTC
TGAGTTGGGA GGTCCACTGA AAATGCCTAC CAGTCCTAGC GAAAATTTTC ATATTTACAC
AATCGAGAAG ACACCAAACC GTATCGCCTG GTTGGTAGAT GGCTTTGAGT ATCAGTCGTA
TACAAATTCC GACATTCAGC CCAAGTACAG CTGGCCATTC GAGCAGACTT ACCATTTGAT
ACTCAATTTG GCTGTCGGAG GAAATTGGCC AGGATATCCG GATCGCGATC AAGCTGTGGA
TGTGATAATG GAGGTGGATT ACGTTCGTGT CTACGATATG TCAGGAGGAT CTATCGGAAG
CATCACCGGA GAGTCCCTGG TACAGATCAA TGGAGCCGAT GGGCTCTATT GTATTGACGA
TAGTGACGTT TTGTTCACCG ATATTGCTTG GACAGTCCCG ACAGGCTCGA GTTTTACACC
CTCCTTGGAT GATGGAAACT GCATCATTGT CGCCTTCGGT TCGGTCTCAG GGTATATTCA
AGCTGTCGGC CAGACCGCTG ACTGTGGTCC CCTAAGCTAC AGTATGCCAG TGGAAGTTCA
GCCCCTCTAC CAGAAAGAAT TCGCCGTGGT GCCAATTGAG GAAGGCAGCA CTACTATCGG
TGCTTCAACA GGAAGTCAGA GCTTCCTCCT GATCGACGGT ATCCCAACAT TTCAATACGA
TCGGTTAGCT TCTGACTTGT ACGATAACAT CATCATCGGT ACCTCTGATG TCTCCGACGC
CAGTCTCTAT GTGTCAGAGC AGAAGAAGTT TTACATGGAC ATATCATCGC CAACGGCTGC
CACTTGCACT CGAGTTATAA TTCAGTTTGA AGATACCACC ACAGCATTAC CGGACAACTA
TCCAATCGGG CGACATAGTC GATACGTTGG ATACCTAGAC GATTCAGAGA GCTTCCAGCG
CGTCGAGTTT ACATACTATG ATCGCCCGGA CACAAGCGTG GCTGACAACG CCGTCAGTCA
GCTGGCCGTT TTGATTGATC CATTTCTGTT CCGCGGTGAC CGATATCTGA TCCGCAACTT
TGACAGCTCT TCTGCAGGCT GCTCCAGCAA TTGCGAACCC CTTTCGACCA ATGCCTGCCG
TACATACGCC AAGTCCGAAC AAGGAATGTG TACAGATGGC GAAAACAACG ATCGTGAGGG
ATACAATGGC GACGATCAAA TAGACTGTGA GGATGCGGAT TGCTACGGAC TAGACCCTGC
CTGTCCTCTG AGAGAGGGAC GGGCAACTGA AGTGCCTACG ATGCTGAGCC CTACATTGGC
GCCCACTGGG TTGCCCACGG GCAACGCCTC TGACCCTCCC TCAATAGTGC CTTCGAGTGA
AACTGCTTTC CCGACCGCCA CTGGGACTGT TAGAGTGACT AGTCGCTCAT CGGACATCCC
GTCCTCGAAA CCAAATATCA TGCCCATAAC CTCCGGGCCT ACGAGCAGCG CTGCTCCGTC
TTATTCCAAC GACGAAGCAG AGTGTGACGC GCAGCCGCGA TGCGCTGCGT TGGATCTTGC
GGGAGTGTGT TGCCCTACGA TTGACGCAGT CTATCTTGAT TGCTGCGACA ACCGGCCGAT
CGATCCAAAC TCAGAAGCAG GTTTCTGCAA CGACGGAATT GACAACGACA ATGATGGTTT
GTTTGACTGC GAAGATCCTG ACTGTGCGAA CGACGAGATG TGTCGCGCCG ATTGCGTCGC
CATTGGAACT TGCGCTGGGC TTTCAGGTCA GTGCTGTCCA ACGATTGACA GCATTTTTTT
GGATTGCTGC GATGCAGCCG TTCTTTCGGC GGGATACTCT GTTGATTTTA GCCTGATCGA
TCCGGCTACA ATCGAGGAAC AGGCTGCATA CTGGCTTTCG ACTTCACCCT ACACTGTTCC
AGGTGTTGCT CCGGACGGCT CGACGCCGGT AGTGGATTAT GCACGCGACG AAGCTACTTT
GTACGATATG ATTTACTACC AAACAGGTTC TATTGAAAAA GCATCGGAAT ATGTTATTGG
TCGTCGAAAG ATATACATGG ATGTTTGGAC GGACGCGCCT GAGTGCACTC AGATAATTCT
TCAGTTTGAT AGCTTGCCGA CCGCAGTGGC CGACAATTAT CCAACGGGCC GCCACAGTCG
ATACGTGGCA TTTACCACTA GGAGAAACGA ATGGGAACGT ATAGCATTGG ACTTTTGGGA
CCGCCCAGAT GGTGATCTGG ATGACACGGT TATCAATACC ATTGCTCTCT TCTTCGACCC
AGGCACTCTA ACCTCTCACC AATTTTACTT CCGAAACATG GACAGTACCC TCTCAGGCTG
CCGAGAGAAC TGTGAGGTTG CCGCAATAGA TGACTACTGT TTGGCTCTGA GCTCGGGGGA
GAGTGGATTC TGCGGGGATA GTCTCGATCA GGATGGGGAC GGACTGATTG ACTGCGTGGA
TCCAGACTGC ACACTGGACG AGGCCTGCTC CGCACAACTA TCCATCAGCT ATTCAGCCTT
GCAAGACAAT TCATTCCGCA CCGCTGCCTC CAGTAGTTCT GCACAAGTTA TCTTGAACGG
TGGTTTGATT GTGTCTTTTC TGGCGATCCT CCATACCATA GCTTAATCGG ACGTACCTTC
ACGTAGAAAA AAATATTTTT GGTTTG
 
Protein sequence
MSIRLFSTAL LAACLAKATA QTCPTLIWSD EFDGSNLNAD NWTPQIGDGC DISVDLCGWG 
NNEAQFYRSQ NIAVADGSLK INALRQTFGE KEFTSARIRS FEKFDIDLTQ PYVRIEARIK
VPLGQGLWGA FWMMPSPEVL WPKGGEIDIM EFIGREPNSA YGVIHYGNLF RDKSELGGPL
KMPTSPSENF HIYTIEKTPN RIAWLVDGFE YQSYTNSDIQ PKYSWPFEQT YHLILNLAVG
GNWPGYPDRD QAVDVIMEVD YVRVYDMSGG SIGSITGESL VQINGADGLY CIDDSDVLFT
DIAWTVPTGS SFTPSLDDGN CIIVAFGSVS GYIQAVGQTA DCGPLSYSMP VEVQPLYQKE
FAVVPIEEGS TTIGASTGSQ SFLLIDGIPT FQYDRLASDL YDNIIIGTSD VSDASLYVSE
QKKFYMDISS PTAATCTRVI IQFEDTTTAL PDNYPIGRHS RYVGYLDDSE SFQRVEFTYY
DRPDTSVADN AVSQLAVLID PFLFRGDRYL IRNFDSSSAG CSSNCEPLST NACRTYAKSE
QGMCTDGENN DREGYNGDDQ IDCEDADCYG LDPACPLREG RATEVPTMLS PTLAPTGLPT
GNASDPPSIV PSSETAFPTA TGTVRVTSRS SDIPSSKPNI MPITSGPTSS AAPSYSNDEA
ECDAQPRCAA LDLAGVCCPT IDAVYLDCCD NRPIDPNSEA GFCNDGIDND NDGLFDCEDP
DCANDEMCRA DCVAIGTCAG LSGQCCPTID SIFLDCCDAA VLSAGYSVDF SLIDPATIEE
QAAYWLSTSP YTVPGVAPDG STPVVDYARD EATLYDMIYY QTGSIEKASE YVIGRRKIYM
DVWTDAPECT QIILQFDSLP TAVADNYPTG RHSRYVAFTT RRNEWERIAL DFWDRPDGDL
DDTVINTIAL FFDPGTLTSH QFYFRNMDST LSGCRENCEV AAIDDYCLAL SSGESGFCGD
SLDQDGDGLI DCVDPDCTLD EACSAQLSIS YSALQDNSFR TAASSSSAQV ILNGGLIVSF
LAILHTIA
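
The sequences above are displayed in space-separated groups of 10 residues, 60 per line, so the grouping whitespace has to be removed before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction for a DNA string. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the page itself derives its GC content figure is not stated in the record.

```python
def clean_sequence(block: str) -> str:
    """Drop the spaces and line breaks used to group residues for display."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (display whitespace ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence from this record.
first_line = "CAAAGTGAAG CAGTCCAATA GTTCTTGGGA GGACTGTCCA TGTCGATTCG TCTCTTCTCT"
print(len(clean_sequence(first_line)))  # 60 bases per displayed line
```

Applying `clean_sequence` to the full gene or protein block and taking `len` of the result gives a value that can be compared with the Gene Length and Protein Length fields.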