Gene PHATRDRAFT_46976 details

Gene Information

Locus tag: PHATRDRAFT_46976
Symbol:
ID: 7202083
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 55129
End bp: 59033
Gene Length: 3905 bp
Protein Length: 918 aa
Translation table:
GC content: 46%
IMG OID:
Product: endo-1,3-beta-glucosidase
Protein accession: XP_002181294
Protein GI: 219121898
COG category:
COG ID:
TIGRFAM ID:
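
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention); the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 55129
end_bp = 59033
protein_length_aa = 918

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Nucleotides needed to encode the protein, not counting a stop codon.
coding_bp = protein_length_aa * 3

# Part of the gene span not accounted for by protein-coding codons.
remaining_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 3905
print(coding_bp)       # 2754
print(remaining_bp)    # 1151
```

Under that assumption the computed span matches the listed gene length of 3905 bp. The 1151 bp not accounted for by codons is consistent with the gene being marked as spliced, although the record itself does not break this remainder down.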


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGACAC CGGACTCTAA AAGCGGCGCT GCAAACCAAC GGCCTCAATA CCACTACTTC 
TACGGCAGCA TTGACGACCA ATCGCAAGTC AACCGCCGGG AAAACGCAGT GTCCGTCCCA
ACTGATCAAG ACTCGTTGGT AGGCGAATCT TTGCTTCCAA CCGACAATGA GCAAGACGAT
GATCGATCAA AAATTGCAAA GCAGGACTTT CAGCAAATTG TCATGTCCAA TGTCAATGGC
GTACGGCCCA TCGACTCGAG TCGAAGTGAA CGGATGGAGC CCAAATCCTG CTTGCGATTC
TTCTCGATAA ACAATTTACG CCGTCAGCAT CCTATGGGTT ATCTTCTTTT GTCTGTGGGT
TCCCTAACAT TGTTGGCTTT CTTGTGCTCG GTTATTTTCT TTTCAAAAGG CCAGTCGGGG
AGAGTGATAA GAACACCCAA GGAAGAGTTG AGCTTTCGCA TGCCGTTTCC TTGGGTGGAT
CGGGCACAGT ACGGCGACCC AGTCTCTAAA GTTCTGGACA AAGACCTTTT TCATCCGAGT
CTCGTATACA AAGGAAAGGA TCCCTCCCGA GTTTTTATCT TTCCCTTTCC AACCGGCGCA
TTTTGGACGA ATTTAGTGTT GCCTACGACC GCGGATCGCG GCTTGTCGTA TCCAATTGTT
GTCTACCCGT ACGCGTATAA ATGGTCCAAG GAGCTGTTAC AAGTGTCTTA TCCAGCAAGT
CATCGACGAG AATTACCTAA GGAGATTCAC GATGACTTTT TCCCTGACCT TACTTTCGCC
AGTGTTGAAT CTACCAGTCA GCGCCATATC ACGGCCTTTG ATCCGTTGAG TGTGACAGTG
CAATACTCTA CTATGAATGG AGGTACTTGG GAATCATATC TTGTTCAAGG TAGTCCGTAT
GTTACCCTCA GATATGATAA TGTGACACCA AAAATTCGGG CACTTTCCAT TTTCAAGAGT
ACGTTTTGTC CGCGAGTCGA GGCGTTAAGG AATGATACAG ATCGCTATTT AAGTTATGGA
GTGTGTTACA GCATTGAAAA TACGGATCCT TCAAAAGAGA GCCAAATGTC GCTCCATGGA
GTTCAGTTTA TACTGGAAAC ACAGGAGGGC ATGAATTGGA TTGTCTTTGC TTCCGAGCCC
GTTACTTTGC AATTCGATAA AATCCAAAAG ACCACAATTA CAGCATCGAG CGCATTCAAA
GGTGTTATCC GGCTCGCCTA TATTCCTCCG TCAGTTTCCA AACCATCCAA CAATGTAATT
GATGTAACAA AGTCAACTGG TCTGCAGCGT TTGATATATC ATGCTGGAGT ATACCCTATT
TCGGGGGAAG TGAGCTGGTC GTTTCGGACG TCGGCGCATA ATTCTGCATT GTCAGCTGCC
ACCAAGTCGT TAAATTTCTT GTCGGGAGCA AAAGGAACAA ACGCCGCTTC GGAGTCAGCT
GTGAAATCCG TTCTGTCGTC TTCTTCCTCT CCTGGTTGCA TTGGTACAAT ACAATTTGAC
TTTGGAGTGA AAGCCTTTGC TCCCGCAACC AGCGCAAGCG CAGCCGATTC GTTACTGATG
CTAGCTCTGC CTCATCATGC GCAGAGTCTT CTTTCAAGCG TGCAACTTCC TGATGAAACG
TTTGATCTTG CTTACAAGTG TATTAAAGGC ACTATGACTC CTATTCTCGG ATCGTCTTGG
GTATACGATG AAAACCTGCC ATCCCTGGGC TTCGATGGAG ACACGGGATC GAATAGCAAC
AAGGCGTACC TTGATCTCAA CATTCGATCC ACCCTCATTG AAAGTCTCGA AAAAGATATG
AACCTGGCCT TACCAACTCT TACCGAGAAT ATTTACGGAT TCGGCAAGCA AATCGCTCGT
CTGTCACAGC TCGCGCACAT TGCGGACGTG CTCCGGACTG GTGGCGTTGA AGTCGATGAA
AATACAAATG TTATGAACAA TAGCAAGCTC TTTGAATCGC AAGAGAAAAG GCTAGATTTG
ATCTTCAACG GGACGCTTTC CCTTTTGACA AGTCGCCTAC AGCAGTTTTT GACGAGCAAC
ATTTCTGACT CACTAGTCTA CGATACAAAT TTTGGCGGAA TGGTGAGTGT CGACGGCCTT
CGAGATTCAA ACAGGGACTT CGGAAACGGC AGATACAACG ATCACCATTT TCATTACGGC
TACATATTGT ATGCGTGTGC AGTGTTGGGA AGACTAGATC GTTCGTTCAT TTTGAAATAC
GGAGACAAGG TCGATGCGTA AGTCCAAACA TCCAAATACT AGGACAACTT TTCTTCGCAA
ACCTTATCCG CTTTTTGTTC CGTTCAGTAT CTTTTATGAT ATCGCACATG ATTCTAATTC
GGCGTCACAG ACGGGCAACG GGGCCTTCTT CCCTCTAGCG CGGCATAAGT CATGGTTTGA
CGGTCACTCC TTTGCGTCTG GTCTCTTTCC GTTTGGAAAT GGAAAGTCGC AAGAGAGTTC
GAGTGAGGCG GTCAATGGTT ACTACGGAGC ATATCTTTGG TCGCTAGTTC GGCATAAAGA
AGCGGAAACT CCAAATGAAA GTACGTCGGA GGGGACAGAC TTCGCCAGAT TGCTTCTGGC
TACAGAAATA CGTGGAGCAC AAACATACTG GCAGATGATA CCTTTAAAGT CAACAAATCA
AACAGAGGTG ACCGGTGTCT ATAGTAATTC GTTTGCTAAA AATTACATGG TCGGAAACCT
TGGAATGCTT GACGTCATAT GCTCAACTTG GTTCGGTACG TCACAGCTCT ACGTGCACAT
GATCAATTTC ATCCCACTTA CGGCGGCTAC TGGAGAGCTA TTCAGTCGAA AATATGTCAC
TGAGGAGTAC ACAAACATAC TCAATGGTCT TGGGGAAGTT GAGATGGCTT GGAGGGGCTT
TGTTGTTGGG GATCACGCAA TCATGGACCC TCAGGCGGCC TGGGAGGAAG CGCAAGGAGT
CTTCAGCGCT GAACTCGACG CCGGACTCAG CAAAAGCCAA TTGCTGTTCT GGATAGCAAA
TCGTAAAGGA TTTTCGCCGT CTATGATTAC ATTATCAGCG GTTGAGAAAA AATCTGAAGC
ATCAACTATT GTGCATGATC GTTTTATAAA GTATTCCTCT TGCCTGAACT ATGATTCATG
CCGCAATGGT GGACTTAGTG GTAATTGTTG TCCAACAGAC GATGGGTTGA TTCTCGATTG
CTGTAATATA TGATAACTTT ACAACGAAGG AGTGGGAGCC AACAGAAGTT TAGATCTCGG
AAAAGTGATG AAAGGGATTA ACAGTTAGAT TAACTTCTTA TCAATGTAGA TTCTATTGGA
ACCGCTATCT GCACATAATT CGAGAAAAAA GTCTTTTTCT GCATGTCTGG GGAGACTTTT
CAAAGTTCGA CGATGTTTGA GAGATCAGCC CTGAAGCTTG CCTGGCGAGG ATTCAAGATA
GTTACCGGCA GCATCAGGGA GGAAGGGTCC GGGCTTCATG GGAGATTCTT CTGAAGACGA
ACATTGAAGC CCAACTGAAG GGTTACTATT GTTGAGCACC GGTGCATCAG CAAGCACATC
GCCTACTTCG GTGAATCCAG TAGATCGGTC AACTTTGATT GCTGGAGAAA ATGATTCTAC
GTCTATTTCT CCTTCGTCTT CTTCAGTATT TTTATTTTTC TGGACGCTGC CGTCTCCACA
AAGCGTAGCA CTCCACGCAA TTCCATCGGC CCTGGCTTGT TGCAGCTCTT TCACAGACTC
AGAACTCCAT GTAAACCCTG ATTCTGAGGA TTTCTCTCTT ATTTTGTTGA GTAAATTTGA
CGTTAATCGA ATCCCAACAC TTACATTGTA CGCATACACG GCAATGATTT TCCACTGACG
AGTCCACCCG AAATCATCTC GTCGATGATC GCGTCGTTGT CTGCAAAGAC TGACGACAAA
CCAAC
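
The gene sequence above is printed in space-separated blocks of 10 bases, 60 per line, so it needs to be cleaned before its length or GC content can be computed. The sketch below shows one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied verbatim from the record.
first_line = "ATGGAGACAC CGGACTCTAA AAGCGGCGCT GCAAACCAAC GGCCTCAATA CCACTACTTC"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applied to the full gene sequence, the same two helpers would give values to compare against the listed gene length and GC content.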
 
Protein sequence
METPDSKSGA ANQRPQYHYF YGSIDDQSQV NRRENAVSVP TDQDSLVGES LLPTDNEQDD 
DRSKIAKQDF QQIVMSNVNG VRPIDSSRSE RMEPKSCLRF FSINNLRRQH PMGYLLLSVG
SLTLLAFLCS VIFFSKGQSG RVIRTPKEEL SFRMPFPWVD RAQYGDPVSK VLDKDLFHPS
LVYKGKDPSR VFIFPFPTGA FWTNLVLPTT ADRGLSYPIV VYPYAYKWSK ELLQVSYPAS
HRRELPKEIH DDFFPDLTFA SVESTSQRHI TAFDPLSVTV QYSTMNGGTW ESYLVQGSPY
VTLRYDNVTP KIRALSIFKT SSAFKGVIRL AYIPPSVSKP SNNVIDVTKS TGLQRLIYHA
GVYPISGEVS WSFRTSAHNS ALSAATKSLN FLSGAKGTNA ASESAVKSVL SSSSSPGCIG
TIQFDFGVKA FAPATSASAA DSLLMLALPH HAQSLLSSVQ LPDETFDLAY KCIKGTMTPI
LGSSWVYDEN LPSLGFDGDT GSNSNKAYLD LNIRSTLIES LEKDMNLALP TLTENIYGFG
KQIARLSQLA HIADVLRTGG VEVDENTNVM NNSKLFESQE KRLDLIFNGT LSLLTSRLQQ
FLTSNISDSL VYDTNFGGMV SVDGLRDSNR DFGNGRYNDH HFHYGYILYA CAVLGRLDRS
FILKYGDKVD ATTFLRKPYP LFVPFSIFYD IAHDSNSASQ TGNGAFFPLA RHKSWFDGHS
FASGLFPFGN GKSQESSSEA VNGYYGAYLW SLVRHKEAET PNEIIRLLKI TWSETLECLT
SYAQLGSLFS RKYVTEEYTN ILNGLGEVEM AWRGFVVGDH AIMDPQAAWE EAQGVFSAEL
DAGLSKSQLL FWIANRKGFS PSMITLSAVE KKSEASTIVH DRFIKYSSCL NYDSCRNGGL
SGNCCPTDDG LILDCCNI