Gene PHATRDRAFT_50351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50351 
Symbol 
ID7199179 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011697 
Strand
Start bp29746 
End bp32841 
Gene Length3096 bp 
Protein Length909 aa 
Translation table 
GC content46% 
IMG OID 
Productbeta-glucosidase 
Protein accessionXP_002185317 
Protein GI219130323 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00482034 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAATTTCACT TCATTGTTCA ATTGATCCCT GCCATTACTG GGGAAAAGGC GAACTACATT 
TGTGTGAATA GGAAAAGAAC ATTATTTTAG CACCAAAATT CGCCGAATTT GATTCGCCGG
GACGAGACTG TCGGGAAACG GTGAAGATTT CCGAAAGGGA GCGTTCGGAA GCATGTCTCG
AGAGGTCAGC AATGGCATAT CTTCAACGGA GCATACCAAA TTACTTACAA TAACAGCTGA
AAATGACAGC GCTACAGGTA GTCTCTACTG CCTACGAAAA CAACTAGTTG TGGGAGCTGT
AACGACTGCG GCGGTTTTGC TTCTATGTAC TTCCCCTTCA AGTCCACTAT CATTCTTGGA
ATGGTCACAG CGAAACAAAA TTGAATCAAG TAAGCCAGTC CGCTTTCCTG AAACTTTTAT
TTGGGGGGTT GCAACCAGCA GCTACCAAAT AGAAGGTGCC ATAGACGAAG GTGGCCGCGG
CAAAACCATT TGGGATAATT TTTGTCACCA AGGTATCCAT ATCTCGGACA ACTCTACTGG
GGATGTTGCT TGCGACCATT ACCATCGTAT GAAGGAAGAT GTTGCAATGA TGAAGCAACT
TAATATAGAA GCCTATCGGT TTTCGATTGC GTGGTCTCGA ATACTGCCCA ATGGAACAGG
GGGAGTAAAT CAAGCTGGAG TCGATTTCTA CAACGATTTA ATTGATACTC TGGTTGGCCA
TGGAATAGAG CCTTGGGTAA CACTGTACCA TTGGGATTTA CCAGAGGCGT TGCAAGTCAA
GTACGGCGGA TGGCTAGATC CAAGAATTGT GGATGTGTTT GCTGAGTATG CACAAGTGTG
TTTTCTTGCC TTTGGAGACC GCGTGAAAAA CTGGATTACC ATCAATGAGG CGTGGACAGT
TTCTGTTAAT GGTTTTTCGA CTGGAATACA CGCCCCGGGG CATCTTTCTT CGACTGAACC
GTATCAAGTT GGTCATCATC TTCTGTTGGC TCACTCAAAA GCAGCAAGCA TATACAAATC
CTTTTTTCAA CTCCGCCAGA AAGGGAGGAT CGGGATTGCC AATTGCGGTG ACTTCCGATA
TCCTCGAACG GATAGACCAG AGGACCGTGA AGCTGCAGAG CGCGCTATGT TATTTCAATT
CGGTTGGTTC ACTGATCCAC TTTTGCTTGG TGACTATCCA CCAATCATGC GACAGCTACT
TGGGGACAGA TTGCCAAGCT TCACTGAGGA TAATCGAGCC GAACTGGTAA ATTCAACCGA
CTTTATTGGG CTGAACTACT ACTCGTCATT TCTTGCTTCA AAGCCCGCTT TTAAAACTGC
AGACAATTCG TACTGGGCTG ACATGTATGT AGACTTCTCT GGGGATGCAA AGTGGACAAC
AAATGACATG GGTTGGTATG TGGTACCAGA TGGTCTCCGA GAAATGCTTC TCTGGATCTC
AAAGCGGTAC AGGAATCCAC TGCTTTTCAT AACAGAGAAT GGTACTGCGG AAAAGGATGA
TAATTTGGAA CTTGTTAAAC AAGACGAGAG ACGCAGGGTT TTTTTCGAGT CGCACTTGAG
AGCCTGTTAC GATGCTATTG TTCAGGGTGT TAGCCTTGGC GGGTACTTTG CGTGGAGCTT
GATGGACAAC TTTGAATGGC AATTTGGATA CACTCGTCGG TTTGGGCTAT GCTCCGTTAA
CTTTCAAACC ATGGAGAGGA CACCGAAAAT GTCTGGCCAA TGGTACGGTG CCACAGCTCT
AGCCAATGGG GCAAACATTG ATATTGAGAA TGGAGGCAAC AAAAACTGGC AGCACAGGAG
ACTCTTGCCA GCTTCCAAGT ACGGCAGACG GGTCGAAATA CCTAAAAGGG TTTTAATTGG
CTACGGATCC AACATGGATA TGGTTAAGGA GGCTGTCTAT AATGGTGTGA ACATTGTCGT
TTGGTCGTTT ATTTCGATAA TTCCCAACCG TGGAGGAGCA TTGCAGAAGG CTCGAGCGAG
GAATTTCAAT GTTGTTGGCA GTGTGAGCAA TGCTGGAGCG GTTCTGGTGA CAAAGCTGAA
CCTCACAGCT CTTACGTTGC TGATCGAAGA TCTATCACAA AATGGTTTTG GTGACGTCGT
CCATTTGGCA AGTATAGGAG GTTGGAACGG TGGTCACTTA TCTCCGCTCG TTTCAGCAAG
GGAGTGGTGG ATCACTTTCA GTGAAGCTGC TGGCTTTATT TTTGATGGAA TCGACTGGGA
TCTGGAAGGA GACGATTTCC TATCCAGTCC AAGTAACGTG TTCGCAATCG ACTGCTTGGA
CAAGATGGGG CACATCAGTC AGCTCGCAAG TGAAGGTAAG CGCTGGAGTG GTTGACTGTC
TCCGCATTAA GGTTTGCGTT TGCTACACTC ATTGGCGTGC ACTCCAGAGA ACTACATAGT
TACAATGGCA CCGCCTCAAT CCTACTTGGA TGTGGATGGA AACGGACGGT TCAGTCGATA
CCTGAATCTG ACTGATCCGA CAAGAAGCTG GCACAACGAA TTTTCTCACT TTGGTGCCAA
TGCATACGCA TATCTCCTGG CTAGATTTGG AGACTCCATT GATTTAATTT CTGTTCAGTT
TTACGAGAGC TATTCGAGAA TTGCCATGGC TACGTTCAAT TCAGGCACAT CTCCAGCTAT
TTCTATAGCG CAATACGTCA CACAGCTGCT TGAAATGGAT TCGAAATATT TGGTGAAATT
TTCCTCAGAT CCAAATCTCG TCATAACCAA TCAGCTGGTC TCAATTCCAC TTTCAAAGTT
GGTGCTGGGT TTTGCGAACG GATGGGCTCT CGAGGAGGCA AATCTGCAAA AGGTTTTCTT
TGCCCCTATT GACCACGTGC AATGGGCTTG GTCGACGCTT TTGACGAAAA ACTGTACTCC
AAGAGGATTT ATGTTTTGGA CAATTGATGA GGAGGGTAAA AATGGAATCA AACTAGCTGC
TGGCTTGCGG AGAGTCTTGG ACATTAAGCC ATGAGTTCGG ATCTTCACAT ACGTAGGGCA
TCGCTGCTGT GAGTTAGGTC TTTTACTGTT AGACGGTGTG AGTTTTCTGC AGGTATCTAC
AGTTAGCTTG TTTCATTTTA AGAACTTTTT GTAAGC
 
Protein sequence
MSREVSNGIS STEHTKLLTI TAENDSATGS LYCLRKQLVV GAVTTAAVLL LCTSPSSPLS 
FLEWSQRNKI ESSKPVRFPE TFIWGVATSS YQIEGAIDEG GRGKTIWDNF CHQGIHISDN
STGDVACDHY HRMKEDVAMM KQLNIEAYRF SIAWSRILPN GTGGVNQAGV DFYNDLIDTL
VGHGIEPWVT LYHWDLPEAL QVKYGGWLDP RIVDVFAEYA QVCFLAFGDR VKNWITINEA
WTVSVNGFST GIHAPGHLSS TEPYQVGHHL LLAHSKAASI YKSFFQLRQK GRIGIANCGD
FRYPRTDRPE DREAAERAML FQFGWFTDPL LLGDYPPIMR QLLGDRLPSF TEDNRAELVN
STDFIGLNYY SSFLASKPAF KTADNSYWAD MYVDFSGDAK WTTNDMGWYV VPDGLREMLL
WISKRYRNPL LFITENGTAE KDDNLELVKQ DERRRVFFES HLRACYDAIV QGVSLGGYFA
WSLMDNFEWQ FGYTRRFGLC SVNFQTMERT PKMSGQWYGA TALANGANID IENGGNKNWQ
HRRLLPASKY GRRVEIPKRV LIGYGSNMDM VKEAVYNGVN IVVWSFISII PNRGGALQKA
RARNFNVVGS VSNAGAVLVT KLNLTALTLL IEDLSQNGFG DVVHLASIGG WNGGHLSPLV
SAREWWITFS EAAGFIFDGI DWDLEGDDFL SSPSNVFAID CLDKMGHISQ LASEENYIVT
MAPPQSYLDV DGNGRFSRYL NLTDPTRSWH NEFSHFGANA YAYLLARFGD SIDLISVQFY
ESYSRIAMAT FNSGTSPAIS IAQYVTQLLE MDSKYLVKFS SDPNLVITNQ LVSIPLSKLV
LGFANGWALE EANLQKVFFA PIDHVQWAWS TLLTKNCTPR GFMFWTIDEE GKNGIKLAAG
LRRVLDIKP