Gene Information |
Locus tag | PHATRDRAFT_50351 |
Symbol | |
ID | 7199179 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | - |
Start bp | 29746 |
End bp | 32841 |
Gene Length | 3096 bp |
Protein Length | 909 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | beta-glucosidase |
Protein accession | XP_002185317 |
Protein GI | 219130323 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00482034 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAATTTCACT TCATTGTTCA ATTGATCCCT GCCATTACTG GGGAAAAGGC GAACTACATT TGTGTGAATA GGAAAAGAAC ATTATTTTAG CACCAAAATT CGCCGAATTT GATTCGCCGG GACGAGACTG TCGGGAAACG GTGAAGATTT CCGAAAGGGA GCGTTCGGAA GCATGTCTCG AGAGGTCAGC AATGGCATAT CTTCAACGGA GCATACCAAA TTACTTACAA TAACAGCTGA AAATGACAGC GCTACAGGTA GTCTCTACTG CCTACGAAAA CAACTAGTTG TGGGAGCTGT AACGACTGCG GCGGTTTTGC TTCTATGTAC TTCCCCTTCA AGTCCACTAT CATTCTTGGA ATGGTCACAG CGAAACAAAA TTGAATCAAG TAAGCCAGTC CGCTTTCCTG AAACTTTTAT TTGGGGGGTT GCAACCAGCA GCTACCAAAT AGAAGGTGCC ATAGACGAAG GTGGCCGCGG CAAAACCATT TGGGATAATT TTTGTCACCA AGGTATCCAT ATCTCGGACA ACTCTACTGG GGATGTTGCT TGCGACCATT ACCATCGTAT GAAGGAAGAT GTTGCAATGA TGAAGCAACT TAATATAGAA GCCTATCGGT TTTCGATTGC GTGGTCTCGA ATACTGCCCA ATGGAACAGG GGGAGTAAAT CAAGCTGGAG TCGATTTCTA CAACGATTTA ATTGATACTC TGGTTGGCCA TGGAATAGAG CCTTGGGTAA CACTGTACCA TTGGGATTTA CCAGAGGCGT TGCAAGTCAA GTACGGCGGA TGGCTAGATC CAAGAATTGT GGATGTGTTT GCTGAGTATG CACAAGTGTG TTTTCTTGCC TTTGGAGACC GCGTGAAAAA CTGGATTACC ATCAATGAGG CGTGGACAGT TTCTGTTAAT GGTTTTTCGA CTGGAATACA CGCCCCGGGG CATCTTTCTT CGACTGAACC GTATCAAGTT GGTCATCATC TTCTGTTGGC TCACTCAAAA GCAGCAAGCA TATACAAATC CTTTTTTCAA CTCCGCCAGA AAGGGAGGAT CGGGATTGCC AATTGCGGTG ACTTCCGATA TCCTCGAACG GATAGACCAG AGGACCGTGA AGCTGCAGAG CGCGCTATGT TATTTCAATT CGGTTGGTTC ACTGATCCAC TTTTGCTTGG TGACTATCCA CCAATCATGC GACAGCTACT TGGGGACAGA TTGCCAAGCT TCACTGAGGA TAATCGAGCC GAACTGGTAA ATTCAACCGA CTTTATTGGG CTGAACTACT ACTCGTCATT TCTTGCTTCA AAGCCCGCTT TTAAAACTGC AGACAATTCG TACTGGGCTG ACATGTATGT AGACTTCTCT GGGGATGCAA AGTGGACAAC AAATGACATG GGTTGGTATG TGGTACCAGA TGGTCTCCGA GAAATGCTTC TCTGGATCTC AAAGCGGTAC AGGAATCCAC TGCTTTTCAT AACAGAGAAT GGTACTGCGG AAAAGGATGA TAATTTGGAA CTTGTTAAAC AAGACGAGAG ACGCAGGGTT TTTTTCGAGT CGCACTTGAG AGCCTGTTAC GATGCTATTG TTCAGGGTGT TAGCCTTGGC GGGTACTTTG CGTGGAGCTT GATGGACAAC TTTGAATGGC AATTTGGATA CACTCGTCGG TTTGGGCTAT GCTCCGTTAA CTTTCAAACC ATGGAGAGGA CACCGAAAAT GTCTGGCCAA TGGTACGGTG CCACAGCTCT AGCCAATGGG GCAAACATTG ATATTGAGAA TGGAGGCAAC AAAAACTGGC AGCACAGGAG 
ACTCTTGCCA GCTTCCAAGT ACGGCAGACG GGTCGAAATA CCTAAAAGGG TTTTAATTGG CTACGGATCC AACATGGATA TGGTTAAGGA GGCTGTCTAT AATGGTGTGA ACATTGTCGT TTGGTCGTTT ATTTCGATAA TTCCCAACCG TGGAGGAGCA TTGCAGAAGG CTCGAGCGAG GAATTTCAAT GTTGTTGGCA GTGTGAGCAA TGCTGGAGCG GTTCTGGTGA CAAAGCTGAA CCTCACAGCT CTTACGTTGC TGATCGAAGA TCTATCACAA AATGGTTTTG GTGACGTCGT CCATTTGGCA AGTATAGGAG GTTGGAACGG TGGTCACTTA TCTCCGCTCG TTTCAGCAAG GGAGTGGTGG ATCACTTTCA GTGAAGCTGC TGGCTTTATT TTTGATGGAA TCGACTGGGA TCTGGAAGGA GACGATTTCC TATCCAGTCC AAGTAACGTG TTCGCAATCG ACTGCTTGGA CAAGATGGGG CACATCAGTC AGCTCGCAAG TGAAGGTAAG CGCTGGAGTG GTTGACTGTC TCCGCATTAA GGTTTGCGTT TGCTACACTC ATTGGCGTGC ACTCCAGAGA ACTACATAGT TACAATGGCA CCGCCTCAAT CCTACTTGGA TGTGGATGGA AACGGACGGT TCAGTCGATA CCTGAATCTG ACTGATCCGA CAAGAAGCTG GCACAACGAA TTTTCTCACT TTGGTGCCAA TGCATACGCA TATCTCCTGG CTAGATTTGG AGACTCCATT GATTTAATTT CTGTTCAGTT TTACGAGAGC TATTCGAGAA TTGCCATGGC TACGTTCAAT TCAGGCACAT CTCCAGCTAT TTCTATAGCG CAATACGTCA CACAGCTGCT TGAAATGGAT TCGAAATATT TGGTGAAATT TTCCTCAGAT CCAAATCTCG TCATAACCAA TCAGCTGGTC TCAATTCCAC TTTCAAAGTT GGTGCTGGGT TTTGCGAACG GATGGGCTCT CGAGGAGGCA AATCTGCAAA AGGTTTTCTT TGCCCCTATT GACCACGTGC AATGGGCTTG GTCGACGCTT TTGACGAAAA ACTGTACTCC AAGAGGATTT ATGTTTTGGA CAATTGATGA GGAGGGTAAA AATGGAATCA AACTAGCTGC TGGCTTGCGG AGAGTCTTGG ACATTAAGCC ATGAGTTCGG ATCTTCACAT ACGTAGGGCA TCGCTGCTGT GAGTTAGGTC TTTTACTGTT AGACGGTGTG AGTTTTCTGC AGGTATCTAC AGTTAGCTTG TTTCATTTTA AGAACTTTTT GTAAGC
|
Protein sequence | MSREVSNGIS STEHTKLLTI TAENDSATGS LYCLRKQLVV GAVTTAAVLL LCTSPSSPLS FLEWSQRNKI ESSKPVRFPE TFIWGVATSS YQIEGAIDEG GRGKTIWDNF CHQGIHISDN STGDVACDHY HRMKEDVAMM KQLNIEAYRF SIAWSRILPN GTGGVNQAGV DFYNDLIDTL VGHGIEPWVT LYHWDLPEAL QVKYGGWLDP RIVDVFAEYA QVCFLAFGDR VKNWITINEA WTVSVNGFST GIHAPGHLSS TEPYQVGHHL LLAHSKAASI YKSFFQLRQK GRIGIANCGD FRYPRTDRPE DREAAERAML FQFGWFTDPL LLGDYPPIMR QLLGDRLPSF TEDNRAELVN STDFIGLNYY SSFLASKPAF KTADNSYWAD MYVDFSGDAK WTTNDMGWYV VPDGLREMLL WISKRYRNPL LFITENGTAE KDDNLELVKQ DERRRVFFES HLRACYDAIV QGVSLGGYFA WSLMDNFEWQ FGYTRRFGLC SVNFQTMERT PKMSGQWYGA TALANGANID IENGGNKNWQ HRRLLPASKY GRRVEIPKRV LIGYGSNMDM VKEAVYNGVN IVVWSFISII PNRGGALQKA RARNFNVVGS VSNAGAVLVT KLNLTALTLL IEDLSQNGFG DVVHLASIGG WNGGHLSPLV SAREWWITFS EAAGFIFDGI DWDLEGDDFL SSPSNVFAID CLDKMGHISQ LASEENYIVT MAPPQSYLDV DGNGRFSRYL NLTDPTRSWH NEFSHFGANA YAYLLARFGD SIDLISVQFY ESYSRIAMAT FNSGTSPAIS IAQYVTQLLE MDSKYLVKFS SDPNLVITNQ LVSIPLSKLV LGFANGWALE EANLQKVFFA PIDHVQWAWS TLLTKNCTPR GFMFWTIDEE GKNGIKLAAG LRRVLDIKP
|
| |
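The length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates, which is consistent with the reported gene length of 3096 bp. The function names (`gene_length`, `coding_bases`, `gc_fraction`) are hypothetical helpers written for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein, excluding the stop codon."""
    return protein_length_aa * 3


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of a sequence.

    Spaces, line breaks, and any other characters are ignored, so a
    sequence pasted in blocks of ten bases can be passed in unchanged.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    return sum(b in "GC" for b in bases) / len(bases)


# Values taken from the record above.
length = gene_length(29746, 32841)   # 3096, matching the Gene Length field
coding = coding_bases(909)           # 2727 bases for a 909 aa protein
non_coding = length - coding         # bases left for stop codon, introns, and untranslated regions
```

The 909 aa protein accounts for 2727 coding bases, fewer than the 3096 bp gene span, which is consistent with the record marking the gene as spliced. Passing the gene sequence string to `gc_fraction` gives a value that can be compared against the reported GC content of 46%.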