Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46976 |
Symbol | |
ID | 7202083 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 55129 |
End bp | 59033 |
Gene Length | 3905 bp |
Protein Length | 918 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | endo-1,3-beta-glucosidase |
Protein accession | XP_002181294 |
Protein GI | 219121898 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGACAC CGGACTCTAA AAGCGGCGCT GCAAACCAAC GGCCTCAATA CCACTACTTC TACGGCAGCA TTGACGACCA ATCGCAAGTC AACCGCCGGG AAAACGCAGT GTCCGTCCCA ACTGATCAAG ACTCGTTGGT AGGCGAATCT TTGCTTCCAA CCGACAATGA GCAAGACGAT GATCGATCAA AAATTGCAAA GCAGGACTTT CAGCAAATTG TCATGTCCAA TGTCAATGGC GTACGGCCCA TCGACTCGAG TCGAAGTGAA CGGATGGAGC CCAAATCCTG CTTGCGATTC TTCTCGATAA ACAATTTACG CCGTCAGCAT CCTATGGGTT ATCTTCTTTT GTCTGTGGGT TCCCTAACAT TGTTGGCTTT CTTGTGCTCG GTTATTTTCT TTTCAAAAGG CCAGTCGGGG AGAGTGATAA GAACACCCAA GGAAGAGTTG AGCTTTCGCA TGCCGTTTCC TTGGGTGGAT CGGGCACAGT ACGGCGACCC AGTCTCTAAA GTTCTGGACA AAGACCTTTT TCATCCGAGT CTCGTATACA AAGGAAAGGA TCCCTCCCGA GTTTTTATCT TTCCCTTTCC AACCGGCGCA TTTTGGACGA ATTTAGTGTT GCCTACGACC GCGGATCGCG GCTTGTCGTA TCCAATTGTT GTCTACCCGT ACGCGTATAA ATGGTCCAAG GAGCTGTTAC AAGTGTCTTA TCCAGCAAGT CATCGACGAG AATTACCTAA GGAGATTCAC GATGACTTTT TCCCTGACCT TACTTTCGCC AGTGTTGAAT CTACCAGTCA GCGCCATATC ACGGCCTTTG ATCCGTTGAG TGTGACAGTG CAATACTCTA CTATGAATGG AGGTACTTGG GAATCATATC TTGTTCAAGG TAGTCCGTAT GTTACCCTCA GATATGATAA TGTGACACCA AAAATTCGGG CACTTTCCAT TTTCAAGAGT ACGTTTTGTC CGCGAGTCGA GGCGTTAAGG AATGATACAG ATCGCTATTT AAGTTATGGA GTGTGTTACA GCATTGAAAA TACGGATCCT TCAAAAGAGA GCCAAATGTC GCTCCATGGA GTTCAGTTTA TACTGGAAAC ACAGGAGGGC ATGAATTGGA TTGTCTTTGC TTCCGAGCCC GTTACTTTGC AATTCGATAA AATCCAAAAG ACCACAATTA CAGCATCGAG CGCATTCAAA GGTGTTATCC GGCTCGCCTA TATTCCTCCG TCAGTTTCCA AACCATCCAA CAATGTAATT GATGTAACAA AGTCAACTGG TCTGCAGCGT TTGATATATC ATGCTGGAGT ATACCCTATT TCGGGGGAAG TGAGCTGGTC GTTTCGGACG TCGGCGCATA ATTCTGCATT GTCAGCTGCC ACCAAGTCGT TAAATTTCTT GTCGGGAGCA AAAGGAACAA ACGCCGCTTC GGAGTCAGCT GTGAAATCCG TTCTGTCGTC TTCTTCCTCT CCTGGTTGCA TTGGTACAAT ACAATTTGAC TTTGGAGTGA AAGCCTTTGC TCCCGCAACC AGCGCAAGCG CAGCCGATTC GTTACTGATG CTAGCTCTGC CTCATCATGC GCAGAGTCTT CTTTCAAGCG TGCAACTTCC TGATGAAACG TTTGATCTTG CTTACAAGTG TATTAAAGGC ACTATGACTC CTATTCTCGG ATCGTCTTGG GTATACGATG AAAACCTGCC ATCCCTGGGC TTCGATGGAG ACACGGGATC GAATAGCAAC AAGGCGTACC TTGATCTCAA CATTCGATCC ACCCTCATTG AAAGTCTCGA AAAAGATATG AACCTGGCCT TACCAACTCT TACCGAGAAT ATTTACGGAT TCGGCAAGCA AATCGCTCGT CTGTCACAGC TCGCGCACAT TGCGGACGTG CTCCGGACTG GTGGCGTTGA AGTCGATGAA AATACAAATG TTATGAACAA TAGCAAGCTC TTTGAATCGC AAGAGAAAAG GCTAGATTTG ATCTTCAACG GGACGCTTTC CCTTTTGACA AGTCGCCTAC AGCAGTTTTT GACGAGCAAC ATTTCTGACT CACTAGTCTA CGATACAAAT TTTGGCGGAA TGGTGAGTGT CGACGGCCTT CGAGATTCAA ACAGGGACTT CGGAAACGGC AGATACAACG ATCACCATTT TCATTACGGC TACATATTGT ATGCGTGTGC AGTGTTGGGA AGACTAGATC GTTCGTTCAT TTTGAAATAC GGAGACAAGG TCGATGCGTA AGTCCAAACA TCCAAATACT AGGACAACTT TTCTTCGCAA ACCTTATCCG CTTTTTGTTC CGTTCAGTAT CTTTTATGAT ATCGCACATG ATTCTAATTC GGCGTCACAG ACGGGCAACG GGGCCTTCTT CCCTCTAGCG CGGCATAAGT CATGGTTTGA CGGTCACTCC TTTGCGTCTG GTCTCTTTCC GTTTGGAAAT GGAAAGTCGC AAGAGAGTTC GAGTGAGGCG GTCAATGGTT ACTACGGAGC ATATCTTTGG TCGCTAGTTC GGCATAAAGA AGCGGAAACT CCAAATGAAA GTACGTCGGA GGGGACAGAC TTCGCCAGAT TGCTTCTGGC TACAGAAATA CGTGGAGCAC AAACATACTG GCAGATGATA CCTTTAAAGT CAACAAATCA AACAGAGGTG ACCGGTGTCT ATAGTAATTC GTTTGCTAAA AATTACATGG TCGGAAACCT TGGAATGCTT GACGTCATAT GCTCAACTTG GTTCGGTACG TCACAGCTCT ACGTGCACAT GATCAATTTC ATCCCACTTA CGGCGGCTAC TGGAGAGCTA TTCAGTCGAA AATATGTCAC TGAGGAGTAC ACAAACATAC TCAATGGTCT TGGGGAAGTT GAGATGGCTT GGAGGGGCTT TGTTGTTGGG GATCACGCAA TCATGGACCC TCAGGCGGCC TGGGAGGAAG CGCAAGGAGT CTTCAGCGCT GAACTCGACG CCGGACTCAG CAAAAGCCAA TTGCTGTTCT GGATAGCAAA TCGTAAAGGA TTTTCGCCGT CTATGATTAC ATTATCAGCG GTTGAGAAAA AATCTGAAGC ATCAACTATT GTGCATGATC GTTTTATAAA GTATTCCTCT TGCCTGAACT ATGATTCATG CCGCAATGGT GGACTTAGTG GTAATTGTTG TCCAACAGAC GATGGGTTGA TTCTCGATTG CTGTAATATA TGATAACTTT ACAACGAAGG AGTGGGAGCC AACAGAAGTT TAGATCTCGG AAAAGTGATG AAAGGGATTA ACAGTTAGAT TAACTTCTTA TCAATGTAGA TTCTATTGGA ACCGCTATCT GCACATAATT CGAGAAAAAA GTCTTTTTCT GCATGTCTGG GGAGACTTTT CAAAGTTCGA CGATGTTTGA GAGATCAGCC CTGAAGCTTG CCTGGCGAGG ATTCAAGATA GTTACCGGCA GCATCAGGGA GGAAGGGTCC GGGCTTCATG GGAGATTCTT CTGAAGACGA ACATTGAAGC CCAACTGAAG GGTTACTATT GTTGAGCACC GGTGCATCAG CAAGCACATC GCCTACTTCG GTGAATCCAG TAGATCGGTC AACTTTGATT GCTGGAGAAA ATGATTCTAC GTCTATTTCT CCTTCGTCTT CTTCAGTATT TTTATTTTTC TGGACGCTGC CGTCTCCACA AAGCGTAGCA CTCCACGCAA TTCCATCGGC CCTGGCTTGT TGCAGCTCTT TCACAGACTC AGAACTCCAT GTAAACCCTG ATTCTGAGGA TTTCTCTCTT ATTTTGTTGA GTAAATTTGA CGTTAATCGA ATCCCAACAC TTACATTGTA CGCATACACG GCAATGATTT TCCACTGACG AGTCCACCCG AAATCATCTC GTCGATGATC GCGTCGTTGT CTGCAAAGAC TGACGACAAA CCAAC
|
Protein sequence | METPDSKSGA ANQRPQYHYF YGSIDDQSQV NRRENAVSVP TDQDSLVGES LLPTDNEQDD DRSKIAKQDF QQIVMSNVNG VRPIDSSRSE RMEPKSCLRF FSINNLRRQH PMGYLLLSVG SLTLLAFLCS VIFFSKGQSG RVIRTPKEEL SFRMPFPWVD RAQYGDPVSK VLDKDLFHPS LVYKGKDPSR VFIFPFPTGA FWTNLVLPTT ADRGLSYPIV VYPYAYKWSK ELLQVSYPAS HRRELPKEIH DDFFPDLTFA SVESTSQRHI TAFDPLSVTV QYSTMNGGTW ESYLVQGSPY VTLRYDNVTP KIRALSIFKT SSAFKGVIRL AYIPPSVSKP SNNVIDVTKS TGLQRLIYHA GVYPISGEVS WSFRTSAHNS ALSAATKSLN FLSGAKGTNA ASESAVKSVL SSSSSPGCIG TIQFDFGVKA FAPATSASAA DSLLMLALPH HAQSLLSSVQ LPDETFDLAY KCIKGTMTPI LGSSWVYDEN LPSLGFDGDT GSNSNKAYLD LNIRSTLIES LEKDMNLALP TLTENIYGFG KQIARLSQLA HIADVLRTGG VEVDENTNVM NNSKLFESQE KRLDLIFNGT LSLLTSRLQQ FLTSNISDSL VYDTNFGGMV SVDGLRDSNR DFGNGRYNDH HFHYGYILYA CAVLGRLDRS FILKYGDKVD ATTFLRKPYP LFVPFSIFYD IAHDSNSASQ TGNGAFFPLA RHKSWFDGHS FASGLFPFGN GKSQESSSEA VNGYYGAYLW SLVRHKEAET PNEIIRLLKI TWSETLECLT SYAQLGSLFS RKYVTEEYTN ILNGLGEVEM AWRGFVVGDH AIMDPQAAWE EAQGVFSAEL DAGLSKSQLL FWIANRKGFS PSMITLSAVE KKSEASTIVH DRFIKYSSCL NYDSCRNGGL SGNCCPTDDG LILDCCNI
|
| |