Gene Information |
Locus tag | PHATRDRAFT_26742 |
Symbol | |
ID | 7200191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 278219 |
End bp | 281888 |
Gene Length | 3670 bp |
Protein Length | 953 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | beta-glucosidase |
Protein accession | XP_002179173 |
Protein GI | 219116757 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
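The reported gene length can be checked against the start and end coordinates listed above. The sketch below assumes the coordinates are 1-based and inclusive, an assumption that matches the reported 3670 bp but is not stated in the record. The function name `gene_length` is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    # Both endpoints are counted, hence the +1 (assumption: inclusive coordinates).
    return end_bp - start_bp + 1


# Coordinates taken from the record: Start bp 278219, End bp 281888.
length = gene_length(278219, 281888)
print(length)
```

Under this assumption the result agrees with the "Gene Length" field of the record.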
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCTTCGTG TCGCGCAGCC TATCCTTCCA CGCATTTTTT AGTGCCAAAC CCTGTTTCAT CAAAGATACG ATTTGCTGTA TTTACCCAGT TGATAACACT GCGAGAATGG TGGCGGAAAA CAAATCCAGG CAGATTGATT CTTCCTTCCG GAACTTGAAG AGAAAAAGTC AAATTGAAGA AATGCTTTCC TCCATGACTA TCGAAGAAAA GATAGGGCAA ATGTCCCAGA TTGATATCAA CATGATTATC GAAGACGATT CCGAAGGTAA TGGAAAGAAA CGGCTGAACG TGGCCGCCGC AGAACTTTAC CTCGGTGAAC GTGGAATCGG ATCTGTCCTC AACCTTGTAA GTGGCGCTTC CTGGACCGCG CAAGATTTTC GTCAGGTCGC CGTGCAATTG AAGGAAATCA GCGACCGCTA CAACCGTCCA CCCGTAATTT GGGGTCTAGA TAGTGTGCAC GGGGCAAATT ACATTCTTGG AGCAAGTATT CCACCGCAAC CCATCAACAT GGCCGCAACC TTTAACTTAA CAGTTCCCTA CCAGGCTGGC ATTATTGCGA GTCGTGATAC AAGGGCTGCC GGTATCACGT GGTTATTCTC GCCGCTTTTG GGTATTGCTT TGGAGCCACG TTGGAGCCGG GTCTACGAAA CCTTTGGGGA AGATCCTACG ATGGTTGGTC TCATGGCGGC CCAGATGATT GCCGGCATTC AAAAACCGGA TTCGAATCCT TTGGCCATCC CTTCCAGGGC AGCCGCTTGC GCTAAACACT TTATTGGCTA TTCCATGCCG CGCGATGGTC ATGATCGTAG TCCAAGTTGG ATTCCCACTC GCCACTTGTA CCAATATTTT GTCCCACCGT GGCAGTATGC AATGAAGCAA AACGCACTCA CCGTGATGGA ATCATACACG GAAACAGACG GTGTACCCAA CGTCGCAAAT CCTCAAGCGC TGAACTATCT ACTGCGTCAG CGTTTGGGTT TTGATGGCGT TCTTGTGACC GACTATGAGG AGATTCGCAA TGCAAATACA TGGCACCACA TCGCTGTGAA TGACACACAG GCAACTATCA AGTCGTTATT GGATGGTAGC GTGGACATGA GTATGATTCC ATGGGACGCG GATGGATTTC GCGATGGAAT TTTGGCAGGA ATTCAAGGCC ATCGGTTGTT TGAATGGCGA TTGAACCAAT CAACGGAGCG TGTCCTAAAG CTGAAGGAGA CCCTGAACAT GTATCACGAA AATCTCGCGA TAGAAGATCC CAATCTTGCA ATGATTGGTT CGGACGAACC GGCGGTCTTG GACATGGCCC AACAATCACT CATTCTAGCA GAGAATGATG GGTTGTTGCC TTTGAGTTTA AATACTCGGC ACAAGATTCT CGTGACTGGT CCAACGTCAA GCTCGTTGAT ACATCAGTCT GGGGGTTGGA CTGGTCAGTG GCAAGGGGCT TTATCTGACG ATTGGTTTGC CCATGGATCA ACAGTATTCG ATGCCTTTTC ACGGGAAGAG GCCTGGGATG TTTCCTTTAG TTGTGGTGTC AATATTTTGG GCGGTGAATG CGATGACGAA GTGAGTCGCC AGAAAACTTT TTTCATCGAA GAGATCGAAG AATGGGTCGG AAGGGGACCT TCAACGTCCA TCGAGCGTGC AGTCAAAGCG GCAGCATCGA AAGACGTTGT GCTGGTGTGC GTCGGCGAAG AAGCTTACAC CGAGAAACCT GGTGACATAC GGTCTATGGA GCTACCACAA GGACAATATG AACTGGTGAA AGCGCTCAAA GAGAACTCGG TTGTCAAGAT 
TGTACTTGTA TACTTTGGTG GACGTCCAAG GCTGCTAAGG AAAATGGTCG AGCAGGCGGA CGCAACCATT GTGGCTTTCC TGCCTGGACC GACAGCGGGC GAAGCGCTGA AAAATTTAGT TAGCGGGCAA ATCAATCCTA GCGGCAAAAT GCCCATTACG TATCCTAAGT ATTCGGACAA CGGAGGCATC CCCTACTTCC ATTCCGTGTC GGACAAATGC ACTGATGGGT TAGCTGCCAT GCCACATTTC GATTATGTCC CTTGTGAAGT GCAGTGGTCG TTTGGGCACG GTCTGAGCTA CACTAGCTTT CAGTATTCTG ATCTAAAGTC ATCCGCTAAA GACGGTAGTG ATTTGATCGT CTCAGTCAGA ATTAAAAACA CTGGGTCAAC GGGTGGGTCA GAAGCGGTCA TGCTTTTTAC GTTTGACGAA AATCGACCCA CTACTCCAGA ATACAAGAGA CTTCGGGCTT TTGAAAAAAT TTGGCTTGCC TCAGGAGAGG AGAGAACAGT CACACTCACA GTCTCCCCTG AGGAGCTCCA CTTTATTGGC CCCCATAATG ACAAGCACTA CATCAGTGAT CCTGCTATGA GGTTTTGGGT AGGCATGGGA GCTTCAACGG ATTGTCGCTC AAATCCCGAT TCAGATCTCT GCTCTATTGT TGAGCCCTCC GTCGGAGAAG AGGCCATGTT CAATAATTGC TGCGATGTGG CGTGTGACTT GTGGACGAGC AGCCAATGTG CCGACCAATA TGGCTTCGAT CAAGCTTCCT GCTTAGCGCT TTGTTCTTCA ATCAGTAGCT ACCCGACTAG TGCTACTACC TTGGGCAAAG ACGGCTGGGG CTGGAACTAT GTCGAATGTC TTGAATCTGT GCTGTGGGGA TTCGAGCAAG TAAATGAATT ACCACAGTGC TGGAAGATGA CCACATTGTG TCGTGATGTT TTCCAAACAA AAAACTTTGA CGAATATGGC CTTGGACCCG GAATCACACC ACAAAAGATC CTGCCAAGGG GCGTCACCTG GATGCCGAAC GCAGTGGCCG TAATAGCAGC TTTGATTTCA TCCTATATGA TGGCTCAGGC GATGCGTGGT GGATTTTGTA GACGAACGGA TGAATCCAAT GAGAGAGATG CCATTCAATT TTCGAGAGTA CAGACGGAAC AGGACTAATG TCAAAAGCTC AAAATTGTGT TTCTTGGTAC TCATGGTGGC ACTATAGAGA AAAATGTGAA AATTTCCTTT GTTGTATTGA AAAAAAGTCA CGCAGCTTGC TCCAAACGCA TATCAAATAA CCCACAGAAT CTGATGACGA AGCTATTTGG ATTTCTGTAG AAGAAAGTCA CCAATGGATG AGAGTCCTCG ATTGACGATG CGCATTTTAC TCGCAGTCAA TATTAAATAC ATTGTTCAAG TAATCTCCAC ATCGTCCTTA GCAGGATTGT GCTTAAAACG TACACTCCAT TTCAATGTCT TAAAACAGCT GGATGAATGT CAGATGCATC TTGGAATGAA CGAAGTAAAT CAAGTTCCAT AGTCTTGACG CCACAACCGT CTGAACAAAA AACAGAGTCA AATCTCGGCT TGTTGCCGCA GAATTGAGCT GCGCAACATG AGTAGGAGGG CGTGATATTT GCATTTGAAC CGTCGCAGGC TTCACCTTGT AGTTCTGCCT TGGCGTCTTC GTCACCGACT GCCCTATTTT TCAAGATTGG AGGATCTTCA GCTGGATCGA AAAAGTTGCG TCTCTGGTTT TCGAAACCTG CCGGGCTTTC TCTAGATGTG TCTCTCATGA GAAAGAGAAG CATATTGTGC 
CGGTCTGTAT AGCCTTCGGC ATCTTCAATT GAAACAAGCA GTCGTTCTAG CTCCTTTGTT
|
Protein sequence | MVAENKSRQI DSSFRNLKRK SQIEEMLSSM TIEEKIGQMS QIDINMIIED DSEGNGKKRL NVAAAELYLG ERGIGSVLNL VSGASWTAQD FRQVAVQLKE ISDRYNRPPV IWGLDSVHGA NYILGASIPP QPINMAATFN LTVPYQAGII ASRDTRAAGI TWLFSPLLGI ALEPRWSRVY ETFGEDPTMV GLMAAQMIAG IQKPDSNPLA IPSRAAACAK HFIGYSMPRD GHDRSPSWIP TRHLYQYFVP PWQYAMKQNA LTVMESYTET DGVPNVANPQ ALNYLLRQRL GFDGVLVTDY EEIRNANTWH HIAVNDTQAT IKSLLDGSVD MSMIPWDADG FRDGILAGIQ GHRLFEWRLN QSTERVLKLK ETLNMYHENL AIEDPNLAMI GSDEPAVLDM AQQSLILAEN DGLLPLSLNT RHKILVTGPT SSSLIHQSGG WTGQWQGALS DDWFAHGSTV FDAFSREEAW DVSFSCGVNI LGGECDDEVS RQKTFFIEEI EEWVGRGPST SIERAVKAAA SKDVVLVCVG EEAYTEKPGD IRSMELPQGQ YELVKALKEN SVVKIVLVYF GGRPRLLRKM VEQADATIVA FLPGPTAGEA LKNLVSGQIN PSGKMPITYP KYSDNGGIPY FHSVSDKCTD GLAAMPHFDY VPCEVQWSFG HGLSYTSFQY SDLKSSAKDG SDLIVSVRIK NTGSTGGSEA VMLFTFDENR PTTPEYKRLR AFEKIWLASG EERTVTLTVS PEELHFIGPH NDKHYISDPA MRFWVGMGAS TDCRSNPDSD LCSIVEPSVG EEAMFNNCCD VACDLWTSSQ CADQYGFDQA SCLALCSSIS SYPTSATTLG KDGWGWNYVE CLESVLWGFE QVNELPQCWK MTTLCRDVFQ TKNFDEYGLG PGITPQKILP RGVTWMPNAV AVIAALISSY MMAQAMRGGF CRRTDESNER DAIQFSRVQT EQD
|
| |