Gene PHATRDRAFT_26742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_26742 
Symbol 
ID7200191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp278219 
End bp281888 
Gene Length3670 bp 
Protein Length953 aa 
Translation table 
GC content48% 
IMG OID 
Productbeta-glucosidase 
Protein accessionXP_002179173 
Protein GI219116757 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATCTTCGTG TCGCGCAGCC TATCCTTCCA CGCATTTTTT AGTGCCAAAC CCTGTTTCAT 
CAAAGATACG ATTTGCTGTA TTTACCCAGT TGATAACACT GCGAGAATGG TGGCGGAAAA
CAAATCCAGG CAGATTGATT CTTCCTTCCG GAACTTGAAG AGAAAAAGTC AAATTGAAGA
AATGCTTTCC TCCATGACTA TCGAAGAAAA GATAGGGCAA ATGTCCCAGA TTGATATCAA
CATGATTATC GAAGACGATT CCGAAGGTAA TGGAAAGAAA CGGCTGAACG TGGCCGCCGC
AGAACTTTAC CTCGGTGAAC GTGGAATCGG ATCTGTCCTC AACCTTGTAA GTGGCGCTTC
CTGGACCGCG CAAGATTTTC GTCAGGTCGC CGTGCAATTG AAGGAAATCA GCGACCGCTA
CAACCGTCCA CCCGTAATTT GGGGTCTAGA TAGTGTGCAC GGGGCAAATT ACATTCTTGG
AGCAAGTATT CCACCGCAAC CCATCAACAT GGCCGCAACC TTTAACTTAA CAGTTCCCTA
CCAGGCTGGC ATTATTGCGA GTCGTGATAC AAGGGCTGCC GGTATCACGT GGTTATTCTC
GCCGCTTTTG GGTATTGCTT TGGAGCCACG TTGGAGCCGG GTCTACGAAA CCTTTGGGGA
AGATCCTACG ATGGTTGGTC TCATGGCGGC CCAGATGATT GCCGGCATTC AAAAACCGGA
TTCGAATCCT TTGGCCATCC CTTCCAGGGC AGCCGCTTGC GCTAAACACT TTATTGGCTA
TTCCATGCCG CGCGATGGTC ATGATCGTAG TCCAAGTTGG ATTCCCACTC GCCACTTGTA
CCAATATTTT GTCCCACCGT GGCAGTATGC AATGAAGCAA AACGCACTCA CCGTGATGGA
ATCATACACG GAAACAGACG GTGTACCCAA CGTCGCAAAT CCTCAAGCGC TGAACTATCT
ACTGCGTCAG CGTTTGGGTT TTGATGGCGT TCTTGTGACC GACTATGAGG AGATTCGCAA
TGCAAATACA TGGCACCACA TCGCTGTGAA TGACACACAG GCAACTATCA AGTCGTTATT
GGATGGTAGC GTGGACATGA GTATGATTCC ATGGGACGCG GATGGATTTC GCGATGGAAT
TTTGGCAGGA ATTCAAGGCC ATCGGTTGTT TGAATGGCGA TTGAACCAAT CAACGGAGCG
TGTCCTAAAG CTGAAGGAGA CCCTGAACAT GTATCACGAA AATCTCGCGA TAGAAGATCC
CAATCTTGCA ATGATTGGTT CGGACGAACC GGCGGTCTTG GACATGGCCC AACAATCACT
CATTCTAGCA GAGAATGATG GGTTGTTGCC TTTGAGTTTA AATACTCGGC ACAAGATTCT
CGTGACTGGT CCAACGTCAA GCTCGTTGAT ACATCAGTCT GGGGGTTGGA CTGGTCAGTG
GCAAGGGGCT TTATCTGACG ATTGGTTTGC CCATGGATCA ACAGTATTCG ATGCCTTTTC
ACGGGAAGAG GCCTGGGATG TTTCCTTTAG TTGTGGTGTC AATATTTTGG GCGGTGAATG
CGATGACGAA GTGAGTCGCC AGAAAACTTT TTTCATCGAA GAGATCGAAG AATGGGTCGG
AAGGGGACCT TCAACGTCCA TCGAGCGTGC AGTCAAAGCG GCAGCATCGA AAGACGTTGT
GCTGGTGTGC GTCGGCGAAG AAGCTTACAC CGAGAAACCT GGTGACATAC GGTCTATGGA
GCTACCACAA GGACAATATG AACTGGTGAA AGCGCTCAAA GAGAACTCGG TTGTCAAGAT
TGTACTTGTA TACTTTGGTG GACGTCCAAG GCTGCTAAGG AAAATGGTCG AGCAGGCGGA
CGCAACCATT GTGGCTTTCC TGCCTGGACC GACAGCGGGC GAAGCGCTGA AAAATTTAGT
TAGCGGGCAA ATCAATCCTA GCGGCAAAAT GCCCATTACG TATCCTAAGT ATTCGGACAA
CGGAGGCATC CCCTACTTCC ATTCCGTGTC GGACAAATGC ACTGATGGGT TAGCTGCCAT
GCCACATTTC GATTATGTCC CTTGTGAAGT GCAGTGGTCG TTTGGGCACG GTCTGAGCTA
CACTAGCTTT CAGTATTCTG ATCTAAAGTC ATCCGCTAAA GACGGTAGTG ATTTGATCGT
CTCAGTCAGA ATTAAAAACA CTGGGTCAAC GGGTGGGTCA GAAGCGGTCA TGCTTTTTAC
GTTTGACGAA AATCGACCCA CTACTCCAGA ATACAAGAGA CTTCGGGCTT TTGAAAAAAT
TTGGCTTGCC TCAGGAGAGG AGAGAACAGT CACACTCACA GTCTCCCCTG AGGAGCTCCA
CTTTATTGGC CCCCATAATG ACAAGCACTA CATCAGTGAT CCTGCTATGA GGTTTTGGGT
AGGCATGGGA GCTTCAACGG ATTGTCGCTC AAATCCCGAT TCAGATCTCT GCTCTATTGT
TGAGCCCTCC GTCGGAGAAG AGGCCATGTT CAATAATTGC TGCGATGTGG CGTGTGACTT
GTGGACGAGC AGCCAATGTG CCGACCAATA TGGCTTCGAT CAAGCTTCCT GCTTAGCGCT
TTGTTCTTCA ATCAGTAGCT ACCCGACTAG TGCTACTACC TTGGGCAAAG ACGGCTGGGG
CTGGAACTAT GTCGAATGTC TTGAATCTGT GCTGTGGGGA TTCGAGCAAG TAAATGAATT
ACCACAGTGC TGGAAGATGA CCACATTGTG TCGTGATGTT TTCCAAACAA AAAACTTTGA
CGAATATGGC CTTGGACCCG GAATCACACC ACAAAAGATC CTGCCAAGGG GCGTCACCTG
GATGCCGAAC GCAGTGGCCG TAATAGCAGC TTTGATTTCA TCCTATATGA TGGCTCAGGC
GATGCGTGGT GGATTTTGTA GACGAACGGA TGAATCCAAT GAGAGAGATG CCATTCAATT
TTCGAGAGTA CAGACGGAAC AGGACTAATG TCAAAAGCTC AAAATTGTGT TTCTTGGTAC
TCATGGTGGC ACTATAGAGA AAAATGTGAA AATTTCCTTT GTTGTATTGA AAAAAAGTCA
CGCAGCTTGC TCCAAACGCA TATCAAATAA CCCACAGAAT CTGATGACGA AGCTATTTGG
ATTTCTGTAG AAGAAAGTCA CCAATGGATG AGAGTCCTCG ATTGACGATG CGCATTTTAC
TCGCAGTCAA TATTAAATAC ATTGTTCAAG TAATCTCCAC ATCGTCCTTA GCAGGATTGT
GCTTAAAACG TACACTCCAT TTCAATGTCT TAAAACAGCT GGATGAATGT CAGATGCATC
TTGGAATGAA CGAAGTAAAT CAAGTTCCAT AGTCTTGACG CCACAACCGT CTGAACAAAA
AACAGAGTCA AATCTCGGCT TGTTGCCGCA GAATTGAGCT GCGCAACATG AGTAGGAGGG
CGTGATATTT GCATTTGAAC CGTCGCAGGC TTCACCTTGT AGTTCTGCCT TGGCGTCTTC
GTCACCGACT GCCCTATTTT TCAAGATTGG AGGATCTTCA GCTGGATCGA AAAAGTTGCG
TCTCTGGTTT TCGAAACCTG CCGGGCTTTC TCTAGATGTG TCTCTCATGA GAAAGAGAAG
CATATTGTGC CGGTCTGTAT AGCCTTCGGC ATCTTCAATT GAAACAAGCA GTCGTTCTAG
CTCCTTTGTT
 
Protein sequence
MVAENKSRQI DSSFRNLKRK SQIEEMLSSM TIEEKIGQMS QIDINMIIED DSEGNGKKRL 
NVAAAELYLG ERGIGSVLNL VSGASWTAQD FRQVAVQLKE ISDRYNRPPV IWGLDSVHGA
NYILGASIPP QPINMAATFN LTVPYQAGII ASRDTRAAGI TWLFSPLLGI ALEPRWSRVY
ETFGEDPTMV GLMAAQMIAG IQKPDSNPLA IPSRAAACAK HFIGYSMPRD GHDRSPSWIP
TRHLYQYFVP PWQYAMKQNA LTVMESYTET DGVPNVANPQ ALNYLLRQRL GFDGVLVTDY
EEIRNANTWH HIAVNDTQAT IKSLLDGSVD MSMIPWDADG FRDGILAGIQ GHRLFEWRLN
QSTERVLKLK ETLNMYHENL AIEDPNLAMI GSDEPAVLDM AQQSLILAEN DGLLPLSLNT
RHKILVTGPT SSSLIHQSGG WTGQWQGALS DDWFAHGSTV FDAFSREEAW DVSFSCGVNI
LGGECDDEVS RQKTFFIEEI EEWVGRGPST SIERAVKAAA SKDVVLVCVG EEAYTEKPGD
IRSMELPQGQ YELVKALKEN SVVKIVLVYF GGRPRLLRKM VEQADATIVA FLPGPTAGEA
LKNLVSGQIN PSGKMPITYP KYSDNGGIPY FHSVSDKCTD GLAAMPHFDY VPCEVQWSFG
HGLSYTSFQY SDLKSSAKDG SDLIVSVRIK NTGSTGGSEA VMLFTFDENR PTTPEYKRLR
AFEKIWLASG EERTVTLTVS PEELHFIGPH NDKHYISDPA MRFWVGMGAS TDCRSNPDSD
LCSIVEPSVG EEAMFNNCCD VACDLWTSSQ CADQYGFDQA SCLALCSSIS SYPTSATTLG
KDGWGWNYVE CLESVLWGFE QVNELPQCWK MTTLCRDVFQ TKNFDEYGLG PGITPQKILP
RGVTWMPNAV AVIAALISSY MMAQAMRGGF CRRTDESNER DAIQFSRVQT EQD