Gene PHATRDRAFT_49793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49793 
Symbol 
ID7198460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp320447 
End bp323259 
Gene Length2813 bp 
Protein Length874 aa 
Translation table 
GC content52% 
IMG OID 
Productbeta-glucosidase 
Protein accessionXP_002184524 
Protein GI219128657 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAATA CGAACAAGGC GGAATCACAC CAAACACTCC GTACCGTTAC CGACGATGAT 
TCCGTGACCA TGACTGCCTC GCCGGATGTC CAGTACCACC AAACCAACTC CACGGATCCG
GACGTCGAAT GCCCCCCGTC TACTCCAACA TACGTCTCGG TGCATGACTT GACTTTGGAG
GAAAAACTCT TGCTGCTTTC CGGTCAAACC TTGTGGTTGT TGCCGGATTT GCCCCGCTTC
AATTTACCTT CTCTCACTGT CGCCGACGGT CCCCACGGCG TGCGCAAGCC CGTCAAAGAG
CTGTCGCTAC AGGAAGCGCT ACCCGCCACG TGCTTTCCCA CCGCCGCCGC TCTCTCCTGC
AGTTGGGACG TGGATTTCTT GCGCCAAGTG GGTATCGCAC TCGGCAACGA ATGCGCGCAC
TACCAGGTCG CCGTCCTCCT CGGCCCGGGC ATGAACCTCA AGCGCCACGG CGGCGGCGGG
CGCAATTTCG AATACTTTTC CGAAGACCCC CTCGTTTCGG CCAAACTCGC CACGGCCTAT
GTACAAGGCG TCCAAGCCAA CGGACGCGTC GGCGCCTGCA TTAAACACTT TGCCGTCAAC
AATCAGGAAT CACACCGATT CGTCGTGGAC GCTGTCGTCG ACGAACGCAC CGCACGCGAA
CTCTACTACC GCGGTTTCGA AGCCGTGGTC CGCGACGCGC AGCCCGCCAC TATCATGTGC
GCCTATAACA AAATTAACGG CGTGTACTGT AGCGAAAACG AATTCTTGAA CACGCAGTTG
CTCCGCGACG AGTGGGGCTT CCAGGGCGTG GTGATAACGG ACTGGGGTGC TACCAACGAT
CGGCCGGCCG CCATTGCCGC CGGCATGGAT TTGGAAATGC CCGGATCGCA CGGAGCCCAC
GGCAGGGAAA TACGGCGCGC CCTTCGGGAA GGGACGGTTT TGCGCATGGA ACACGTCGAC
GCCTGCGCCC AACGTATGCT AAATCTCATG TGCCGGTACA AAGAATCGGT TCGGGATACC
TACGAATTGT CGAGCTGGCA TGATCAGCAT AAGTTGGCCA AGCAAGTGGC TATGCAGTGC
GCCGTCTTGT TGCAGAACCA GGGCAATTTG TTGCCGCTGA AACAAGGAAC GTCGGTAGCC
GTTATTGGAG ACTTTGCCAA GGAACATCCC CGCTACCAAG GCATGGGGAG TTCGCAGGTC
TGCACTAATT CTGTCGTAAC GGCGTACGAT GAGTTGTTTC GTCATACGAA GGACGTTTTC
TTTGCACCCG GATACCATGC CGATGACGAC CATATTGAAG CCGTGAACGA GGAATTATTG
GCCGAAGCGG TGAGGGTGGC GCAGCAAGCC GAAGTCGTCT TGTTGTGCCT GGGACTTCCG
GAAATTATGG AGTCCGAGGG TTTTGACCGT TTACACTTGA ATATCCCTGC ACAACACAAT
GCTCTAGTGG ACGCGGTTAG CAAAGTGAAC AGTAACGTGA TTGTGATGTT GAGCAATGGC
GGAGCGATTG AGATCCCGTG GGCTGACAAG GTCAAAGCTA TTTTTGAAGG CTACCTCTTG
GGTGAAACCG GTGGAGCCGC CACGGTAGAT TTGATTTTCG GTGTGCAATC GCCCTGCGGC
AAACTCGCCG AAACCTTTCC AATTGTCCAA GAGGACATAC TCGCAGACCG GTACTTTCCG
GGCAGTCGCG ATCGTGTGGA GTATCGAGAA GGTTTGGATG TCGGCTACCG TTACTTTGAC
ACCGCCCAAA AAGACGTTCG TTTTCCCTTT GGGCACGGAT TGACGTACAC GACCTTTGAA
TACGGCAATC TTAATGTGCA AGTCAATCGC GACGATGCTA CATCCAAATC TGTGCATGTC
TCGTTCGACT TAACCAATAC TGGCGCAGTA GCTGCCAAGG AGGTTGTGCA ATGCTATATT
CATCAAGACT CGCCGTCGGT TTATCGACCT GTTCACGAGC TCAAGTATTT TTGCAAAATA
CATTTGGAGC CTCAGCAAAG TAAACAGGTA GAATTTGATC TCCTTACCGA TGCGTTTTCG
TTTTACGACA TTGGAGTCTC AGATTGGACG GTAGAAGCTG GTGGTTTTGA AATACGCATT
GCGTCGAGCA GTCGAGACAT TCGCTTGGAA GCGCCTGTAG TATTTGCGGA AGGGCGTGGG
CCGAGTGATC TGGCGAAGGA AACCTATCCT CCCGTTGCTG GAGGTGGCAC ACTCAGTCAA
GTGGACGACG AAACATTCGC TAAGCGGTTT GCGAAGAGAA AAGAATTTGT ATTGGCAGAA
TGCGTGGCGT CTGCCGAATC TAGTACGGTC TCAAGAGTTG GCGGTTTTCA TCGGAATTCG
CTACTTAAGG AAGTAGCAAG TCGCAGACTT ATGGGCAAGC TCTTACTGTC AGTTGTTCTA
TCCGCGGCAG CAAAAGAGGT CAAGAAAGGA CCTACCCGAA AAAGGCAAAA ACGTATGGTT
CGAGCCAATG TGGAAAATCT CCCGCTTCGA ACGCTGGTGC TATTCAGTAA AGGTGTACTG
AGTTTTGAAC TGCTGGATGC CTGTATTGCA GCCATGAACT ACCAGGTATT TCGTGCCATC
GGAGGCTTTG GTTTGGCTTT TGCCTGTTTA TTTAAACGAA ACTAAATTGT ACAAACCGTG
CTCTTGACAG TGAGGTTTTT CGTCAAGTAA ACTCTTCTGC CTATTATTGG TTCCGCTGTC
ACTGTGAAGA CGCTTCCATC AGGCTCTGTC TCCGTTAGAG CAAACGAATT CCAAAAAAAG
TATGGTCACT TCCAAAATCC CTGAAATAGC AGATAAGGAC GGATATGTCA TTT
 
Protein sequence
MSNTNKAESH QTLRTVTDDD SVTMTASPDV QYHQTNSTDP DVECPPSTPT YVSVHDLTLE 
EKLLLLSGQT LWLLPDLPRF NLPSLTVADG PHGVRKPVKE LSLQEALPAT CFPTAAALSC
SWDVDFLRQV GIALGNECAH YQVAVLLGPG MNLKRHGGGG RNFEYFSEDP LVSAKLATAY
VQGVQANGRV GACIKHFAVN NQESHRFVVD AVVDERTARE LYYRGFEAVV RDAQPATIMC
AYNKINGVYC SENEFLNTQL LRDEWGFQGV VITDWGATND RPAAIAAGMD LEMPGSHGAH
GREIRRALRE GTVLRMEHVD ACAQRMLNLM CRYKESVRDT YELSSWHDQH KLAKQVAMQC
AVLLQNQGNL LPLKQGTSVA VIGDFAKEHP RYQGMGSSQV CTNSVVTAYD ELFRHTKDVF
FAPGYHADDD HIEAVNEELL AEAVRVAQQA EVVLLCLGLP EIMESEGFDR LHLNIPAQHN
ALVDAVSKVN SNVIVMLSNG GAIEIPWADK VKAIFEGYLL GETGGAATVD LIFGVQSPCG
KLAETFPIVQ EDILADRYFP GSRDRVEYRE GLDVGYRYFD TAQKDVRFPF GHGLTYTTFE
YGNLNVQVNR DDATSKSVHV SFDLTNTGAV AAKEVVQCYI HQDSPSVYRP VHELKYFCKI
HLEPQQSKQV EFDLLTDAFS FYDIGVSDWT VEAGGFEIRI ASSSRDIRLE APVVFAEGRG
PSDLAKETYP PVAGGGTLSQ VDDETFAKRF AKRKEFVLAE CVASAESSTV SRVGGFHRNS
LLKEVASRRL MGKLLLSVVL SAAAKEVKKG PTRKRQKRMV RANVENLPLR TLVLFSKGVL
SFELLDACIA AMNYQVFRAI GGFGLAFACL FKRN