Gene PHATRDRAFT_50836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50836 
Symbol 
ID7199551 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp359528 
End bp361787 
Gene Length2260 bp 
Protein Length712 aa 
Translation table 
GC content49% 
IMG OID 
Productalpha subunit of glucosidase 
Protein accessionXP_002178760 
Protein GI219115930 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.312978 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGAAT CGGAAGAGCA TCGTAAGCTT TCGGAACAAG ACTTGGACCG GGAGGGACTT 
TGGGAAGAGC ATTTTCAATC GCATCAGGAT AGCAAGCCGT TTGGGCCCAT GAGTGTTGGC
ATGGATATAG ATTTTCCCGA ATCGAATCAT TTATACGGTA TTCCTGAGCA CGCCTCATCG
GCGACTTTAA AAACGACATT TGGGGAAAAT GCTCATTTTA AGGAGCCGTA CCGGCTTTAC
AACTTGGACG TGTTTGAATA CGAATTGGAC GAGACTATGG CACTTTACGG AAATGTGCCT
CTTGTAATCA GCCAGAGCGT CAAAACTGGC ACCACGGGAG TCTTCTGGTT CAACCCTACC
GAGACCTATG TTGACATCGC GTCTTCGTCT GGATCAACGG GAACGCACTG GATCAGCGAA
AGCGGAATCA TTGACGTTTT CTTACTACCA GGACCAGATC CGTCTTCCGT TTACCACCAG
TACGCTACTT TGACGGGAAC GACACCAACA CCACCCATGT TCTCCTTGGG ATACCACCAG
TGTCGCTGGA ATTACAAGGA CGAGAAGGAC GTATACATGG TACACGGCAA GTTCGAAGAG
CTGGACTATC CCTACGATGT ACTTTGGCTG GACATAGAGC ACACAAACGG AAAACGTTAC
TTTACTTGGG ACAGCAGCTT GTTCCCAGAT CCCAAAACCA TGCAGAAAAC TCTGGCCGAT
CAAGGTCGCC GTATGGTGAC GATTGTTGAC CCGCACATTC TCCGCGATAA CAATTATTAT
ATTCATAAAG AGGCGACGGC CAAGGGACTT TACATAAAAG ATAAGCAAGG GGAGAAGGAT
TACGACGGCT GGTGCTGGCC GGGATCCTCG TCCTACCTTG ACTTTACGGA CGAAAACGTT
CGTTCTTGGT GGGCTGACCA ATTTTCATAC TCTCGCTATG AAGGATCCAC GCCAACGTTG
TTTACTTGGA ATGACATGAA TGAACCCTCA GTTTTTAATG GTCCGGAGGT GTCCATGCAA
AAGGACCTTC GAAACTTGCA CGGTGACGAA CATCGCGAGT GGCATAATCT ATACGGCATG
CTTTTCCATC GCGCTACGGG AGAAGGACAC ATTCGTCGTA GTCCGAGCGA AGATATTCGT
CCATTTGTCT TGAGTCGAGC ATTTTTTGCC GGTAGCCAAA AATACGGAGC GATATGGACT
GGCGACAATA CAGCCGATTG GGGACATCTA CAGGTCGCGG GGCCGATGTT GCTAAGTTTG
AATACGGCGG CACTCTCTTT CGTCGGCGCA GATGTTGGTG GTTTTTTCGG TAATCCAGAC
GCTGAGCTGT TTACACGGTG GATGCAGGCT GGTGCGTACC AACCATTCTT TCGGGGTCAT
GCCCATCACG ACTCCAAACG AAGAGAGCCA TGGATGTATG GCGAAGAGAC AATGCAACGG
CTTCGTCAAG CTGCATTGTG GAGGTATCAG CTGCTTCCTT TCTGGTACAC AGTTTTTCAT
GAGGCTGAGA TTTCTGGGAT GCCGGTCATG CGTATGATGT GGATGCAATA TCCCGAAACG
GAGGCAATAT TTGGGGTAGA CGATCAGTAT TTGATTGGGG CCGATCTGCT GGTGAAGCCA
ATTACGGCAG CTGGTGTTTC GGAGGTAGAA GTGCTCTTTC CCACAGACCA TCTTTGGTAT
GATGTTAAAT CCCTGCAAAT CGTTTGCTCT ACAATTGCTT CCATGAGCGT CGAGTCCCGG
ACTATATCGT CAGCAATAGA TGAGATTCCA GTTTTTCAGC GCGGAGGCTC AATCATACCA
CGTAAGCTGC GACTCCGTCG GAGTACCACG ACGATGAAGA CTGATCCCTA CACACTTTTT
GTTGCTTTGG ATGATTCCTA CCAAGCATCG GGTACGCTGT ACATGGATGA CGAAGAAACG
CTGGGCTACA GCAAACGTGC TGAGTGCGCT GATGCGAGCA TTAGCGCTGA TTTGAGTGAA
GGCGCTGGTA CTGTTAGTGC TCATGTGAAC CTCGGTTCCG GATGGTCCAG CGCAGTCCAT
TTCTTAGCTG AGGATAGAAT GATTGAACGT ATTGTGATCA TGGGGTTTCC GAGCACACCA
AAAGCTGTGA GCGTCAATGA AGAGTCGCTT GACTTTAGCT TTAACAAGTC TTCGAACGTT
TTGGTTGTTC GCAAGCCTGA AGTCTCGGCC CTCAGGGACT GGAGCATCAA GATTGTGTGA
AATAAAAACA ACGAATAAAC GTAAAACTCG TTAAGCGTCC
 
Protein sequence
MIESEEHRKL SEQDLDREGL WEEHFQSHQD SKPFGPMSVG MDIDFPESNH LYGIPEHASS 
ATLKTTFGEN AHFKEPYRLY NLDVFEYELD ETMALYGNSV KTGTTGVFWF NPTETYVDIA
SSSGSTGTHW ISESGIIDVF LLPGPDPSSV YHQYATLTGT TPTPPMFSLG YHQCRWNYKD
EKDVYMVHGK FEELDYPYDV LWLDIEHTNG KRYFTWDSSL FPDPKTMQKT LADQGRRMVT
IVDPHILRDN NYYIHKEATA KGLYIKDKQG EKDYDGWCWP GSSSYLDFTD ENVRSWWADQ
FSYSRYEGST PTLFTWNDMN EPSVFNGPEV SMQKDLRNLH GDEHREWHNL YGMLFHRATG
EGHIRRSPSE DIRPFVLSRA FFAGSQKYGA IWTGDNTADW GHLQVAGPML LSLNTAALSF
VGADVGGFFG NPDAELFTRW MQAGAYQPFF RGHAHHDSKR REPWMYGEET MQRLRQAALW
RYQLLPFWYT VFHEAEISGM PVMRMMWMQY PETEAIFGVD DQYLIGADLL VKPITAAGVS
EVEVLFPTDH LWYDVKSLQI VCSTIASMSV ESRTISSAID EIPVFQRGGS IIPRKLRLRR
STTTMKTDPY TLFVALDDSY QASGTLYMDD EETLGYSKRA ECADASISAD LSEGAAEDRM
IERIVIMGFP STPKAVSVNE ESLDFSFNKS SNVLVVRKPE VSALRDWSIK IV