Gene PHATRDRAFT_35668 details

Gene Information

Locus tag: PHATRDRAFT_35668
Symbol:
ID: 7201098
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011676
Strand:
Start bp: 481169
End bp: 483249
Gene Length: 2081 bp
Protein Length: 682 aa
Translation table:
GC content: 50%
IMG OID:
Product: beta-xylosidase
Protein accession: XP_002180246
Protein GI: 219118959
COG category:
COG ID:
TIGRFAM ID:
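
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (the record does not state its convention) and that the coding sequence ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def span_length(start_bp, end_bp):
    """Length in bp of a span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa):
    """Bases needed to encode a protein plus one stop codon (assumed)."""
    return (protein_aa + 1) * 3


# Values taken from the Gene Information fields of this record.
gene_len = span_length(481169, 483249)
cds_len = coding_length(682)
non_coding = gene_len - cds_len

print(gene_len, cds_len, non_coding)  # prints: 2081 2049 32
```

Under these assumptions the span matches the reported gene length of 2081 bp, and 32 bp of it are not accounted for by translated codons, which is consistent with the record flagging the gene as spliced.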


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.12421
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTTCTT GCTTCAGGAT ATTGACTATA TGCACGTCGT ATTTCTTCTT ACTGCTTGCG 
CTCTGGGCAT CGTACTCCAT TTTTGAAGGC TCTGTTCGGC TTCAAGGTCC AATCGGGCTC
GACCGCAACG TCAGCCCCGT CAACAACTAT ATTTGGAAAG ACAAAGTCCC GAATTTTTGG
GGATGCCAGA ATGACGTAGC CAAAGCACTG CCTTATTGCG ACATGTCGCT CTCAATTGAC
GAGAGACTGG AGGATTTGCT CTCACACTTG ACCCTGGATG AGAAAGTTGA CATGATCGGC
GCCGACCCTA CGCAAGACGT TTGTATGACT CATACTATGA ATGTGTCTCG CATAGGCTTA
CCAGACTACT ACTGGCTCGT GGAAACAAAT ACGGCCGTTG GATCGGCATG TATTGCCGAA
AACAAATGTG CGACTGAGTT TTCGGGCCCG TTATCGATCG CCGCTTCTTT CAATCGATCA
TCCTGGTTTC TCAAAGGTAG CGTTTTTGGC ACCGAGCAAA GGGCGCTGAT GAATGTCCAT
GGCGAACGAT TTCATACCCA TAGCGGCCGA CATATTGGTC TGACAGCGTT CGGCCCAAAT
ATCAATCAAC AACGCGATCC GAGGTTCGGG CGCTCATCGG AGTTGCCGGG GGAAGACCCG
TTTCTGTCGG GGCAGTACGC CGCGCACATG GTACAGGGTA TGCAAGAGCG AGATGCCAAC
GGATATCCTA AAGTTTTGGC GTATCTGAAG CATTTTACGG CGTACAGCCG AGAGGAAGGG
CGCGGGAACG ACGACTACAA TATTTCGATG TACGATCTGT TTGATACATA TTTGCCCCAG
TACGAAATGG GCATGGTCCA AGGCGGAGCC ACCGGAGTTA TGTGCTCGTA CAATGCTGTC
AATGGTATTC CCGCGTGTGC CAATGACTAT TTACTCAATA AAATTTTGCG GCAACGCTGG
AATCGTTCCG ATGCGCACGT GACGACCGAC TGTGGGGCGG TGAACAATCT GCGTGGCAAA
CCAATCCAGG CGGCCGATGA AGCGCAAGCT GCCGCAATGG CACTCATGAA TGGCGCGGAT
ATTGAGATGG GATCAACCTT ATTTGTACAC AATCTCACTA CTGCTATAAC ACTGGGATAT
GCGACCGAAG AAGCAGTCAA TCAAGCTATT CGTCGTTCAT ATCGTCCTCA TTTTATTGCG
GGTCGCTTCG ATGATCCTAC CTTGAGCGAA TGGTTCAGTC TAGGGCTAGA CGACATTCAG
TCAAAAAAGC ACCAGGAGAT CCAATTGGAA GCAGCACTTC AAGGACTAGT TTTGCTGAAA
CATGAGGACA GCATTTTGCC TATTGCTGCG GGCACTAAAT TGGCGGTTCT AGGTCCATTA
GGAATGACGC GGTCCGGCCT GATGAGCGAC TACGAAAGCG ACCAAAGCTG TTTTGGTGGC
GGGCATGATT GCATACCAAC GTTGGCCGAG TCAATCGGAT TCATAAATGG AAAGGAGTTC
ACCGTTGCAG CTGCTGGTGT CGATGTGGAC TCTCGCAATA CATCGGATGT TGAGAGAATC
TTGCAGCTTG CCGCTGACAG GGATCTTATA GTGCTTTGTC TCGGGAACAC AAAAACTCAG
GAGCAAGAAG GATTTGATCG CAAGGACACA GCTTTGCCGG GTCAACAATA CGCCTTGTTT
GAGGCCGTAC TTACTCTTCG CAAACCTGTA GTTCTTGTTT TGGTAAATGG TGGCCAGATC
GCGCTTGACG GAATGACCGG ATACCCTTCG GCTATCATTG AAGCCTTCAA TCCCAACGGT
ATTGGCGGGA CTGCCTTAGC TGCGTCTCTA TTTGGTCAAG AGAATCGCTG GGGGAAACTT
CCGTACACAA TATATCCGTA CAGTGTGATG CAGTCGTTCG ACATGAAAGA CCATAGCATG
TCAGCCCCGC CGGGCAGGAC GTATCGATAT TTCACGGGAA AAGCAACGTA TCCATTCGGA
TACGGTCTTT CACTCACAGC ATTCGAGACA TCATGCTTTC ATCAAAGGAT CAGCGATTCC
TCGATTCTAC TGGAATGTAC TGTCTGGAAC ACTGGAAATA G
 
Protein sequence
MPSCFRILTI CTSYFFLLLA LWASYSIFEG SVRLQGPIGL DRNVSPVNNY IWKDKVPNFW 
GCQNDVAKAL PYCDMSLSID ERLEDLLSHL TLDEKVDMIG ADPTQDVCMT HTMNVSRIGL
PDYYWLVETN TAVGSACIAE NKCATEFSGP LSIAASFNRS SWFLKGSVFG TEQRALMNVH
GERFHTHSGR HIGLTAFGPN INQQRDPRFG RSSELPGEDP FLSGQYAAHM VQGMQERDAN
GYPKVLAYLK HFTAYSREEG RGNDDYNISM YDLFDTYLPQ YEMGMVQGGA TGVMCSYNAV
NGIPACANDY LLNKILRQRW NRSDAHVTTD CGAVNNLRGK PIQAADEAQA AAMALMNGAD
IEMGSTLFVH NLTTAITLGY ATEEAVNQAI RRSYRPHFIA GRFDDPTLSE WFSLGLDDIQ
SKKHQEIQLE AALQGLVLLK HEDSILPIAA GTKLAVLGPL GMTRSGLMSD YESDQSCFGG
GHDCIPTLAE SIGFINGKEF TVAAAGVDVD SRNTSDVERI LQLAADRDLI VLCLGNTKTQ
EQEGFDRKDT ALPGQQYALF EAVLTLRKPV VLVLVNGGQI ALDGMTGYPS AIIEAFNPNG
IGGTALAASL FGQENRWGKL PYTIYPYSVM QSFDMKDHSM SAPPGRTYRY FTGKATIRDI
MLSSKDQRFL DSTGMYCLEH WK
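
The sequences above are printed in space-separated groups of ten residues, so they need to be joined before any length or composition check. The following is a minimal sketch in Python of one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block):
    """Join a block printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGCCTTCTT GCTTCAGGAT ATTGACTATA TGCACGTCGT ATTTCTTCTT ACTGCTTGCG"
seq = clean_sequence(first_line)

print(len(seq))  # prints: 60
print(round(gc_percent(seq), 1))
```

Applied to the full gene sequence, the cleaned length should equal the reported gene length of 2081 bp, and the GC percentage can be compared with the reported GC content of 50%. That comparison has not been run here.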