Gene PHATRDRAFT_40017 details

Gene Information

Locus tag: PHATRDRAFT_40017
Symbol: TYR1
ID: 7195493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 605879
End bp: 607519
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table:
GC content: 52%
IMG OID:
Product: tyrosinase
Protein accession: XP_002184029
Protein GI: 219127618
COG category:
COG ID:
TIGRFAM ID:
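
The length fields in this record can be cross-checked from the coordinates. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the record above.
START_BP = 605879
END_BP = 607519


def span_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int, has_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of cds_bp bases.

    Assumes the CDS is a whole number of codons and, when has_stop is
    True, that the final codon is a stop codon (not a residue).
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop else codons


gene_bp = span_length(START_BP, END_BP)
print(gene_bp)                  # 1641
print(protein_length(gene_bp))  # 546
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields. Because the gene is listed as unspliced, the gene span and the coding sequence are treated as the same length here.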


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.645981
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGCTCAA TGCTTCTCGT ACTCTTCCTT GAATGCCATT CGACGACTGG TAGGACAATG 
GCTGCAACAA CAAGAAAGGA AAGTGCACAA ACCAACGATA CCGTGCGACC ATTGGGCGCC
AACGCATTTT GTAAAAACGG GGAAAAGACA AGGATTCGAC GTGACTGGGA CTTGATCTCT
CGTGACGACC AGGACCTCTA CATCGAGGCG ATCGAAGAAG CAATAGATCG AGAATTATAC
CAGGCTTTCT TGGGCTACCA TGCGGACTCT GTAGCGTTAG TACAATCGCA CGAAACATGT
GGTTTCGCAT TGTGGCATCG AACCTTTCTG TTAGCTTTCG AAAACATGTT GCGGTCGCTG
GCACCACGGT TTGCTTGTCT CACTATCCCT TATTGGAATG TGATGGAGCA TTCTAGTGAT
CAAGCCAGAG GCTTGTGCAC CAGCTATCGA ACCTGCTCCA GAATAGTAGG AGACCTTGGT
GGTAGTCCTG TCGCGGCAGC AGCGACCCGT CTGTATGCCG GCATTGAGGC TACTGGAGAT
TTGATTACCG GTCGTCCAAT TCGCAATCTC CATGATGACA ACAACGCTAC TGGCATAGTG
CGCGATGACT TATTCTGGGT CACTCTGCCT GCGAGTACAG CGTACGATAG CGTTCTCGAT
ATCCTGGTTA CTAGTCCGTC CTACGTTCAA TACACACGTC GGATCCAGGA AACCATTCAC
GACGATGTTC ACGACACGTT GGGTGGCTTC ATGCCCACAT ACTCAAGCCC CACTGATCCA
CTCTTTATGC CGTGGCACAG CTTTATCGAT CTTGCTTTGT TCATGTGGGA AGCTTGCTAC
TTGGATCCAT CGGAACAAGC TTCCGGTACG CGTCTTGCTG CAGACTGGGC GTTTGAGGGC
GCCGGATCGA ACTGCTCCCG AAGTGGTCGC AGTAAGGTCC TGTTCCCCAT CCTCAACGCC
ACGAGCGAGC TATATCTTAT GAGGGGTGAC TTTCATGTGC TTGAAGATCC CTTGATTGGA
ATCTATTTTG CCGATATTGG AATTCTTTTC TCGGATGTGG CGTCTATTCG TGACTTGGGA
GAGGACTTTG ATTTCACCTA CGATCACGTG ACGGAGCGGA TTTGGAATGT CCTGCAAGAT
CCGTCTCAAT GTCCGTCCTC GGGATCCTGG ACAGCCTTTC CGACTCCGTC GCCGACAACG
GCATCCCCCG TGGTAGGATC ACCAAACGAT GCCGATGCTC GTAGCGAATG GCTGGCTGGG
ATCCGTCAGC GCCTCGAGGA AATGTTTGCC GAAACACACC CGGGATACGT TGCCCAATAC
ATGTCTTACT TCACCTGTGT CACGCAGGAT GAGACAAGTC TCTCTGTATA CACAGAGGAT
CCGGGCGAGT ACTTGGTTGA TGTGTTGAAT GGCAACGCGA TTATTAGAGC GCGTTGCGCC
TTTTTCCTTC CCGAAACAGA GTCCGTTACG AATGAGGAAT TGCCCACGTC TTCGACTCCT
GTGGCTGCGC CCTCTGAGAA CAGGGACCAG TTTGCAGGCG ACGACGAGGA TGATGACCAG
ACTAGTACCG CCCTAAGAGC ACCTCAACCG TACTTTGGCA TTGGGTTGGT CAGCTGCTTG
GTGCTACTCG TTGTGGACTA G
 
Protein sequence
MCSMLLVLFL ECHSTTGRTM AATTRKESAQ TNDTVRPLGA NAFCKNGEKT RIRRDWDLIS 
RDDQDLYIEA IEEAIDRELY QAFLGYHADS VALVQSHETC GFALWHRTFL LAFENMLRSL
APRFACLTIP YWNVMEHSSD QARGLCTSYR TCSRIVGDLG GSPVAAAATR LYAGIEATGD
LITGRPIRNL HDDNNATGIV RDDLFWVTLP ASTAYDSVLD ILVTSPSYVQ YTRRIQETIH
DDVHDTLGGF MPTYSSPTDP LFMPWHSFID LALFMWEACY LDPSEQASGT RLAADWAFEG
AGSNCSRSGR SKVLFPILNA TSELYLMRGD FHVLEDPLIG IYFADIGILF SDVASIRDLG
EDFDFTYDHV TERIWNVLQD PSQCPSSGSW TAFPTPSPTT ASPVVGSPND ADARSEWLAG
IRQRLEEMFA ETHPGYVAQY MSYFTCVTQD ETSLSVYTED PGEYLVDVLN GNAIIRARCA
FFLPETESVT NEELPTSSTP VAAPSENRDQ FAGDDEDDDQ TSTALRAPQP YFGIGLVSCL
VLLVVD
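
The sequences above are printed in blocks of ten characters, so the spaces and line breaks need to be removed before any analysis. The sketch below shows one way to do that, then compute a GC fraction and translate the reading frame. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is empty. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(raw: str) -> str:
    """Remove the spaces/newlines used for display and upper-case the bases."""
    seq = "".join(raw.split()).upper()
    bad = set(seq) - set("ACGT")
    if bad:
        raise ValueError(f"unexpected characters: {sorted(bad)}")
    return seq


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate frame 1 codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied as displayed in the record.
FIRST_LINE = "ATGTGCTCAA TGCTTCTCGT ACTCTTCCTT GAATGCCATT CGACGACTGG TAGGACAATG"

dna = clean_sequence(FIRST_LINE)
print(len(dna))        # 60
print(translate(dna))  # MCSMLLVLFLECHSTTGRTM
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the reported GC content of 52%.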