Gene PHATRDRAFT_50598 details

Gene Information

Locus tag: PHATRDRAFT_50598
Symbol:
ID: 7199439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011700
Strand:
Start bp: 31411
End bp: 34514
Gene Length: 3104 bp
Protein Length: 1008 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002185554
Protein GI: 219130822
COG category:
COG ID:
TIGRFAM ID:
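
The length fields can be cross-checked against the coordinates with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length is consistent with that reading) and that the coding region ends with a single stop codon. `span_length` is a hypothetical helper name, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 31411
END_BP = 34514
PROTEIN_LENGTH_AA = 1008


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


gene_length = span_length(START_BP, END_BP)

# Bases needed to encode the protein plus one stop codon (assumption).
coding_bases = (PROTEIN_LENGTH_AA + 1) * 3

# Bases in the displayed gene span that the protein does not account for.
leader_bases = gene_length - coding_bases

print(gene_length, coding_bases, leader_bases)  # 3104 3027 77
```

Under these assumptions the 3104 bp span splits into 3027 coding bases and 77 bases that lie outside the translated region.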


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.732398
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AGCGTGTACA GGTTAACAAC AGTAACTAAG CTTTGCAACA TTGTCACTGT AGCGTTTCAC 
TGTCACTGTC ACGGATCATG TCTGATTCTC CGTCAACCAG GAACGCCGAC GATCTGGAAG
TTCCTTTGGT AGACGATCGT TCCTATCGCT ACTGGACTCT TCCCGCGACG TCCACTGCCG
ACGTCGTCAC AACGGATCGT CCCGGACTCC GCGTCCTGTT GGTTCACGAT GATTCCGTCG
ACAAGGGTGC TGCTGCGGTA GACGTGGCGG TCGGACAGTT CCAGGACGGT GACTTGCCGG
GCCTCGCACA CTTGACGGAA CACATGCTCT TCCTCGGCAC GCAACGCTTT CCGCAAGAAA
ACGCTCTGGA CAGTTTCCTC GCCGCACACG GGGGACACTC CAACGCCTAC ACGGATCTGG
AACACACCGT GTACTACATG GATGTGCAAG CGGCACAGTT GGAACCCGCA CTCGATCGAT
TCGGTTCCTG CTTCGAAGCA CCGCTCCTAC TCGAGAACTG CGTCGCCCGT GAATTGCAAG
CCGTCGACAG TGAACACGGC AAAAACAAAC AGTCCGATTT CTGGCGGTAC CATCAACTCA
CCAAAACACT TCTGGGACAG CACAATAGTC ACGTCTATCA ACAATTCGGG ACGGGCAATC
TAGAGAGCTT GCAACCCCAA GGAACGGCCG TTTTGCGGCA AGCCGTACAC GACTTTTATC
AGCGTTACTA CCACACCGCT CGTATGACCC TCTGTGTCCT TGGCAATCAG GATCTTGACG
TGCTGCAAGG ATGGGTGGAA AAGTATTTTG GCAGCTTGCC CAGTCAGCCG AGTGACACCT
TGGTGGAACC ACCCGTGCCG CCGTTGACAC CGGTCCTCCC ACAACGCGTC CACGTCGTTC
CGACACGAGA AACCAACGTA CTCGAATTGC AATGGTGCCT CCGGGAAATA CAATCTCTCT
ACCGGTCCAA GCCTACCCGA ATACTATCGC ACTTGTTAGG GCATGAAGGC CCCGGCAGTT
TATTGGCCGT CCTACGGGAA CGACTCTGGG TGCAGGAATT GTACGCCGAT GACTCCAGCA
AAACTACCTC CGCCTTTAGT ATATTCTGCG TACAACTCGA ACTCACCGTG CTAGGATGGG
AACACGTTAA CGACGTCGTG GCCACGGTGT ATCGGTATAT TGGACTGTTG CAGAACGAGA
TTCCCGCCTG GGTTGCGGAC GAATTGCAAA CCACCGCGTC TACGCAGTTT CGATTCTTGT
CCAAAAGCTC ACCCTCCGAC ACCGTGTCCA GAGTTGCACA CCAAATGCAA GAGTTTGCGA
TAGCGCACGT ACTGTCGGGA CCGTACCTAG TCTACGAGCA CGACATGGCT GCCGTCCAAT
CCTGCCTCGC CAGTTTGCAC GTCGACAATA TGCTCGTACT TGTGGCCTCC AAGGAGTATA
CTGGACAGAC CACCGCGACC GATCCGTGGT ACGGTACCCA GTATGCCACG GTTGCGCTGG
AACCAGACGC GTTGGAAGCG TGGCGTCAAG CGCGCAGCGC TGCGACGGAT GGTAGCGGTG
TCGATTTCAT CGGTCTACAT CTCCCTGATC GCAACGACAT GCTTGCTACC GATTTTGAGC
TCAAAACGTC TCCCTACGCC GTCTTTGCCA AAACGAACAC GAACGACAGC AATGGCGACA
ACGGCAACGT TCCACCCCCG CCCCGTTGCT TATTGGACAC AGATACGTGT CGCCTTTGGT
ACAAACCTGA TACAGAATTC CGCATGCCCA AGGTCAACAT CATGTGTGTC TTGCGTAGTG
CTACGGCCTA CGAAAGCGTG ACACAGTCTG TCTTGGCATC GTTGTGGTCG GAAACTGCAG
ACGAACTTTG CAACGTGTTT TCGTACGCCG CGTCCATGGC TGGCCTGCAT TGCAACTTTT
CCAATACGCG GAATGGTATG GAACTCCACC TGTCCGGCTA TCACGACAAA GCTCACGTTT
TGCTGCAACG AATTGTGGAC ACGGTTCGGG ACTTTCGGGT AACGCCGGAT TTGTTTGAAC
GTATTCAATC AAAATTGGAA CAGCAGTTTC AGGAATTCTT GGTAGCACAG CCGTATCAAC
ACGCCATTTA CGCTGGCGAT TTGTGTTTGG AAACACCCAA ATGGGACATT CACGACCGGT
TGCAGTGTCT CGCTTCGCTG ACTTTAAATG ACCTTCAGCA CTTTGGTCGT CACATTCTGG
CTCGGTTTCA ACTCGAAATG CTGGTCCACG GGAACGTGAC CGCGTCCGAA GCGGTTCAAC
TATCGGATAT TGTTTTGCTC GGTTGGCGAC CTCAAGCACC ACTCAATCAA ATCGATGTCC
GAGTAGTCCA GCTCCCTGCA CAAGGTTCCG AGGGTACGTC GACTGTGCAT CGATTTTCCG
GCTGGAACGA AGACGATGAA AATAGCTCGG TGTGCAACAT TTATCAGGTA GGAACCATGG
ACACCAAGAT GAATGCAACT CTGGGCCTTT TGCATCATTT GATTCGCGAG CCGGCTTTTG
GTCAATTGCG CACGCAAGAA CAATTGGGAT ATATTGTTCA CACACAGGTC AAAACGAGCG
GGGACAAAGT AAAGTCGTTG CTATTCTTGA TTCAGAGTGA CTCCTTCGAT CCGATCCACA
TGGACCAACG GATCGAAGCG TTTTTGGTAG ATTTTCGTCA TAAACTGGTG CAAATGTCGG
AGCCTGACTT TGCCGCCAAT GTTGGCGCCT TGTGCCAAAG CTTTTTGGAG AAAAACAAGA
ACTTGAGTGA AGAATCGTCC CGATATTGGC ACGTGATCAC CAACCAAACC TATCGATTCT
ACCGGATGTC CGAATTGGCG GCTGCTGCCC AAACCGTAAC AAAATTGGAT GTTTTGCGTT
TCTTGGACCG TCACGTCCTG GCAACGTCCC CGTACCGCCG TAAGCTGTCT GTGCAAGTGT
TTGGACAAAA TCATATTGCG GATCTCTTAG ACAAGACGGA TGTTGCTGGG GATGGTATTG
TTCTTGTCGA GAGCGCCAAC GACTTCCGTC GGTCACAGGC GCTCTTTCCT ATGCAAGCGT
CCGCTTCGAT TGAGGATTGG CGATTAGACG CGAAAGACGA CTAA
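
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be joined before any calculation. As a minimal sketch, the Python below strips the whitespace and computes a GC fraction. Only the first displayed line is used as an excerpt; pasting the full sequence in its place would allow a comparison with the GC content listed in the record. `gc_fraction` and `GENE_EXCERPT` are hypothetical names chosen for illustration.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in an uppercase DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First displayed line of the gene sequence (an excerpt, not the full gene).
# split() with no arguments drops spaces and newlines between the blocks.
GENE_EXCERPT = "".join(
    "AGCGTGTACA GGTTAACAAC AGTAACTAAG CTTTGCAACA TTGTCACTGT AGCGTTTCAC".split()
)

print(len(GENE_EXCERPT), round(gc_fraction(GENE_EXCERPT) * 100), "% GC")
```

A short excerpt will generally not match the whole-gene percentage, since GC content varies along the sequence.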
 
Protein sequence
MSDSPSTRNA DDLEVPLVDD RSYRYWTLPA TSTADVVTTD RPGLRVLLVH DDSVDKGAAA 
VDVAVGQFQD GDLPGLAHLT EHMLFLGTQR FPQENALDSF LAAHGGHSNA YTDLEHTVYY
MDVQAAQLEP ALDRFGSCFE APLLLENCVA RELQAVDSEH GKNKQSDFWR YHQLTKTLLG
QHNSHVYQQF GTGNLESLQP QGTAVLRQAV HDFYQRYYHT ARMTLCVLGN QDLDVLQGWV
EKYFGSLPSQ PSDTLVEPPV PPLTPVLPQR VHVVPTRETN VLELQWCLRE IQSLYRSKPT
RILSHLLGHE GPGSLLAVLR ERLWVQELYA DDSSKTTSAF SIFCVQLELT VLGWEHVNDV
VATVYRYIGL LQNEIPAWVA DELQTTASTQ FRFLSKSSPS DTVSRVAHQM QEFAIAHVLS
GPYLVYEHDM AAVQSCLASL HVDNMLVLVA SKEYTGQTTA TDPWYGTQYA TVALEPDALE
AWRQARSAAT DGSGVDFIGL HLPDRNDMLA TDFELKTSPY AVFAKTNTND SNGDNGNVPP
PPRCLLDTDT CRLWYKPDTE FRMPKVNIMC VLRSATAYES VTQSVLASLW SETADELCNV
FSYAASMAGL HCNFSNTRNG MELHLSGYHD KAHVLLQRIV DTVRDFRVTP DLFERIQSKL
EQQFQEFLVA QPYQHAIYAG DLCLETPKWD IHDRLQCLAS LTLNDLQHFG RHILARFQLE
MLVHGNVTAS EAVQLSDIVL LGWRPQAPLN QIDVRVVQLP AQGSEGTSTV HRFSGWNEDD
ENSSVCNIYQ VGTMDTKMNA TLGLLHHLIR EPAFGQLRTQ EQLGYIVHTQ VKTSGDKVKS
LLFLIQSDSF DPIHMDQRIE AFLVDFRHKL VQMSEPDFAA NVGALCQSFL EKNKNLSEES
SRYWHVITNQ TYRFYRMSEL AAAAQTVTKL DVLRFLDRHV LATSPYRRKL SVQVFGQNHI
ADLLDKTDVA GDGIVLVESA NDFRRSQALF PMQASASIED WRLDAKDD
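
The protein sequence can be related back to the gene sequence by translating from the start codon. The sketch below uses the first two displayed lines of the gene sequence and assumes the standard genetic code, since the record leaves the Translation table field blank. It also assumes that translation begins at the first ATG in the excerpt. `translate`, `CODON_TABLE`, and `GENE_EXCERPT` are hypothetical names.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed; the record does not
# state which translation table applies).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate complete codons from the start of dna, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two displayed lines of the gene sequence, whitespace removed.
GENE_EXCERPT = "".join("""
AGCGTGTACA GGTTAACAAC AGTAACTAAG CTTTGCAACA TTGTCACTGT AGCGTTTCAC
TGTCACTGTC ACGGATCATG TCTGATTCTC CGTCAACCAG GAACGCCGAC GATCTGGAAG
""".split())

cds_start = GENE_EXCERPT.find("ATG")  # 0-based index of the first ATG
peptide = translate(GENE_EXCERPT[cds_start:])

print(cds_start + 1, peptide)  # 78 MSDSPSTRNADDLE
```

The translated excerpt matches the first fourteen residues of the protein sequence shown above, with the start codon at base 78 of the displayed gene sequence.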