Gene PHATR_18572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_18572 
Symbol 
ID7204390 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp598333 
End bp600997 
Gene Length2665 bp 
Protein Length736 aa 
Translation table 
GC content47% 
IMG OID 
Productcatalase-peroxidase 
Protein accessionXP_002186090 
Protein GI219113013 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGAAAGCATT CACCGATTGG AAGGAATGCG AAAGACCACG CCTTTCTTCG AAACCGATAG 
ACGAACGAGA TCGATTCCTT GAGGATGTAA GATTGAGATG GGACAAGAGT AGGTTGTCCC
ATGATATATC GCTTTCATAA ATTTGATTGT TTTATGTCAT TCCGTGTCAT ACTGAAATCA
CATCTGAAAA AGCTTTCGAC AGAGATACGT CGACGTACAC TAATACACGC AGCCACCAGA
CAAACCACTT CATCGATACC ATGGGCTCCG ATAGCAAATG CCCCTTTGCA CCCAAAATGA
ACAAAGCTGC TTCGGCGAAC AGCTATTGGT GGCCTGATAA TATCAACCTG AAAATACTAA
ATCAGCAAGG GGCCAACAAC CCCGACCCTT CCTTTAATTA CAAGGAAGCG TTCCGAGAGT
TGGACGTTGA GATGGTGAAG CGTGATGTCA ACAAGATGCT GACTACCTCT CAGGACTGGT
GGCCTGCTGA TTGGGGTCAC TATGGGCCTT TTATGATTCG AATGGTACGT ACACCAACGC
CTTGTAGACT TCGTCGCGGT TTCGATTGCT GTGCACCTAA CGCGTATACT TTCTTGTTTC
GCGCACAGGC ATGGCACGCG GCAGGAACCT ACCGTATTTC CGATGGAAGA GGCGGTGCCA
ACACCGCTAA CCAACGTTTC GCGCCATTGA ATAGCTGGCC GGACAACGCC AACCTTGATA
AGGCTCGTCG TCTCATGTGG CCAGTTAAGC AAAAGTATGG CCGAAAGCTT TCGTGGGGTG
ATTTAATGAT GCTCACGGGT AATCAGGCTT TGGAAATTAT GGGCCTAAAA ACAATTGGCT
TTGCATTTGG TCGTGAAGAT ATATACTCCC CCGAGGACGA TGTGTACTGG GGTCCGGAAA
AGGAGATGCT CAGTAACGAT AGGTTCGACG AGAACGGTGA TATTAAACGC CCACTTGGAG
CCTCCGAAAT GGGTCTCATT TACGTAAATC CTGAAGGTCA TGATAACGAG CCGAATCCAA
CGAAGTCTGC TCACGATATC CGTCAGACCT TCAGAAATAT GGCCATGGAC GATTACGAAA
CTGTCGCACT TATTGCTGGT GGACATACAT TTGGAAAGAC CCATGGCGCC GCTCCAGTAT
CACACCAAGG ACCAGAACCG GAAGGTGCAT CAATCGAACA TCAACAGCTT GGCTATCTGA
GCGACTACGG GACCGGTAAA GGAAAAGATA CCACCACTAG TGGTCTCGAA GGAGCTTGGA
CCGAAACCCC AATCCAATGG GATATGAACT ATTTCAAGAA TTTGTTCGAG TATGAATGGG
AGGTTCATAA AGGCCCGGGG GGTCGCCACC AATGGAGACC GACCGACAAG TCTACCTTTG
AAATGGTACC CGATGCTCAC GAGAAGGGTA AAAAGAACCC CCCAATGATG TTCACCACTG
ACATTTCGTT GAAGATCGAT CCAATCTACG GGCCGATCTC TCGTCACTTC TACCATCATC
CAGATGAATT TTCTGCTGCT TTTGCAAAGG CATGGTACAA GTTATTACAC CGTGATATGG
GACCAGTCTC TCGTTGTCTA GGAAGCGATG TTCCGGAACC CCAATTGTGG CAGGATCCGA
TCCCCTCTCT TGACCATGAG CTGATCGACG ACGAAGATGT TGCCAAACTT AAAAAGCAAA
TTCTTGGCTC GACCGGTATC GCCGGCAAGA TTTTGGGGTC TAGCGGCCCT TGTGTATCTG
AACTAGTGAA GGTTGCCTGG GGTTCTGCAT GTACATACCG AGGCACGGAT CACCGTGGCG
GAGCGAATGG TGCTCGTATC CGTTTGGCAC CCCAAAACAC TTGGAAGGTA AATGACCCGG
AAGAGCTCAA AAGAGTGCTA GAACATCTGG AACAGATTCA ACTAGACTTC AATGAGCAAC
AGAAAGGCAG TAACAAGCGA GTATCACTAG CAGATTTGAT TGTTCTAGGT GGTTGCGCTG
CAATTGAGTA TGCTGCCAAA AATGCTGGCA ATGATATCAA TATACCGTTC TCCCCTGGAC
GCACGGACGC GTCATCGGAG CAGACGGTTG CAGAATCATT TGACGCTCTG GAACCGAGCG
CAGATGGCTT TCGCAACTAT TTGAAAGAAG GCCAAAGTGC CAAACCGGAA GAGCTCCTGT
TAAATCATGC TCACTTGCTT ACTTTGACTT CCCCCGAGAT GACGGTGCTA TTGGGAGGGC
TTCGCGTTTT GAAGGCAAAC ACGGGAAATT CTGAGATGGG TGTATTTACA AAAAACCCAG
AAACGCTAAC GAATGATTTC TTTGTCAATC TTCTGGACAT AAACACGACC TGGTCGTCTG
TGGAACAAGA CAAGAACCTA TTCGATGGTC TGGAATACGG CACAGGGAAA CTGAATTGGA
AAGCGAGTCG TTTCGATTTG ATCTTTGGTT CCAATTCTGA GCTCCGTGCG ATTGCAGAGT
ATTACGGTAG TGATGATTCA AACGAAGTGT TTTTAAAGGA TTTTGTCAAA GCTTGGACAA
AGGTGATGGA GCTTGATCGT TTTGATTTGA AATAACCGTA AGTATTGACA GATCCAAGTT
AGAAGTGTTC AATTTTATGA TAGTTGATGT AGATAAAATT CCTGGCCATT ATTCTGAATA
AAGTAGAAGA GTGTTAATGC TATGA
 
Protein sequence
MGSDSKCPFA PKMNKAASAN SYWWPDNINL KILNQQGANN PDPSFNYKEA FRELDVEMVK 
RDVNKMLTTS QDWWPADWGH YGPFMIRMAW HAAGTYRISD GRGGANTANQ RFAPLNSWPD
NANLDKARRL MWPVKQKYGR KLSWGDLMML TGNQALEIMG LKTIGFAFGR EDIYSPEDDV
YWGPEKEMLS NDRFDENGDI KRPLGASEMG LIYVNPEGHD NEPNPTKSAH DIRQTFRNMA
MDDYETVALI AGGHTFGKTH GAAPVSHQGP EPEGASIEHQ QLGYLSDYGT GKGKDTTTSG
LEGAWTETPI QWDMNYFKNL FEYEWEVHKG PGGRHQWRPT DKSTFEMVPD AHEKGKKNPP
MMFTTDISLK IDPIYGPISR HFYHHPDEFS AAFAKAWYKL LHRDMGPVSR CLGSDVPEPQ
LWQDPIPSLD HELIDDEDVA KLKKQILGST GIAGKILGSS GPCVSELVKV AWGSACTYRG
TDHRGGANGA RIRLAPQNTW KVNDPEELKR VLEHLEQIQL DFNEQQKGSN KRVSLADLIV
LGGCAAIEYA AKNAGNDINI PFSPGRTDAS SEQTVAESFD ALEPSADGFR NYLKEGQSAK
PEELLLNHAH LLTLTSPEMT VLLGGLRVLK ANTGNSEMGV FTKNPETLTN DFFVNLLDIN
TTWSSVEQDK NLFDGLEYGT GKLNWKASRF DLIFGSNSEL RAIAEYYGSD DSNEVFLKDF
VKAWTKVMEL DRFDLK