Gene PHATRDRAFT_54731 details

Gene Information

Locus tag: PHATRDRAFT_54731
Symbol: APX1
ID: 7202439
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 518716
End bp: 519987
Gene Length: 1272 bp
Protein Length: 261 aa
Translation table:
GC content: 50%
IMG OID:
Product: ascorbate peroxidase
Protein accession: XP_002181742
Protein GI: 219122832
COG category:
COG ID:
TIGRFAM ID:
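
The start and end coordinates above agree with the listed gene length if both are read as 1-based, inclusive positions. The record does not state its coordinate convention, so that reading is an assumption. A minimal sketch of the check follows; the function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are 1-based and inclusive.

    Assumption: the record uses inclusive coordinates (not stated on the page).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields.
gene_length = span_length(518716, 519987)
print(gene_length)  # 1272

# A 261 aa protein plus one stop codon needs 3 * (261 + 1) coding bases.
# Assumption: a single stop codon terminates the reading frame.
coding_bp = 3 * (261 + 1)
print(coding_bp)  # 786
```

The coding requirement (786 bp) is smaller than the gene length (1272 bp), which fits a record flagged as spliced whose sequence also includes non-coding bases.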


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGGATCGTCT CTCTACGAGG CTGCACGTTT GTCACTGTCA GTGACCCTTG TACATTGTCA 
AGAGCCTGTT TCGCACCTCA AGGGCTGCTT CGCATTCCAA TCCAGTAGTG AGGCCCATAC
AATAGTGTCT TGATACTGTT TTTCAAGAAT TCTTGCTCTC GACTGTGAAT AAGGTAAGTT
GTGGCGAAAT TACTTTACTG ATAACTCTAA GGAACCGCAT CGCAGCGCTT CTTTCGAATG
ACAACTTCTC ATACGCCGTG GTCCCCTCTG ACGATTTAGC AATACAATGC CCGTCTCGAA
AGAAGCACTT TCCTCGGCAA AAGAGATGAT CGATGCCCTC ATTCTGGAGA AAAATTGCGG
TCCAATCATG GTACGCGTGG GCTGGCACGA TTCAGGCACG TTCGATAAAA ACGTCAGCGG
CGCATGGCCT AGTGCCGGGG GTGCAGTCGG TTCCATCCGT TTCGATCCCG AAATTACGCA
CGGTGCCAAC GCCGGTTTGA TCAACGCCAT CAAGCTCTTG GAGCCTATCA AAGAGGCCAA
TCCGGATGTC AGCTACGCTG ATATTTTTCA GATGGCGTCG GCTCGTTCCA TCGAATTGGC
GGGAGGTCCT CGGATTGACA TGAAGTACGG ACGAATCGAT TCAAACGGTC CCGAAAACTG
CTCCAAAGAA GGCAACCTGC CCGATGCCGA ACCGGGAAGC AACGGCATGT ACGGTGGTCC
TGGTGGTAGT GCATCTACGG AAGATTCGAC GGCAGCCGGT CATTTACGTA AAGTCTTCTA
CCGCATGGGA CTGAATGATG AGGAGATTGT TGCTCTCTCC GGTGCCCACA CCTTTGGCCG
CGCTTACAAA AACCGTTCCG GTCTCGGGGC TGAAAAGACT AAATTTACGG ATGGAAGTAA
ACAAATGCGA GCGGATGGCA TCGAGGCCAA GTATACTCCA GGTGGTTCGA GCTGGACGGA
GAATTTTCTC ATTTTCGACA ATTCGTACTA CAAGGTCATC CCAGACGAGT CCGCCGATCC
TGAACTACTC AAGTTGTCAA CTGACAAGGT AGTTTTTATG GACGATGGGT TTAGGCCATT
TGCCGAGAAA TTCCGTGACT CGCAGGATGC TTTCTTCGAG TCATACGCCA AGGCGCACAA
GAAGCTGTCC GAACTCGGAT CCAACTTTGA CCCGTCGGAA GGCATATCCA TGTAAACATG
ACCCGAATTC AATTATATGA GTGTTACCTT TTGTTTCCGT AGAAAAAATC TACTAGCTAC
TCGTTGGGTT CC
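
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs cleaning before its length or GC content can be compared with the fields in the record. The sketch below shows one way to do that in plain Python. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the database's own GC calculation (rounding, handling of ambiguous bases) is not documented here, so small differences from the listed value are possible.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C, counted over the full length."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, copied verbatim from the record.
first_line = "CGGATCGTCT CTCTACGAGG CTGCACGTTT GTCACTGTCA GTGACCCTTG TACATTGTCA"
seq = clean_sequence(first_line)

print(len(seq))                        # number of bases on that line
print(round(100 * gc_fraction(seq)))   # GC percentage for that line only
```

Pasting the whole block of sequence lines into `clean_sequence` should give a string whose length matches the Gene Length field, and `gc_fraction` on it can be compared with the GC content field.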
 
Protein sequence
MPVSKEALSS AKEMIDALIL EKNCGPIMVR VGWHDSGTFD KNVSGAWPSA GGAVGSIRFD 
PEITHGANAG LINAIKLLEP IKEANPDVSY ADIFQMASAR SIELAGGPRI DMKYGRIDSN
GPENCSKEGN LPDAEPGSNG MYAGHLRKVF YRMGLNDEEI VALSGAHTFG RAYKNRSGGS
SWTENFLIFD NSYYKVIPDE SADPELLKLS TDKVVFMDDG FRPFAEKFRD SQDAFFESYA
KAHKKLSELG SNFDPSEGIS M