Gene PHATRDRAFT_45845 details

Gene Information

Locus tag: PHATRDRAFT_45845
Symbol: ZEP1
ID: 7201093
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011676
Strand:
Start bp: 450390
End bp: 452226
Gene Length: 1837 bp
Protein Length: 565 aa
Translation table:
GC content: 56%
IMG OID:
Product: zeaxanthin epoxidase
Protein accession: XP_002180238
Protein GI: 219118943
COG category:
COG ID:
TIGRFAM ID:
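
The coordinates and lengths in the table above agree with each other if the start and end positions are read as 1-based and inclusive: 452226 - 450390 + 1 = 1837 bp. The record does not state its coordinate convention, so that reading is an assumption. A 565 aa protein needs (565 + 1) x 3 = 1698 nt including the stop codon, which leaves 139 bp of the reported gene length outside the coding region. A minimal Python sketch of this check follows; the function name `span_length` is hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
gene_length = span_length(450390, 452226)

# 565 codons for the protein plus one stop codon, 3 nt each.
coding_length = (565 + 1) * 3

print(gene_length, coding_length, gene_length - coding_length)
# prints: 1837 1698 139
```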


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.915807
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGCCGATCTC CTTGCTTTCT CATTCGATTC TTCTTTCCAA TTTGATCACA CCGTGGAAAT 
AACAAGTAAC ATTTGTTTCA ATACTCTAGC GGCCGACGAG ATCAAAATTG CTCTTTCCCA
ATGAAGTTTT CTACCACGGT GTCATCGGCA CTGTTCCTGA TCGCGTCGGT ATCGACCACG
ACTTCCTTCA CACCCGTCCA GTCCTTTGGC GTGCACCGTC GAACCCTGCT CGTCACTCCC
CGCCACGCTA CCGTAGAACC CCCGGTACGG GAACCAGAAA CCTCCGATCG CGTACGGCAG
GTCCGCGATC GTTTCCGCAA GGCTTCCCAG GACGCCGCCA ACGCCAAAGG CTGCGTCGCC
CAGGACGACG GCGACGAATC CAGCTGGTGG CGCAAGCCCC TGCCGGAAGA CAACGACGTC
ATTAGTAACC AGCGACCGCT CCGAGTCGTC ATTGCCGGTG GAGGTGTTGC GGGACTCGTC
ACGGCCGCAG CCTGCCACGC CAAGGGCATG CAAGTGGCCA TCTTCGAACA GGCCTCGCAG
TACGCACCCT ACGGTGGACC CATACAGATA CAATCCAACG CACTGCGGGC ACTCGAACGC
ATCAATCCCG TTATTTGTGA AGAAATTCGC AAGGCCGGCA CCGTCACGGC GGACCGTGTG
TCGGGACTCA AGATTGGTTA CAAGAAGGGC GTCTTTCTAG GACTCGGCAA GCAGTACGAA
AAGGGGGACT GGTTGGTACG CTTCGATACC CTACAGCCAG CGCTCGATGC CGGTCTCTAC
CCCACCGTCG TCGTCGACCG ACCCGTCATT CAACAAATTC TACTGGAACA CGGTATTCCG
GAAAAGACGG TCCGCATCAA GTCCCGTATT GCCAATTACG AAGAACTCGG ACCCGGCAAG
GGCGTGCGGA TTCTCCTCGA AGACGGCACG GTGGCCTACG CGGACGTTTT GATCGGTTCC
GACGGTATTT GGTCCTCCGT GCGGCGGATT ATGCACGGAC TGGATCAGGG CGCCGACGGG
TTCGCGGCCT CGGGCGCCGC CGGTGGGGCC CTCAACGAAG CCGAAGCCCG ACGGATGGCC
AAAGACTCGG TGCTCATGGC CAATAACGCG AATCGACGGT ATTCCAAATT TACGTGTTAC
GCAGCCTTGA CGGAGCACCG CGCGAGCAAT ATTGAAGAAG TCAGTTACCA GATTCTACTC
GGCAAGGACA AGTACTTTGT CAGTACCGAT GGTGGCGGCG AACGCCAGCA ATGGTTCGCA
CTGATACGAG AACCAGCCGG TGGAGTGGAT CCCGAACCCA CTCCGGAAAA TCCAACCCCC
AAACTGACTC GTCTCCTGCA AGAATTCAAT CACGAGGAGC CAGGAGATCA GAATGGTGAT
GTGTGGGATG ACTTTGCCTA CGAGCTGTTC AAGGCCACCC CGGAAGAAGA TATCAAACGT
CGTGACTTGT ACGATGGATC GCCATTGTTG ATGCAAGGCT GGAGCAAGGG ACAAGTTGCC
ATTTGCGGAG ATGCGGCTCA TCCTATGATG CCCAACCTCG GCCAAGGTGG CTGTCAGGCT
ACCGAAGATG GCTACCGGCT CGCCGAAGAA CTGGCAACGG TCCGCACCAC GAAAGACATT
GAAGGTGCAT TACAAGAGTA CTACCGCAAA CGTATTCCCC GAACCACGAT CATACAAGCT
TTGGCACAAT TGGGATCCGA TTTGCTCGTG GATTTTGACA AAATGATGAC CATTCCGTTG
GTTGGGCCAT TTTTCTTGTT CATGACACAA GTGTCCATGC CCTTTGTGCT ACGGTTTCTA
TACACGCCAG AGTTTTAATT AGGCAAGAAT TACCCTT
 
Protein sequence
MKFSTTVSSA LFLIASVSTT TSFTPVQSFG VHRRTLLVTP RHATVEPPVR EPETSDRVRQ 
VRDRFRKASQ DAANAKGCVA QDDGDESSWW RKPLPEDNDV ISNQRPLRVV IAGGGVAGLV
TAAACHAKGM QVAIFEQASQ YAPYGGPIQI QSNALRALER INPVICEEIR KAGTVTADRV
SGLKIGYKKG VFLGLGKQYE KGDWLVRFDT LQPALDAGLY PTVVVDRPVI QQILLEHGIP
EKTVRIKSRI ANYEELGPGK GVRILLEDGT VAYADVLIGS DGIWSSVRRI MHGLDQGADG
FAASGAAGGA LNEAEARRMA KDSVLMANNA NRRYSKFTCY AALTEHRASN IEEVSYQILL
GKDKYFVSTD GGGERQQWFA LIREPAGGVD PEPTPENPTP KLTRLLQEFN HEEPGDQNGD
VWDDFAYELF KATPEEDIKR RDLYDGSPLL MQGWSKGQVA ICGDAAHPMM PNLGQGGCQA
TEDGYRLAEE LATVRTTKDI EGALQEYYRK RIPRTTIIQA LAQLGSDLLV DFDKMMTIPL
VGPFFLFMTQ VSMPFVLRFL YTPEF
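
The gene and protein sequences above are printed in blocks of 10 characters, 60 per line, so the whitespace has to be removed before the text can be used as a single sequence. The Python sketch below joins the blocks and computes a GC fraction, which for the full gene sequence can be compared with the GC content reported in the Gene Information table. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers introduced for illustration, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied as printed in the record.
first_line = "CGCCGATCTC CTTGCTTTCT CATTCGATTC TTCTTTCCAA TTTGATCACA CCGTGGAAAT "

seq = clean_sequence(first_line)
print(len(seq))                  # number of bases after removing the block spacing
print(round(gc_fraction(seq), 3))
```

Pasting the whole gene sequence into `clean_sequence` instead of the single sample line gives the input needed to check both the reported gene length and the reported GC content.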