Gene PHATRDRAFT_56488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_56488 
SymbolZEP2 
ID7196921 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2222646 
End bp2224755 
Gene Length2110 bp 
Protein Length604 aa 
Translation table 
GC content50% 
IMG OID 
Productprecursor of protein zeaxanthin epoxidase-like protein 
Protein accessionXP_002176935 
Protein GI219110367 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAGTGACATT GCAAGCCGAT CCGAGGGAAG GTCACCAATC TTGTTCGAAA GAAACACATC 
CGCTGCCTGG AATCCTTCTG TCACTAACAG TCAATCCGGC AAGAAAGGTC TCCCAAGGAA
CGAATCGATC TGAATTGGCT ACGTTGCCTC TGATTGACGA CAACGATCGT ATGGGTCTTT
CGTTTCTATC ATTATGCGCC GTGCTGACGG CGTCGTCCGC AATGGCCTTT GTCACCACCC
GATCACCGGC TTGCAACGAC GTGACTCGAT CACTGCATAG AATCAATACA AGGCACATGA
CCTACCCATT TTACCCGGCA TCCTCTCTCC GTATCTCCAC GCGCGTTGCG AGCACGGCGG
TTCCTCCGGA AGACGTTGCC TTTGACAAGC TAAGTTTACC CGCTCGCGAG GGCCGGCCTC
TGAAGATTGC TATTGCCGGC GGTGGGGTAG GCGGTTTGAC GACCGCTCTC TGCATGTTGA
AAAAGGGTTT CGACGTGACG GTGTACGAGA AGACTGCCGC CTTTGCCCGC TTTGGGGGAC
CCATCCAGTT TGCCTCCAAC GCCTTGTCAG TCCTGAAAGA AATCGACGAA ACCCTCTTTG
AGCGTGTCAT GGACAAGTTC ACTTTCACCG GAACACGAAC CTGTGGGATC AAGGACGGTT
TGCGAGCGGA TGGAAGCTTC CGGATGACGG AAGATCGTCT GGATTATTTG TGGAATCCGG
ATGCTCCGGC CGATTGGTTC GTCAAATTTC CGCTCAAAGT ACGTTGCAAC ACACTCCAAC
CATTCACAAT CGAGGCCGTA ACACGAAAAC TTACCTTTCA CTTCTTTGGT ATTATTATCT
CTAGCAATGC GCCGATTTGT TCGGTTTACC CTACACGGGT GTCATTGACC GTCCCGATTT
ACAAGAAATT CTCATTGACG AATGCCGCAA ACTTAAACCC GACTTTCTCA TCAACGGCAA
TCCCGTCGTA GGTTACGAAG ACTTAGGCAA AGGCCAAGGT GTGACAATCA ATTTAAACGA
TCAGACAACT GCGTCCGCCG ATGTTCTAGT AGGATCGGAC GGAATTTGGT CAGCCGTACG
TGACCAAATG TACAAGGAAG GTGGTGTCAA ATCTACCAGT GCCAACAAGA AGAAACGGCA
GGGATGCGAC TATTCCGGAT ACACCGTCTT TGCGGGAGAA ACAATCCTCA AGACCCCGGA
CTACTATGCA ACTGGATACA AGGTATACAT CGGACCCAAG CGGTACTTTG TTACGAGTGA
TGTCGGTGAT GGCCGCATTC AGTGGTACGC CTTCTTTGCG CTACCTCCCG GAACCAAAAA
AGCTCCCAGC GGTTGGGGTG GCTCCACTCG TGACGGACAG ACGGATCCGG AGGAGAATCT
GGTAGACTAC GTCAAGGGCT TGCATGAAGG ATGGAGCGAT GAAGTCATGA TGGTTTTGGA
CTCGACGTCA CCAGACAGTG TGGAACAACG TGACTTGTAC GATCGGGCGC CCGAATTATT
TCGCAGTTGG GCGAATGGCA ACGTCGTACT CATTGGTGAT GCAGTACACG CCATGATGCC
AAACTTGGGT CAAGGAGGTT GCCAGGCAAT TGAAGACGCC TACGTCTTGA CGGAAACTTT
GGCGAACACA CGCACCACAG AAAAGCTGCA AGATGCATTA CAAGAGTACT ACCGCAAGCG
CATCGTACGA GTGAGTATTG TGCAGTTTCT GAGTAAGCTG GCTAGCGACT TGATCATCAA
CGCGTTTGAC ACCCCCTGGA GTCCACACGA CAACCTCGGA AAGTCCTGGA AATCCTACTT
GACATTCTTT TGGAAGCCTA TCTTGCAGTT CGCCATTTTT CCAATGCAGT TCGCCTATCT
TTATTCCTAC TATCCGACGG GGAATATGGG TGATCTTCCG GCCAAATTAG AAGCAATTTG
GAAAGAGAAG CACAAAACAG ATGCCGAAGC TGTGTTTGAG CAAGCATCCA AGGAAGGCTT
TGTCATGGAA CACGAAGCGT CCTTTTTCAA AAAAGCGGAA GTAGAATTGT CCCCAACAGC
TTTGGCAGCT ACGAAAGAAG AACTAAGCTA GAAGTGAAGA AATTCCATAG TTTTCATTGT
AGCCTTTCGG
 
Protein sequence
MGLSFLSLCA VLTASSAMAF VTTRSPACND VTRSLHRINT RHMTYPFYPA SSLRISTRVA 
STAVPPEDVA FDKLSLPARE GRPLKIAIAG GGVGGLTTAL CMLKKGFDVT VYEKTAAFAR
FGGPIQFASN ALSVLKEIDE TLFERVMDKF TFTGTRTCGI KDGLRADGSF RMTEDRLDYL
WNPDAPADWF VKFPLKQCAD LFGLPYTGVI DRPDLQEILI DECRKLKPDF LINGNPVVGY
EDLGKGQGVT INLNDQTTAS ADVLVGSDGI WSAVRDQMYK EGGVKSTSAN KKKRQGCDYS
GYTVFAGETI LKTPDYYATG YKVYIGPKRY FVTSDVGDGR IQWYAFFALP PGTKKAPSGW
GGSTRDGQTD PEENLVDYVK GLHEGWSDEV MMVLDSTSPD SVEQRDLYDR APELFRSWAN
GNVVLIGDAV HAMMPNLGQG GCQAIEDAYV LTETLANTRT TEKLQDALQE YYRKRIVRVS
IVQFLSKLAS DLIINAFDTP WSPHDNLGKS WKSYLTFFWK PILQFAIFPM QFAYLYSYYP
TGNMGDLPAK LEAIWKEKHK TDAEAVFEQA SKEGFVMEHE ASFFKKAEVE LSPTALAATK
EELS