Gene Information |
Locus tag | PHATRDRAFT_56488 |
Symbol | ZEP2 |
ID | 7196921 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2222646 |
End bp | 2224755 |
Gene Length | 2110 bp |
Protein Length | 604 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | precursor of protein zeaxanthin epoxidase-like protein |
Protein accession | XP_002176935 |
Protein GI | 219110367 |
COG category | |
COG ID | |
TIGRFAM ID | |
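
The coordinate fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption about the convention, though it agrees with the listed Gene Length). The helper names `span_length` and `coding_length` are hypothetical and are not part of the database or any library.

```python
# Values copied from the Gene Information table above.
START_BP = 2222646
END_BP = 2224755
PROTEIN_LENGTH_AA = 604


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end], assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)

print(gene_len)            # 2110, matching the Gene Length field
print(cds_len)             # 1815
print(gene_len - cds_len)  # 295 positions in the span that are not needed for coding
```

The span is longer than the coding length implied by the protein, which is consistent with the record marking the gene as spliced. The sketch does not say where the non-coding positions fall.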
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGTGACATT GCAAGCCGAT CCGAGGGAAG GTCACCAATC TTGTTCGAAA GAAACACATC CGCTGCCTGG AATCCTTCTG TCACTAACAG TCAATCCGGC AAGAAAGGTC TCCCAAGGAA CGAATCGATC TGAATTGGCT ACGTTGCCTC TGATTGACGA CAACGATCGT ATGGGTCTTT CGTTTCTATC ATTATGCGCC GTGCTGACGG CGTCGTCCGC AATGGCCTTT GTCACCACCC GATCACCGGC TTGCAACGAC GTGACTCGAT CACTGCATAG AATCAATACA AGGCACATGA CCTACCCATT TTACCCGGCA TCCTCTCTCC GTATCTCCAC GCGCGTTGCG AGCACGGCGG TTCCTCCGGA AGACGTTGCC TTTGACAAGC TAAGTTTACC CGCTCGCGAG GGCCGGCCTC TGAAGATTGC TATTGCCGGC GGTGGGGTAG GCGGTTTGAC GACCGCTCTC TGCATGTTGA AAAAGGGTTT CGACGTGACG GTGTACGAGA AGACTGCCGC CTTTGCCCGC TTTGGGGGAC CCATCCAGTT TGCCTCCAAC GCCTTGTCAG TCCTGAAAGA AATCGACGAA ACCCTCTTTG AGCGTGTCAT GGACAAGTTC ACTTTCACCG GAACACGAAC CTGTGGGATC AAGGACGGTT TGCGAGCGGA TGGAAGCTTC CGGATGACGG AAGATCGTCT GGATTATTTG TGGAATCCGG ATGCTCCGGC CGATTGGTTC GTCAAATTTC CGCTCAAAGT ACGTTGCAAC ACACTCCAAC CATTCACAAT CGAGGCCGTA ACACGAAAAC TTACCTTTCA CTTCTTTGGT ATTATTATCT CTAGCAATGC GCCGATTTGT TCGGTTTACC CTACACGGGT GTCATTGACC GTCCCGATTT ACAAGAAATT CTCATTGACG AATGCCGCAA ACTTAAACCC GACTTTCTCA TCAACGGCAA TCCCGTCGTA GGTTACGAAG ACTTAGGCAA AGGCCAAGGT GTGACAATCA ATTTAAACGA TCAGACAACT GCGTCCGCCG ATGTTCTAGT AGGATCGGAC GGAATTTGGT CAGCCGTACG TGACCAAATG TACAAGGAAG GTGGTGTCAA ATCTACCAGT GCCAACAAGA AGAAACGGCA GGGATGCGAC TATTCCGGAT ACACCGTCTT TGCGGGAGAA ACAATCCTCA AGACCCCGGA CTACTATGCA ACTGGATACA AGGTATACAT CGGACCCAAG CGGTACTTTG TTACGAGTGA TGTCGGTGAT GGCCGCATTC AGTGGTACGC CTTCTTTGCG CTACCTCCCG GAACCAAAAA AGCTCCCAGC GGTTGGGGTG GCTCCACTCG TGACGGACAG ACGGATCCGG AGGAGAATCT GGTAGACTAC GTCAAGGGCT TGCATGAAGG ATGGAGCGAT GAAGTCATGA TGGTTTTGGA CTCGACGTCA CCAGACAGTG TGGAACAACG TGACTTGTAC GATCGGGCGC CCGAATTATT TCGCAGTTGG GCGAATGGCA ACGTCGTACT CATTGGTGAT GCAGTACACG CCATGATGCC AAACTTGGGT CAAGGAGGTT GCCAGGCAAT TGAAGACGCC TACGTCTTGA CGGAAACTTT GGCGAACACA CGCACCACAG AAAAGCTGCA AGATGCATTA CAAGAGTACT ACCGCAAGCG CATCGTACGA GTGAGTATTG TGCAGTTTCT GAGTAAGCTG GCTAGCGACT TGATCATCAA CGCGTTTGAC ACCCCCTGGA GTCCACACGA CAACCTCGGA AAGTCCTGGA AATCCTACTT 
GACATTCTTT TGGAAGCCTA TCTTGCAGTT CGCCATTTTT CCAATGCAGT TCGCCTATCT TTATTCCTAC TATCCGACGG GGAATATGGG TGATCTTCCG GCCAAATTAG AAGCAATTTG GAAAGAGAAG CACAAAACAG ATGCCGAAGC TGTGTTTGAG CAAGCATCCA AGGAAGGCTT TGTCATGGAA CACGAAGCGT CCTTTTTCAA AAAAGCGGAA GTAGAATTGT CCCCAACAGC TTTGGCAGCT ACGAAAGAAG AACTAAGCTA GAAGTGAAGA AATTCCATAG TTTTCATTGT AGCCTTTCGG
|
Protein sequence | MGLSFLSLCA VLTASSAMAF VTTRSPACND VTRSLHRINT RHMTYPFYPA SSLRISTRVA STAVPPEDVA FDKLSLPARE GRPLKIAIAG GGVGGLTTAL CMLKKGFDVT VYEKTAAFAR FGGPIQFASN ALSVLKEIDE TLFERVMDKF TFTGTRTCGI KDGLRADGSF RMTEDRLDYL WNPDAPADWF VKFPLKQCAD LFGLPYTGVI DRPDLQEILI DECRKLKPDF LINGNPVVGY EDLGKGQGVT INLNDQTTAS ADVLVGSDGI WSAVRDQMYK EGGVKSTSAN KKKRQGCDYS GYTVFAGETI LKTPDYYATG YKVYIGPKRY FVTSDVGDGR IQWYAFFALP PGTKKAPSGW GGSTRDGQTD PEENLVDYVK GLHEGWSDEV MMVLDSTSPD SVEQRDLYDR APELFRSWAN GNVVLIGDAV HAMMPNLGQG GCQAIEDAYV LTETLANTRT TEKLQDALQE YYRKRIVRVS IVQFLSKLAS DLIINAFDTP WSPHDNLGKS WKSYLTFFWK PILQFAIFPM QFAYLYSYYP TGNMGDLPAK LEAIWKEKHK TDAEAVFEQA SKEGFVMEHE ASFFKKAEVE LSPTALAATK EELS