Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_17199 |
Symbol | PGP_1 |
ID | 7196670 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 183075 |
End bp | 185531 |
Gene Length | 2457 bp |
Protein Length | 517 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | 2-phosphoglycolate phosphatase |
Protein accession | XP_002177036 |
Protein GI | 219110569 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.493472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGAAGAGGGA GACCCAAGAC TACTCTCGGA AGACAACGGC GCTCCGGCAC TTTTACTGAT TGTCAATCCC ATTGCTTGGC GAGGTCAGCA GAACACCACC TCATCATCAC GAGCTGCTGT ACGTCAAGAC ACAGAGCAAC TGATGATGCT CTCTTCGATC CGTCATCAAA CGAGCTGTAC TCCGAGACAA GAGGATGACG ATGCCGCGAT TACAACGTCC TCGGGAATGT ATCGCCGTAA ATATCACGTG CGGGCGGTGT GTCAGTCTCT GGCATTTCTT ACATTGATTG GAGTGTTTTC TGTTTACAAC TACACCTTTG ACGATAGTGA AGGAGATTAT GCTGTCCAAG GCGTCGTTAC GCCGATCGAT ACTCGGCGCG TAGCGGAACT CGCTTTGATG AGCCCAGCGC AACGTCGTCG GGCGGAAACA TTGGTGTCGT GTGACGATAT TGAAAAGGCG GACCCTCGCT GGCTCACCGT ATTTCTCTGC ATTGGAGTCC TGTACATGTT CCTCGCTCTC GCCATTGTTT GTGACGAGTT CTTTGTGCCG GCCTTGGAAG AAATGTCTTC GAAGCGACGT ATGAATCTCT CGATGGACGT TGCGGGAGCC ACACTCATGG CAGCGGGTGG ATCGGCTCCC GAGTTGTTTA CGTCCCTTTT TGGTACATTT ACCGAGAGCG AAATTGGGTT TGGTACGATT GTGGGTAGTG CCGTCTTTAA CGTCCTCTTT GTCATTGCCA TGTGCACGAT ATTCTCCAAA GAAGTCTTGG CGTTGACGTG GTGGCCTCTC TTTCGTGACA GTCTCTTCTA CGCGATTGGA CTCGTCGTTC TATCCATTTT TGTCGGTGTT ACCAGTCCCG AAGAAATTGA ACTATGGGAA GCGATTGTGC TCTTTGCCAT GTACTTTTTG TACTGTGTAA TCATGTATTT CAATGCGGAC ATTTACCATT ATCTCACCGG CAAGGTACTC ATATATCCGG AAGACTCGGA CGACGAAGAG AGTACCGCCT CGCAAGAACA ACGGCAAGAA GCTGCGGCCG CAACCCGCGA TATTGAACGC CCTTCATTGG AAAAGGAAGG CTCGGCCAAT TCCTTGGCCA GTGCTTTGCA TTTGGTCACT CTCCAGAACG ATTTACAGTT AATGGGTCAA CACAGCTTTC GCTGGCAGGG TACCTTTCGT GCCGGGATTC TCAAGTTGTT GCGGGATCCC CATACCTGGG TCGAGACTGG CGGAGTCGGT ATCGTCGCCA AGATTGCCGG AGATGCGGAC TACATCTTTC GGAAAATTGA CGCCAACGGC GATGGACACT TGGACAAGGA AGAACTCAAG CGACTCTTTG AAGCCCTGGA CTGTCACGTC AGCCCGGAAG AATTGACCGA AGTTTTTGAT ATCCTCGACG TCAACAAGGA CGGTGTTATT AGTGAAGAAG AATTCAACAA ATGGTACACA ACATCCGCCG AACTCATTCG TTCGCAAACG CGGCAAGTGT TTGACAAGAT GGACGCTGAT CACTCTGGAA CCTTGGACAA GGACGAAATC AAGACTTTGC TGCAAGAACT TGACCCACAC GTTACTGACG AAGATGTCAC CGCGGCAGTA GACGAAATGT ACCAGCATGG ATCTCGTGAA GAAATTTCTT TTGAAGAGTT CGAAGAGTGG TACGAAAAGT CGATCATCTA CGAACGCCAA AAAAAGGCCG TCGAAGAAGA CAAGGAAGGC GCCGGTCAGA GTCTGAAACC TCCTTATGGC GAAGGCATTT TGAGCTGGAC ACAGTACATT ATTGTTTTCC CACTCGTCTT CGCAATGGTT TTTACGATCC CGGATGTTCG ACGGCCGGGA TGGGGTCGCT GGTGCTATTT GTCCTTTGTG CTTTCGATCG CATGGATTGG AGGTTTTGCT TATCTCATGG TAACATGGGC CGAAACCATT GGAAATACGG TCGGAATTCC CTCTGTAATT ATGGGTTTGA CTGTTTTGGC AGCCGGAACG TCTGTACCAG ATTTACTTTC GAGCGTAATT GTGGCGCGAA GGGGATCGGG TGATATGGCT GTTTCCAGCT CCATTGGCAG TAATATATTC GATATTCTGG TCGGTCTACC CGTGCCCTGG ATTTTGTATA CCTCCTGGCC CTCGAAAGAT TCGACGGTGG TTATTGCTTC GGGCAAAATT TGGATTTCCA TTTTTGTCTT GATTGGTATG CTGGTCTTTG TTATTGCTGC CGTTCACTGC CAAGGATGGA AGTTGACCAA GACACTGGGA GCCATGATGA TAGTCTTTTA CTTCGCATTT TTGGCCCAGG CAATTCTTTT GGAGCTGCCT TTCGAGACTT GCATCAGCTC CCCTTAAAAG AAGGGGCTGT TTTGCCGCAG GGACGCTTGC GCACTATTTG CGAACATTAA ATTGGTTATC ACGCTTAGGT AGTCGTTTAC AGTAATAAAC AATACATGAT TTACAGT
|
Protein sequence | VFLCIGVLYM FLALAIVCDE FFVPALEEMS SKRRMNLSMD VAGATLMAAG GSAPELFTSL FGTFTESEIG FGTIVGSAVF NVLFVIAMCT IFSKEVLALT WWPLFRDSLF YAIGLVVLSI FVGVTSPEEI ELWEAIVLFA MYFLYCVIMY FNADIYHYLT GKVLIYPEDS DDEESTASQE QRQEAAAATR DIERPSLEKE GSANSLASAL HLVTLQNDLQ LMGQHSFRWQ GTFRAGILKL LRDPHTWVET GGVGIVAKIA GDADYIFRKI DANGDGHLDK EELKRLFEAL DCHVSPEELT EVFDILDVNK DGVISEEEFN KWYTTSAELI RILSWTQYII VFPLVFAMVF TIPDVRRPGW GRWCYLSFVL SIAWIGGFAY LMVTWAETIG NTVGIPSVIM GLTVLAAGTS VPDLLSSVIV ARRGSGDMAV SSSIGSNIFD ILVGLPVPWI LYTSWPSKDS TVVIASGKIW ISIFVLIGML VFVIAAVHCQ GWKLTKTLGA MMIVFYFAFL AQAILLE
|
| |