Gene Information |
Locus tag | PHATRDRAFT_43120 |
Symbol | GPH |
ID | 7196892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2092452 |
End bp | 2093611 |
Gene Length | 1160 bp |
Protein Length | 291 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | phosphoglycolate phosphatase, PGPase |
Protein accession | XP_002176905 |
Protein GI | 219110307 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.691617 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAGTTTCCCT GCAGAAAGAG GATCCCGATC GAAAGTGAGC CGAAATTTTT TTGAGAGTAC CTGCGATCTC TATAAACTAG CAGAATTGAA AGTTGTTGTT GGTGTCGCAA TCGAGACCAC TGTTTAGCAA GGTAATGCTC CCTTCCTTCA TGATCCCACG TTCATTCCTT CTTTACTGTC TGTGGGCCCA ACAGACCATT TCATCTTTCC CGCTTCACTC TACAACCCCA CAAACTATCT CCAGCTTTCG GTCGTTCAGG CTACCTGCAT CCAGTGTCCC TGCTGCGAAC GACGACGCCA ACAACAGCAT GTACCAAAAT ATCAAAGGCA TCATTTTCGA CGTTGACGGC ACGTTAGCCG ACTCCTGGAA GCTAGGATAC GACGCAACGG TCGTTATACT CGACAAACAC AATCTCCACC CTATCACGGA ACAAATTTAT CACGAGCACA CGGTATACTG CACTCCGGAG CGACTCGCCC GACACGCCGG TCTCGTACCA GGCGACGAAA CGTACGCCGA AGTTGGTGCC AAGCTCGGGA AGGAGTTTGA CGATTTGTAC GTTGGTCTCG TGTCGTCCCA AACGGCAGGC TTTTACCCTG GCGTGGCGGA GTGTTTACAG GCCATTCCAT CCGATATTGC CTTCGGGGCT CTGACGAATG CGGCGGTGAA CTACGCGCAC GCTGTTTTGC AAGTCAACGA TCAAAATAAA AATCTCGTGA ACCGTTTTGT CTCCATTCAC GGGGCCGACT CAGTGCCGGA GCCCAAACCG TCTCCCGCTG GTTTGCTTCA AGTATGCCGA GATCTGAATC TGCGACCCGC AGACTGTGTT TACATTGGTG ATAGCCCGAG TGACGGCAAA GCCGCAGAAG CGGCTGGTAT GGGAGCGATA GCCGTCTTGT GGGGCAGCCA CAAAGAAGAC ACCTTAAAGC AAGCGCCCTT TACACATTAC TGTCGGACGG TCCAAGAATT GCAAGCCCTT CTGCCCAAAA CCTCCGCGGC CGTGAGCTAG TGGTGCATTC GTGGAAACAC AGTCAATGTG CCATCGTGAT CTGCATTTGT GCACGACCAG TGTAGAATAC CTAGAGAATC TCCACACATT TTTCTGTTGA TTCTCTACCC TAAACCAACT TAATAATGAA TAAAATTTTT TTGAAGGAGG
|
Protein sequence | MLPSFMIPRS FLLYCLWAQQ TISSFPLHST TPQTISSFRS FRLPASSVPA ANDDANNSMY QNIKGIIFDV DGTLADSWKL GYDATVVILD KHNLHPITEQ IYHEHTVYCT PERLARHAGL VPGDETYAEV GAKLGKEFDD LYVGLVSSQT AGFYPGVAEC LQAIPSDIAF GALTNAAVNY AHAVLQVNDQ NKNLVNRFVS IHGADSVPEP KPSPAGLLQV CRDLNLRPAD CVYIGDSPSD GKAAEAAGMG AIAVLWGSHK EDTLKQAPFT HYCRTVQELQ ALLPKTSAAV S
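
The length and GC fields above can be cross-checked against the coordinates and the sequence text. The following is a minimal sketch, not part of the original record: the helper names `ungroup`, `span_length`, and `gc_percent` are hypothetical, and it assumes that Start bp and End bp are 1-based and inclusive (which is consistent with the reported Gene Length) and that the spaces in the sequence fields are only 10-letter display grouping.

```python
def ungroup(seq: str) -> str:
    """Strip the whitespace that splits a sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a (possibly grouped) DNA string."""
    bases = ungroup(dna)
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    print(span_length(2092452, 2093611))  # 1160, matching Gene Length
```

To check the remaining fields, paste the full Gene sequence value into `gc_percent` and compare the rounded result with the GC content field, or pass the Protein sequence value to `len(ungroup(...))` and compare it with Protein Length.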