Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48026 |
Symbol | PGP_2 |
ID | 7203019 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 754068 |
End bp | 755488 |
Gene Length | 1421 bp |
Protein Length | 389 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | phosphoglycolate phosphatase |
Protein accession | XP_002182293 |
Protein GI | 219123982 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAGTAACAT ATTTCCAATG TAAAATCAAA AGCGGACGCT AACAAAACCG CCGACGACCA ACACAGAAAC AAACACGTGT TGATTGTCAA CCACTCTAGC ATTCCGGAGC ATGTCACTGA GAATGGCGGG TACGTTAGCG TTGTTCGTAA CGGGAGCGAC GACGAGAGCA CTGGGTCGAT CCGGGGGCAC CTCGAAGCGA CCTTTCTCGT TGCGACTACC TTTGCAGACG GGTACCCAGC CGACCTTGGC GCTTTCCTTA TCGTCGTCCG CGTCCGCAAA CAAACCGACT CTCTCTTTGA CGGAACAAAT GCGCAAAGAA TCCGAAGCGG AACTCGCTAA ACTGGCCCAT CACTACGAAG ATCGAGCCCG AAACGACCCA GCCTTTGCAG ACCTGGCACC AATCATTTGG AAGACACTCG ACGAAGCAAC AGCTTTTGTG AACGATCACA TTGAGACCAT TATGTTTGAT TGTGACGGTG TGGTTTACCG AACACCGGAT GAGTGTCCCG GTGCCAAAGA ATGCATACAA CGTTTACTCG ACAAGGGGAA GCGGGTGTTT TTTGTCACCA ACAACGCCGC ATCCAATCGA TCGCAGCTGC GCGCCAAGCT CTCCGAAATT CTGGCGATTG AGAATTTGAC CGACGATATG ATGGTTCCGA GTTCCTATTC TTGTGCTCGG TTTCTGCAAC GAGAAATTTT AGATCGAAAA GGGCGTGGCC GTTTGTTCGT CATTGGCAGT CGAGGCCTTT GTGACGAATT GGAGCAAACG GGTTTTGAGG TACTGACTGG GAATGGACCA CTGGACAGCG ACGCATCCAT GACCAGAGAA GATCTAGCTA CATACCCGTT CTCCGAGCAT CCAGTAGACG CGGTCGTGGT TGGTACGTTG TTAATGGGCG AGGAAACGGA ATTTTCACTT ACATCAGTTC TTCCGTGAGA CTGATGGTAT ACTCACTATT GGTTTCGCTT TTGTCGCTCG GCAGGTCACG ACACGGCGTT GTCATTTCGT AAAATATGTA TAGCCAATGT GCTTTTACAA ATGAATCCCG ACGCACCACT AGTTGCCACC AACAAGGATG CATTCGACTT GGTCGGTGTA GACGGACGAC ACATCCCAGG CAACGGATGT GCCGTAGTCG CGCTCGAACA CTCTTCCAAA CGAACGGCCA TCAACGTTGG TAAACCGAGT GCCACTCTGG CCGATCTAAT TGCTGCCGAC CACGGCATTA ATCCATCCAG GACCATGTTC GTTGGCGACC GACTGGATAC AGACATTCAA TTTGGTGTGG AGAATGGAAT GCATTCTGTC TTAGTCATGA CTGGTGTTAC TACTGCCGAC TCGATGGTCC AACTTGGGAA CGGAACGAAC GATGAGCCAC TGCCGAACAT TGTCATACCA CATATTGGTC TTTTATACTA A
|
Protein sequence | MSLRMAGTLA LFVTGATTRA LGRSGGTSKR PFSLRLPLQT GTQPTLALSL SSSASANKPT LSLTEQMRKE SEAELAKLAH HYEDRARNDP AFADLAPIIW KTLDEATAFV NDHIETIMFD CDGVVYRTPD ECPGAKECIQ RLLDKGKRVF FVTNNAASNR SQLRAKLSEI LAIENLTDDM MVPSSYSCAR FLQREILDRK GRGRLFVIGS RGLCDELEQT GFEVLTGNGP LDSDASMTRE DLATYPFSEH PVDAVVVANV LLQMNPDAPL VATNKDAFDL VGVDGRHIPG NGCAVVALEH SSKRTAINVG KPSATLADLI AADHGINPSR TMFVGDRLDT DIQFGVENGM HSVLVMTGVT TADSMVQLGN GTNDEPLPNI VIPHIGLLY
|
| |