Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50444 |
Symbol | UGP/PGM |
ID | 7199256 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 79779 |
End bp | 83661 |
Gene Length | 3883 bp |
Protein Length | 1057 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | UDP-Glucose- Pyrophosphorylase/Phosphoglucomutase |
Protein accession | XP_002185375 |
Protein GI | 219130444 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00884767 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCGAAAGAC GAACAACGAG CCTCCCGAAT ATCCTACGGT TGGTTGCTTA ATAACATTGC TTTGACAACT CAAGATGCCT TCTTTCGATC CCATTCGTGT AAGTAAAGTA CCACGTACAC AAGCATGATG TTTGAAACTG AAGCGGCTCT AGTTCGAGCG GCGCGGTTGA ATCGGAATCG CTAACTCGCG TTTTCGATGA CGACGACGGC CAGTCTACTT ATTCTTTGTG CTACTGTAGG ATCTCTCTGC ATGTTCTCTC ACTGTGAATT TGCGTTCCTT CAGGCAAAAA TGGAAGCCGG AGGCTGTGCT CCATCGGCGA TTGCCGCCTT CGAGTCGACC TATGGTAGTC TCGTCTCGGG TGATTCCGGA ATGATTTTGG AAGACTCTAT TGCGCCCGTC CCCCAGCTGG ACAAGACCGC GGAGCTGGAT ATTGCACCCA ACGCCACCCT TCTTGCCGAG ACGGTAGTTC TCAAACTCAA TGGTGGACTC GGCACGGGTA TGGGTCTGGA CAAGGCCAAG TCCCTGTTGC CAGTCAAGGG GGACGACACC TTTTTGGATT TGACCGCCAA ACAAGTCATT CAAATGCGTA AGGAATACGG TTTGAACGTC AAGTTTATGC TCATGAATTC GTTTTCTACT TCCGACGATA CCTTGAGCTT TTTGAGTTCC AAATACCCTG ATCTTGCTTC CGAGGAAGGT TTAGAAATGA TGCAAAATAA GGTCCCCAAG TTGAACGCGG AGACTCTCGA GGTATGAATT TGGTAGTCGA ATGGATACAG GTGACCACTC TCATCGGGTT GACTTCTGAC GCCATTTCTT TGCTTCTTTG GATTAGCCGG CATCTTGTGA ATCCGATCCG GAAAATGAGT GGTGTCCGCC GGGACACGGT GACTTGTACG CGGCCTTGGT TGGCTCTGGT CGTCTTGATG CCCTGCTCAA GGAAGGGTTC AAATATATGT TTGTCTCCAA TTCGGACAAC CTTGGTGCTA GCCTGGACCT TGAAATTCTG ACTTACTTTG CCGAGAAGAA TGTACCCTTC TTGATGGAGT GCTGCGAACG TACAGAAAAC GACAAAAAGG GAGGGCACTT GGCCGTCCGC AAATCCGATG GACAACTTAT TCTTCGGGAA TCTGCTATGT GCGCTGAAGA GGATGAAGAT GCATTCAGTG ATATCAGCAA GCACCGCTTT TTCAACACCA ACAATTTGTG GGTTCGTCTC GATAAACTCA AGGAGATCAT CGACCGCAAT GGCGGCTTTA TTCCTCTGCC CATGATCAAA AACAAAAAGA CGGTCGACCC CAAGGACGAC TCGTCGACCC CGGTACTGCA GTTGGAAACC GCTATGGGTG CCGCTATTGA ATGTTTCGAA GGCGCCAGCG CGGTGGTTGT TCCTCGCACA CGCTTTGCGC CCGTCAAAAA GTGCAGCGAT CTGCTCTTGC TGCGCTCCGA TGCATACTTG CTCGTGGACC ACAAGCCGGT ACTCAATCCA GCCTGCAACG GGAGCGCGCC CGTGATCAAT CTCGACAGCA AACTATACAA GCTGGTCGGC GCCTTGGAAG AAGCAACCCA GGACGGCATT CCGTCCCTCG TCAAGTGCGA CAAATTGACT ATCAAGGGTT TGGTCCGGAT GTCGAAAAAG ACCAAGTTTG TGGGTGATGT CAAGATTGTC AACTCGAGCG CCGAATCTAA GTTTGTGCCC ACCGGTGAAG TAACAGGGGA ACACGATCTG ACGTCTAATG CTGGTCTTGG CAAGCTAAAG CCCACCTCTG TTTCAACAGC ACCAATTGCG GGACAAAAGC CTGGTACTTC AGGACTCCGG AAGAAGGTTG CCGAATTCAA GAAGGAAAAC TACCTTAACA ATTTTGTACA AGCTGCTTTT GACGCCATCA AGGCCAGTGG TACGGACATA TCGAAGGGGT CCTTGGTAAT TGGTGGTGAT GGTCGCTACT TCAACCCTGA AGCAATCCAA ATACTTATTC AGATGGGTGT TGCTAACGGC GTCAGACGTT TCTGGATTGG ACAGGACGGC CTCTTGTCGA CACCCGCCGT TTCTGCGATC ATTCGGGAAG GCGGCCCGCG TTGGCAAAAG GCATTTGGAG CCTTTATTTT GACGGCTAGT CACAATCCCG GTGGCCCAAC GGAAGATTTT GGTATCAAGT ACAACTGCGA ACATGGTGAG CCCGCTCCGG AGAGGATGAC GGATGAAATT TACGCCAACA CAACGACGAT TAAGTCCTAC AAGATTTGTA AGGAATTCCC CAACATTGAC ATTGGCGCTG CGGGCCACTC CAAGATCATG TCTGACGACG GCAGCGCCGA AGTCAATATT GAAGTAATTG ATTCCACCGA AGCTCACGTC AAGTTGTTGA AATCTATTTT TGATTTCTCG GCCATCAGAG GGCTGTTGGA TCGCCCCGAC TTTTCCATGG TCTACGACGC CATGCACGGT GTCAACGGGC CGTACGTAAA AAAAGTATTC TGCGATATTC TGGGGCAGGA CCTCTCCGTC ACACTGAACT GTGTCCCCAA GGACGACTTC AACGGAGGCC ATGCCGACCC CAACCTCACG TACGCCAAAG AGCTTGTTGC CGTCATGGGG CTTAATCGCA AGGGCGAAAA GATCGATATG GGCGGACGTC CTATTCCCAG CTTTGGTGCG GCCGCCGACG GCGACGGAGA CCGCAACATG ATTCTGGGCA CACAGTTTTT TGTCAGTCCG TCCGATTCCT TGGCAGTCAT TGTTGCCAAC GCCGACACCA TTCCATTCTT CCGCACGCAA GGTGGACTCA AGGGCGTCGC GCGGTCCATG CCAACGTCCG GCGCCGTCGA TCTCGTCGCC AAGGACCTGA ACTACAGTTT GTTTGAAACA CCTACGGGAT GGAAATACTT CGGGAACCTG ATGGATTCCA AAGAGCTTTT TGACGGTGCC GAATACACTC CGTTTATTTG TGGGGAAGAA TCGTTCGGCA CAGGCTCCGA TCACATTCGC GAAAAGGACG GACTTTGGGC CGTGCTGGCT TGGCTCAGCA TTTTGGCGCA CGCCAATACT AACAGCCTAA GTGACACACT GGTGACCGTG GAAGACATTG TCAAGGCTCA TTGGGCAAAG TACGGACGCA ACTACTACAG CCGCTGGGAT TTCGAGAACA TGAATGCGAC CAAGGCGAAC GCCATGATGG ACAAGATGCG GGCGGAAACA GACGCGAACA CGGGCAAGAC GGTGGGCAAG TACTCGATCG AAAAGTCCGA CGACTTTGTG TACGTGGATC CCGTGGACGG CTCGGTGGCC AAGAAGCAGG GGATGCGGTT CCTAATGACG GATGGCTCGC GGATTATTTT CCGTTTGAGT GGCACGGCGG GCAGTGGCGC CACGGTCCGC ATGTACATCG AACAGTACGA ACCGACGAAG ATTGACATGG TGGCTTCAGA GGCTTTGGCA GATTTGATTC GAGTCGCACT GGATTTATCT GACCTCAAGG GATTCCTCGG AACTGAAGAA CCAACCGTAA TTACGTAACT GATGTTCGAG CTCTGGCAAC ACGTCCTGCT AGGTCTCAGT GTGGCTAACT AAACGAGCCA GCCAGAACAG TTTCCTCCGT CTGATATATG AATGATGTGA CTCGCTCAGG AATCGATTCG TAATTGTCGA GTAGAGCAAC TTAATAGTGC AACAACGATA GCCCTAGTGC AAAATCCTCG TCTCGTTTCG ATGGGTTCAT GCATCCTAAT GCAAGCTGAA TATTTCGTTG TCTATCCGAG TAATACAAAG AGAAAATTCG GTATTTGGGA TGAGCAGGGG TGAAATTTTC GCTATTTGGG AAAAATCACA CTGTTTCTAA GTGTTTTTAT TTTCGCGGGA AATACTTTCT AAGTAATCTT TTT
|
Protein sequence | MPSFDPIRAK MEAGGCAPSA IAAFESTYGS LVSGDSGMIL EDSIAPVPQL DKTAELDIAP NATLLAETVV LKLNGGLGTG MGLDKAKSLL PVKGDDTFLD LTAKQVIQMR KEYGLNVKFM LMNSFSTSDD TLSFLSSKYP DLASEEGLEM MQNKVPKLNA ETLEPASCES DPENEWCPPG HGDLYAALVG SGRLDALLKE GFKYMFVSNS DNLGASLDLE ILTYFAEKNV PFLMECCERT ENDKKGGHLA VRKSDGQLIL RESAMCAEED EDAFSDISKH RFFNTNNLWV RLDKLKEIID RNGGFIPLPM IKNKKTVDPK DDSSTPVLQL ETAMGAAIEC FEGASAVVVP RTRFAPVKKC SDLLLLRSDA YLLVDHKPVL NPACNGSAPV INLDSKLYKL VGALEEATQD GIPSLVKCDK LTIKGLVRMS KKTKFVGDVK IVNSSAESKF VPTGEVTGEH DLTSNAGLGK LKPTSVSTAP IAGQKPGTSG LRKKVAEFKK ENYLNNFVQA AFDAIKASGT DISKGSLVIG GDGRYFNPEA IQILIQMGVA NGVRRFWIGQ DGLLSTPAVS AIIREGGPRW QKAFGAFILT ASHNPGGPTE DFGIKYNCEH GEPAPERMTD EIYANTTTIK SYKICKEFPN IDIGAAGHSK IMSDDGSAEV NIEVIDSTEA HVKLLKSIFD FSAIRGLLDR PDFSMVYDAM HGVNGPYVKK VFCDILGQDL SVTLNCVPKD DFNGGHADPN LTYAKELVAV MGLNRKGEKI DMGGRPIPSF GAAADGDGDR NMILGTQFFV SPSDSLAVIV ANADTIPFFR TQGGLKGVAR SMPTSGAVDL VAKDLNYSLF ETPTGWKYFG NLMDSKELFD GAEYTPFICG EESFGTGSDH IREKDGLWAV LAWLSILAHA NTNSLSDTLV TVEDIVKAHW AKYGRNYYSR WDFENMNATK ANAMMDKMRA ETDANTGKTV GKYSIEKSDD FVYVDPVDGS VAKKQGMRFL MTDGSRIIFR LSGTAGSGAT VRMYIEQYEP TKIDMVASEA LADLIRVALD LSDLKGFLGT EEPTVIT
|
| |