Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48983 |
Symbol | PGK |
ID | 7195255 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 222740 |
End bp | 225660 |
Gene Length | 2921 bp |
Protein Length | 448 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183701 |
Protein GI | 219126933 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCTCTAAG ATCCCGTGGA GCAAATACCG CAACTCCTCC CGTATCATCA TGTTCCGTAT GTTGACTTCA ACGGCTTTGC GGCGTTCACC AGTAACCAGC AGCTTGACCT CTTGCTGTAA AGCAAATGCT TTTGCAGTCC GAATTCGTAG CTTTCACGCT GCCCCAGTGA TCCAAGCCAA AATGACGGTC GAGCAACTGG CCCAGCAAGT CGATATGAAA GGGACCAATG TTCTCGTGCG CGTTGATTTG AATGCCCCCC TGGCCACGGT ACGTCCCATC GAATCTTTTG GTTTCTCGGA TTGTTCGCTT TCGGATATAT ACTCTTGTTC CCAACGCCGT TGATGAATCT CTTTCTGATC TGCTGGACTT TCACATTAGG ATGATGTAAC GGTGACGGAT GACACACGCC TGCGCGCCAT TGTTCCAACC ACGAAATTCT TGCTCGAGCA GGGAGCCAAC GTGATTCTCT GCAGCCATTT TGGCCGACCC AAGGGTGAAA TTATCGAAAC TGGCAAGAAT GGTCGGCTCA ACCCAGTGGT GAAGCCGCTC GAAGTACTGC TGGGGCAGAC GATTACCAAA CTTGATGATT GCATGGGACC GGATGTTGAA GCCGCAACGA AAAATTTGGG TGAGGGAAAG GTTGTCTTGT TGGAAAACAC CCGTTTTTAC AGCGGAGAAA CCAAGAACGA TCCGGAACTC GCAGCCGGGC TCGGAAAACT TGCTGATTAT TTCGTCATGG ACGCCTTTGG CACGGCTCAC CGAGCACATT CTTCGACTGC CGGTGTGACC ACCCATATGA AGTTCAACGC TGCTGGAAAA CTTATGGAGA AGGAATTGCA ATACTTGCAA GGTGCCGTGG AAGAGCCCAA ACGCCCCATG ATGGCTATTG TCGGTGGTGC CAAAGTATCC ACCAAAATTC CGGTCATCGA ATCGCTTTTG GACAAGTGCG ATGTCATTTT GATTGGCGGC GGTATGATCT TTACCTTTTA CAAGGCTCTC GGTTACGACA TTGGCGCATC GCTCGTGGAA GACGACATTG TGGAATTGGC GAGTTCATTG ATGAAAAAGG CCGAAGAAAA GGGTGTCAAA CTGATTCTGC CTGTTGACGT TGTCTTGGCT GATAAGTTTG ACAATGATGC AAACTCGGCC GTGGCGAAGG TCACCGACAT TTCCGGAGAC TGGATGGGAC TGGACATTGG ACCGGAAACG ATCGATTTGT TTCGTAGCGA AATTGCCGAA GCCAATACAA TTGGTACGTC GTTACGCGTT ATTTGATTGA AAGTGGGTCG GGTTATTGTT TGTTTGCCAT CTCATGTTCG CTATTGTCTC TTGCTATGAT AGTTTGGAAC GGTCCGATGG GCGTCTTTGA ATTTAGCAAC TTTGCGGCTG GCACCAACGA TGTTGCCCAG ATGCTCGCCC AAGCCACTGC CGAACGGGGC GCCGTCACCA TAATTGGTGG TGGCGACTCT GTGGCCGCCG TCAACAAAGC GGGTTTGGGT GACAAGGTTT CTCACATTTC GACGGGCGGT GGCGCCAGTT TGGAGCTTCT GGAGGGCAAG GTATTGCCCG GCGTTGCGGC TTTGACAGAA GTGTAAGCCG GGAATGCGTA TAGTCTAGCA ATATTTATAA TACTGTCGTA AGAAACGAGT CATCAGTTGG ACAAACGGTA TCAGTTAGTG CAAATCGGGC GGAGCCATCA TACACAATCA GAATTGCTGT ACACGAGATC CGCCATAGAA AGCCGATAGT GACCTGCAGT ATAACGACTT TAATTTGCGC TCGCGTACAT TGAATCGTAT CGACACGAAT TTCCCGTGCA GTTCCTTTGG TACCTACACA TGCTGCAAAG TGGAACCATC CGATCCCTTC GGAAACGTTG ATTCCACCAT GTTTTTGACA CTGCTAAAGC TTCTTTTCGT ATATTGATAA AGCAATGAGC TATCCATGGC AATGTTCAAA GGACTCGGTA CGCTATCGGC GGAAGGTGGT ACGGCGGCCC TATCTTTGGC GAGAATGTGT TGGGGGTTGT ACCCCAAGTG CTCAAAGACT AGTTCCGCCA TATCTACCCG ACTAACAGCA GTCGGGCCAC CCATATTGTA CACGCCCGTG GGGACGGTAT CACCGTTGCG AAAACTGGCG AGCAGTCCCA GTATTACCGC TACGACGTCG TGTACAGACA CCACGGACCG AACTTCATCC CGGTAAAAGA CCGTGTCCAC GCCTTGGCGG GTGGCACAAA AGTGCAAAAA TGTTTCGTGC GCTATTTCGG GAAGAACGGG GGCACGGGGA CCCAGAATGA TACTGCTACG GAGTATCAAG GTACGACAGT TGTTGCTTTG CAACAAATGA TGTTCCAAAG CTAGTTTGGT TCTTCCGTAC ACATTACAAG GTTCCGGAGG AGTGTCCTCT CGGTAAGGTG GTTCTGTTCC ATCGTAGACT TGGTCCGTGG ATAAAGCAAT AACGTAGGTA TTTCGAACTT CCAACAGGGC ATCCAGAAAG GCTTTTGGAC AGTTACTGTT ATGGGCGAGC TCGGGTTGTG CTTGGCAGGT CCGGGGACTC GACAAGGCGG CCGTATGGAT ACAAACGTCC AATATTGGTA TCCTGGCGAA CCAGTCCCGT ACGGCTCTTT GATCGCTTAA ATCTAGTGCC TGTACGTGTA CCTTGGTAGT CGGAAATTGG GCGGCAGCCG TTGATACTGC AGCAGCAAAG CCTTCGGCAC GGTGATACAA CGCGTATATT TCGTAGGAGT GTTGTTCTGG GTGACTCGAT GATTGGAAGA GCGATGCCAA AATATGTTGC CCCAAGTAGC CCGAGGCTCC TGTCAAGAGA ATTCTGAACG CATTGGAAGA ATTATCACAA CGAGAAGTCT CTACTGGACT TCTGGATATC GTACCGGTCG TCACCATGGT CAGTCTCGAC G
|
Protein sequence | MFRMLTSTAL RRSPVTSSLT SCCKANAFAV RIRSFHAAPV IQAKMTVEQL AQQVDMKGTN VLVRVDLNAP LATDDVTVTD DTRLRAIVPT TKFLLEQGAN VILCSHFGRP KGEIIETGKN GRLNPVVKPL EVLLGQTITK LDDCMGPDVE AATKNLGEGK VVLLENTRFY SGETKNDPEL AAGLGKLADY FVMDAFGTAH RAHSSTAGVT THMKFNAAGK LMEKELQYLQ GAVEEPKRPM MAIVGGAKVS TKIPVIESLL DKCDVILIGG GMIFTFYKAL GYDIGASLVE DDIVELASSL MKKAEEKGVK LILPVDVVLA DKFDNDANSA VAKVTDISGD WMGLDIGPET IDLFRSEIAE ANTIVWNGPM GVFEFSNFAA GTNDVAQMLA QATAERGAVT IIGGGDSVAA VNKAGLGDKV SHISTGGGAS LELLEGKVLP GVAALTEV
|
| |