Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20360 |
Symbol | PDH1 |
ID | 7201049 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 850004 |
End bp | 853045 |
Gene Length | 3042 bp |
Protein Length | 814 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | precursor of dehydrogenase pyruvate dehydrogenase E1, alpha and beta subunits |
Protein accession | XP_002180334 |
Protein GI | 219119135 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.387314 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAACAGAGT GAGAACACTT GCTCGGAATC GAAGCAATAT AAGACTCTCT ATACCCTACC TATACCGCAC GCAACACAAC GCAAACTACC AGATATCCTA CACGCATCCT TCCAATCGTC GAACCGGTCT TATTCTTCCT CTCGAACGAC TGTAACGTAC CCCAACGCCA CACTCCTTCC TTTCGTTGGT GTTCTTTTAA CGTATCTCAC AGTCCGAAAC ATACAACACT ACTAACGTTA CTCATTTTCG GTCCACTCTC GTCATGAAGT TCTCCACTGC CACTCTCGCG CTTTGCGTCG CCACGGCTTC CGCCTTTGTG TAAGTCGAAT CGAACGGACT TTCCAGTCCT CTCGGCACAC CCAAGATACC CTTTGTTTTA GGGTCTCTCA CCCAGCCTTT TTCTTGTTTT TGTTTGCAGT CCCGTCGCCT TGCGACCCCA ATCCAATGGC GTGACGACCT ATTCACGAAC AGGCCACGCG CTCCCAGTGT CTCGAATGTC CACGACCGTG GAAGATACAA CGACGACCGA AGAAGCTTCC TTTGCGGTAC GTCACCATTG GTGGAAAGAG ACGAGTCTCC GTGCCACATC ACGGCTGCCC CCATGGGAAG GAATCCTCGA GACTACCAGT CTCCACCTTT TGGCGTTTAC CTCGTTTCTC ACTCGCACCC CTCTCCTTTT CTCGGTAGCC CATGCGACCC CCCGTCGACT TGCCGTGGCA CAAAATTACG AAACAACTCC AGGACGCCTT TGGCTACACG GACACCGAAA TCGAAGCCTA CAATTCCCTC GACGGCGACA AAGAAGCCCT GCTCAAACTG TACAAGGCCA TGATGCTGGC GCGGGGCTTC GAAAATGCCT GCAATCAGCA GTACATGCAG GGCAAGATCC GCGGATTTAT GCATCTCGAC AACGGTCAAG AATCCATTCC GGGTTTGGTC GATTACGCCG TCAAGACGCA GGACAAGAAA TTCTCCTACT ACCGCGAACA CACGCATGCA CTAGCCTCCG GATGCGACCC GGGGGCCATC ATGGCCGAGC TAATGATGAA GGATACCGGA TCGTGCCGGG GAGCCGGCGG CTCCATGCAC ATTTTTGACA AGGAAAAGTA CTTTCAGGGT GGCTGGGCCT TGGTCTCCGA GCAGTTGCCT TACGCAGCCG GCGCCGCCAA GAGTATCTTG CTCGATCGGG CACTCGGTCT AAGCGACGAC GAAAAAATTG TCAAGGGAAA CGTCGCGCCG CCGGCGGATG ACGATCGAAT CAGTGTCGTC TTTATTGGGG AAGGTGGGGC CCAGAACGGA CGCATGGCGG AACTGCTCAA CGCCGCCAGC AAGGACAACC TGCCCCTGCT CATCATTGTT ATCGACAATG GTCGTGCCAT TAACACATTC ACGCCCGATA TCGCCAGGAA CTCCAACGTT TACCAACAAG GCTTGCACTA CGGCGTCCCC GGTTTGTTAG TCGATGGTCT GAACGCAGTC GACGTGGCCA AGGGCGGCAA GGCCGTGGTG GATTACATTC GGGCGGGCAA AGGACCGGCC ATTCTGCAGG TTCACACTTA CCGCTTCAAC GGCCACTCGC CTGCCGATCC CGAACACGAA CGGGGTCGCA AGGAAGAAAA GGCGTGGGCG CGCAACGCAC AGGATCCCAT CAAAGCCTTT GAAGACGCTT ACACGGCCAA CGGCGTCTTT ACCGAGGACG ACCTCAAGGC CGCCAAGAAG GAAATCTTGG CACAAGTCAA GGCCAGTGTT GAATTCGCCG ACAAGTCGCC AATGCCACCA GTGGAACTTG CCAAGGAACT GGAATACCCC GACAAGCCCA GTACGGATTA CAATGTTCGC AGTGGACCGG CATGGGCGGA TGAAGTTAAC CAGCGTACCA TTTCGAGCTC GCAAATGGAA ACAATCCAAG CCCATATTGC TGCTCTGCAA CAAAAGGCCA AGGATGGTGA GATTTCCATT GGTGACGCCA TTAATCTGGC CATTCATGAA GAAATGCTTC GTGATCCCAC CACAACTATT CACGCGGAAG ATTTGCAGGC TGGCTCGTCG TACGACATTC CCAAATTGAC CCAGCAAACC TACGGGCAAA TTCGTGCCGC GGATGAGATT ATTGATGAAG GACACTTTAT TGGCAAGGCT TTGGGCGAAG CACTCAACGG TTATCGTCCA ATTGTCGAAC TCATGAATAC CAACTTTGGT ATCTACGGCA TGGCTGAACT CTCGTCGGCA GGTAACACTT TCGCCACTAC TGGTGGGCAA TTCGACATGC CCATGACGAT CATCGGTGCT GGTGGTACTG CCCCCGATCA AGCTTTGGGT GCCGAGCACA GTCAACCGTT TCATGCGTAC GTTATGGGTA TTCCCGGCCT AAAAATTGGC ACCGCCGCAT CACCCGATGC CGCCTATGGT CTGACTAAAT CCATGATTCG CGACAACGGT CCGTGCTTTT TGTTTGCTCC CGTGAAAATG ATGAAGGAAT CCAAAGGAAA GGTTGATATT GGCAAATGCA TGCCTCTGAA CAAGGCAGCA TTACTGCACG AGGCCTCCGA GGCGACCGTC AAGGCAGGCA AGGCCGTGAC TGTTTTGACG TACCTGCATG GCGTGAAGGA AGCAACCGCG TCAATCGACG CGATCCGGGA AGAAGGCTTC GATATTGATT TGATTGAATT GCGATCTCTC AAGCCGCTGG ACATGGAAAC GATTACAACG AGTCTTGCGC GTACCAATAA GATGGCCATT TTGGACGAAT CAACCAAGTC TGGTGGAGTC GGTGCAACCA TTTCGGCTCA AGTAAGCGAG GAATTGTTTG ATTTGCTAGA TGCCCCCGTA AAGCGACTCT GCATGGACGA TGCCCCCGTA CCGTACGCGA GTAGTATGGA AAAGGCTGTC GTAAAGCGTG GCTCCGATTT GATTGAAGGT GTCTTTAATT TGTGCACCAA AAAATTCTGA ATAAAAAGTG GTAGACCTTG ACTTCTGTCT CTTTTGCCAT GTTCTCTATC ATGAACCTCA GAACTACTAT ATAAACTTTC GCTATCTATT TA
|
Protein sequence | MKFSTATLAL CVATASAFVP VALRPQSNGV TTYSRTGHAL PVSRMSTTVE DTTTTEEASF APMRPPVDLP WHKITKQLQD AFGYTDTEIE AYNSLDGDKE ALLKLYKAMM LARGFENACN QQYMQGKIRG FMHLDNGQES IPGLVDYAVK TQDKKFSYYR EHTHALASGC DPGAIMAELM MKDTGSCRGA GGSMHIFDKE KYFQGGWALV SEQLPYAAGA AKSILLDRAL GLSDDEKIVK GNVAPPADDD RISVVFIGEG GAQNGRMAEL LNAASKDNLP LLIIVIDNGR AINTFTPDIA RNSNVYQQGL HYGVPGLLVD GLNAVDVAKG GKAVVDYIRA GKGPAILQVH TYRFNGHSPA DPEHERGRKE EKAWARNAQD PIKAFEDAYT ANGVFTEDDL KAAKKEILAQ VKASVEFADK SPMPPVELAK ELEYPDKPST DYNVRSGPAW ADEVNQRTIS SSQMETIQAH IAALQQKAKD GEISIGDAIN LAIHEEMLRD PTTTIHAEDL QAGSSYDIPK LTQQTYGQIR AADEIIDEGH FIGKALGEAL NGYRPIVELM NTNFGIYGMA ELSSAGNTFA TTGGQFDMPM TIIGAGGTAP DQALGAEHSQ PFHAYVMGIP GLKIGTAASP DAAYGLTKSM IRDNGPCFLF APVKMMKESK GKVDIGKCMP LNKAALLHEA SEATVKAGKA VTVLTYLHGV KEATASIDAI REEGFDIDLI ELRSLKPLDM ETITTSLART NKMAILDEST KSGGVGATIS AQVSEELFDL LDAPVKRLCM DDAPVPYASS MEKAVVKRGS DLIEGVFNLC TKKF
|
| |