Gene PHATRDRAFT_20360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_20360 
SymbolPDH1 
ID7201049 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp850004 
End bp853045 
Gene Length3042 bp 
Protein Length814 aa 
Translation table 
GC content53% 
IMG OID 
Productprecursor of dehydrogenase pyruvate dehydrogenase E1, alpha and beta subunits 
Protein accessionXP_002180334 
Protein GI219119135 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.387314 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAAACAGAGT GAGAACACTT GCTCGGAATC GAAGCAATAT AAGACTCTCT ATACCCTACC 
TATACCGCAC GCAACACAAC GCAAACTACC AGATATCCTA CACGCATCCT TCCAATCGTC
GAACCGGTCT TATTCTTCCT CTCGAACGAC TGTAACGTAC CCCAACGCCA CACTCCTTCC
TTTCGTTGGT GTTCTTTTAA CGTATCTCAC AGTCCGAAAC ATACAACACT ACTAACGTTA
CTCATTTTCG GTCCACTCTC GTCATGAAGT TCTCCACTGC CACTCTCGCG CTTTGCGTCG
CCACGGCTTC CGCCTTTGTG TAAGTCGAAT CGAACGGACT TTCCAGTCCT CTCGGCACAC
CCAAGATACC CTTTGTTTTA GGGTCTCTCA CCCAGCCTTT TTCTTGTTTT TGTTTGCAGT
CCCGTCGCCT TGCGACCCCA ATCCAATGGC GTGACGACCT ATTCACGAAC AGGCCACGCG
CTCCCAGTGT CTCGAATGTC CACGACCGTG GAAGATACAA CGACGACCGA AGAAGCTTCC
TTTGCGGTAC GTCACCATTG GTGGAAAGAG ACGAGTCTCC GTGCCACATC ACGGCTGCCC
CCATGGGAAG GAATCCTCGA GACTACCAGT CTCCACCTTT TGGCGTTTAC CTCGTTTCTC
ACTCGCACCC CTCTCCTTTT CTCGGTAGCC CATGCGACCC CCCGTCGACT TGCCGTGGCA
CAAAATTACG AAACAACTCC AGGACGCCTT TGGCTACACG GACACCGAAA TCGAAGCCTA
CAATTCCCTC GACGGCGACA AAGAAGCCCT GCTCAAACTG TACAAGGCCA TGATGCTGGC
GCGGGGCTTC GAAAATGCCT GCAATCAGCA GTACATGCAG GGCAAGATCC GCGGATTTAT
GCATCTCGAC AACGGTCAAG AATCCATTCC GGGTTTGGTC GATTACGCCG TCAAGACGCA
GGACAAGAAA TTCTCCTACT ACCGCGAACA CACGCATGCA CTAGCCTCCG GATGCGACCC
GGGGGCCATC ATGGCCGAGC TAATGATGAA GGATACCGGA TCGTGCCGGG GAGCCGGCGG
CTCCATGCAC ATTTTTGACA AGGAAAAGTA CTTTCAGGGT GGCTGGGCCT TGGTCTCCGA
GCAGTTGCCT TACGCAGCCG GCGCCGCCAA GAGTATCTTG CTCGATCGGG CACTCGGTCT
AAGCGACGAC GAAAAAATTG TCAAGGGAAA CGTCGCGCCG CCGGCGGATG ACGATCGAAT
CAGTGTCGTC TTTATTGGGG AAGGTGGGGC CCAGAACGGA CGCATGGCGG AACTGCTCAA
CGCCGCCAGC AAGGACAACC TGCCCCTGCT CATCATTGTT ATCGACAATG GTCGTGCCAT
TAACACATTC ACGCCCGATA TCGCCAGGAA CTCCAACGTT TACCAACAAG GCTTGCACTA
CGGCGTCCCC GGTTTGTTAG TCGATGGTCT GAACGCAGTC GACGTGGCCA AGGGCGGCAA
GGCCGTGGTG GATTACATTC GGGCGGGCAA AGGACCGGCC ATTCTGCAGG TTCACACTTA
CCGCTTCAAC GGCCACTCGC CTGCCGATCC CGAACACGAA CGGGGTCGCA AGGAAGAAAA
GGCGTGGGCG CGCAACGCAC AGGATCCCAT CAAAGCCTTT GAAGACGCTT ACACGGCCAA
CGGCGTCTTT ACCGAGGACG ACCTCAAGGC CGCCAAGAAG GAAATCTTGG CACAAGTCAA
GGCCAGTGTT GAATTCGCCG ACAAGTCGCC AATGCCACCA GTGGAACTTG CCAAGGAACT
GGAATACCCC GACAAGCCCA GTACGGATTA CAATGTTCGC AGTGGACCGG CATGGGCGGA
TGAAGTTAAC CAGCGTACCA TTTCGAGCTC GCAAATGGAA ACAATCCAAG CCCATATTGC
TGCTCTGCAA CAAAAGGCCA AGGATGGTGA GATTTCCATT GGTGACGCCA TTAATCTGGC
CATTCATGAA GAAATGCTTC GTGATCCCAC CACAACTATT CACGCGGAAG ATTTGCAGGC
TGGCTCGTCG TACGACATTC CCAAATTGAC CCAGCAAACC TACGGGCAAA TTCGTGCCGC
GGATGAGATT ATTGATGAAG GACACTTTAT TGGCAAGGCT TTGGGCGAAG CACTCAACGG
TTATCGTCCA ATTGTCGAAC TCATGAATAC CAACTTTGGT ATCTACGGCA TGGCTGAACT
CTCGTCGGCA GGTAACACTT TCGCCACTAC TGGTGGGCAA TTCGACATGC CCATGACGAT
CATCGGTGCT GGTGGTACTG CCCCCGATCA AGCTTTGGGT GCCGAGCACA GTCAACCGTT
TCATGCGTAC GTTATGGGTA TTCCCGGCCT AAAAATTGGC ACCGCCGCAT CACCCGATGC
CGCCTATGGT CTGACTAAAT CCATGATTCG CGACAACGGT CCGTGCTTTT TGTTTGCTCC
CGTGAAAATG ATGAAGGAAT CCAAAGGAAA GGTTGATATT GGCAAATGCA TGCCTCTGAA
CAAGGCAGCA TTACTGCACG AGGCCTCCGA GGCGACCGTC AAGGCAGGCA AGGCCGTGAC
TGTTTTGACG TACCTGCATG GCGTGAAGGA AGCAACCGCG TCAATCGACG CGATCCGGGA
AGAAGGCTTC GATATTGATT TGATTGAATT GCGATCTCTC AAGCCGCTGG ACATGGAAAC
GATTACAACG AGTCTTGCGC GTACCAATAA GATGGCCATT TTGGACGAAT CAACCAAGTC
TGGTGGAGTC GGTGCAACCA TTTCGGCTCA AGTAAGCGAG GAATTGTTTG ATTTGCTAGA
TGCCCCCGTA AAGCGACTCT GCATGGACGA TGCCCCCGTA CCGTACGCGA GTAGTATGGA
AAAGGCTGTC GTAAAGCGTG GCTCCGATTT GATTGAAGGT GTCTTTAATT TGTGCACCAA
AAAATTCTGA ATAAAAAGTG GTAGACCTTG ACTTCTGTCT CTTTTGCCAT GTTCTCTATC
ATGAACCTCA GAACTACTAT ATAAACTTTC GCTATCTATT TA
 
Protein sequence
MKFSTATLAL CVATASAFVP VALRPQSNGV TTYSRTGHAL PVSRMSTTVE DTTTTEEASF 
APMRPPVDLP WHKITKQLQD AFGYTDTEIE AYNSLDGDKE ALLKLYKAMM LARGFENACN
QQYMQGKIRG FMHLDNGQES IPGLVDYAVK TQDKKFSYYR EHTHALASGC DPGAIMAELM
MKDTGSCRGA GGSMHIFDKE KYFQGGWALV SEQLPYAAGA AKSILLDRAL GLSDDEKIVK
GNVAPPADDD RISVVFIGEG GAQNGRMAEL LNAASKDNLP LLIIVIDNGR AINTFTPDIA
RNSNVYQQGL HYGVPGLLVD GLNAVDVAKG GKAVVDYIRA GKGPAILQVH TYRFNGHSPA
DPEHERGRKE EKAWARNAQD PIKAFEDAYT ANGVFTEDDL KAAKKEILAQ VKASVEFADK
SPMPPVELAK ELEYPDKPST DYNVRSGPAW ADEVNQRTIS SSQMETIQAH IAALQQKAKD
GEISIGDAIN LAIHEEMLRD PTTTIHAEDL QAGSSYDIPK LTQQTYGQIR AADEIIDEGH
FIGKALGEAL NGYRPIVELM NTNFGIYGMA ELSSAGNTFA TTGGQFDMPM TIIGAGGTAP
DQALGAEHSQ PFHAYVMGIP GLKIGTAASP DAAYGLTKSM IRDNGPCFLF APVKMMKESK
GKVDIGKCMP LNKAALLHEA SEATVKAGKA VTVLTYLHGV KEATASIDAI REEGFDIDLI
ELRSLKPLDM ETITTSLART NKMAILDEST KSGGVGATIS AQVSEELFDL LDAPVKRLCM
DDAPVPYASS MEKAVVKRGS DLIEGVFNLC TKKF