Gene PHATRDRAFT_45333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45333 
Symbol6PGDH 
ID7200028 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp869375 
End bp871526 
Gene Length2152 bp 
Protein Length519 aa 
Translation table 
GC content51% 
IMG OID 
Product6-phosphogluconate dehydrogenase 
Protein accessionXP_002179525 
Protein GI219117461 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0580309 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATAAAACGG CCATCGCAAC TGCTGTTCAC GAGAACTGTT GCAATCGCGA AGCAATCGCA 
AAAGGTGCGT ACCATGTCTC CACACACCGG AACAGTACAG GATGATACTG ACTGGGTTGA
TGGTTGGTTC TAGAGACGAG AATGTTGGTT TGATTCTTTG CGATTTGGAA TTTACGCAAA
GGACCCGAAC CACAGGATCT TCCAAGAATG ACTTTCATCT TCCACGCTCT CTGACTACGT
TGGCGGATGT CGTCCCGGAT CGGCTTTGGC GACGTACCTT AGCCGACAAT ACAGTACACC
GTGTAGCATA CTCATGTTCC TCACAATCCT ACTGTTTGCC GCTTTCTGCA CGACAACTTT
AGAATCTATT CAAACACTTT CCACCAACGA GATAAACAGG ATGAGCTGCG ATATTGGTCT
TTACGGTCTT GCTGTCATGG GACAGAATTT TGCGCTCAAT ATGGTACGTG TAGGGAGCAT
TCGAGTTGAA TCAATCACCG ACGCACTGGT GGACTCCGTA CTGCGACCGA CCAAACCATG
TCTCACGGAT TTTGCCTTGC TGTGTCGTTT TCTCTGTAGG CGTCGCACGG GTTCACCGTT
GCCGTTTGCA ACCGCTCGCC CTCCAAAGTC GATACGACGG TCCAACGCGC CAAGGACGAA
GGCGATTTGC CCTTGATCGG TACCAAATCT CCCGAAGAAT TTATTTCCAA ACTCAGCAAG
CCTCGGAAAG TCGTCATTCT CGTCCAAGCC GGAAAACCTG TCGATTTGAC CATCGAAGCG
ATCAGCGAAT TCATGGAAGA AGGGGATGTC ATTATTGACG GAGGCAACGA ATGGTTCCCG
AATCAGATTC GTCGTCACGA AGAATTGGAA AAGAAGGGTA TCATGTTCAT CGGCATGGGA
ATTTCTGGTG GCGAAGAAGG AGCCCGCAAC GGACCTTCTC TCATGCCTGG CGGTCCCCGA
AAGGCGTACG ACTTGATTGA ACCCATCATC ATGAAGTGTG CCGCCAAGGC TGGGGATCCG
GAAGAACCCT GCACGGGTTA TTGCGGACCA ATCGGAGCGG GCAATTACGT CAAAATGGTG
CACAACGGTA TCGAATACGG CGACATGCAG TTGATTGGAG AGGTCTACGA TATTCTAAAG
GTAGGCTATT CAACGGAGCG GAGCCTGTGC ACCTAAACCT TTGGAGCTTG TTCTCACCTA
TCCTGTTGCA GAATATTGTC GGTATGGGCA ACGATGAAAT GGCCACACTC TTTGAAGACT
GGAACTCTGG TGATCTCGAG TCGTACCTCA TTGAAATCAC GGCCAAGATT TTGGCTCGCA
AGGACGATTT GACCGACGAC GGATACGTGG TGGACAAGAT TCTTGACAAG ACAGGAATGA
AAGGTACTGG CCGTTGGACG GTACAAGAAG CTGCCGAACA GAGTGTTGCA GCCCCTCTCA
TTGCAGCTTC CCTCGATAGT CGCTACATTT CCGGCCGCAA GGAAGAACGT GTCGCTGCCA
GCAAAGTCCT CCAGGGACCA TCCAACGAAA TGCCGCAAGT CGACAAGGAT CAAATCTTGT
CGGATCTGCA GCAAGCGTTG TACTGCGCCA AGGTAACTTC GTATGCGCAG GGAATGGGAA
TCATCCAGGC CGCGTCCGAC AAGAACGAGT GGGACGTCGA CCTCTCCCTC TGTGCCAAAA
TGTGGCGTGG AGGCTGCATC ATTCGCGCGA GCTTGTTGAG CAAGATCACG GCCGCCTTTG
AAAAGAACAA GGACTTGCAG AATTTGTTGG TGGACGAAAC GTTTGCTGAA GAAATCAACG
CAAGGCAGAT GGCTTGGCGC CGCGTGGTTT CTTTGGGCGT CGCTAGCGGT ATCGCTACCC
CGGCCTTGTC CGCTGCTCTT TCCTACTTTG ACCAGTACCG TCGTGATCGC TTGCCGGCCA
ACCTGATTCA AGCGCAGCGC GATTTCTTTG GCGGACACAC TTACAATCGT GTCGATCGTG
ATGGCACTTT TCATTGCTTG TGGGACGAAA CGCACAAGGA TATTGGTGAT TTGACGGGCC
GCACCGCGGG TGAGCTCTAG ACGCCACGGA TCCCGAGTCG CGTTAACGTA TATCAAATCT
AAGTATTGGT AAATCACAAA CTTCGTCAGC AATAGCGAGT CATTTTAATA TC
 
Protein sequence
MFLTILLFAA FCTTTLESIQ TLSTNEINRM SCDIGLYGLA VMGQNFALNM ASHGFTVAVC 
NRSPSKVDTT VQRAKDEGDL PLIGTKSPEE FISKLSKPRK VVILVQAGKP VDLTIEAISE
FMEEGDVIID GGNEWFPNQI RRHEELEKKG IMFIGMGISG GEEGARNGPS LMPGGPRKAY
DLIEPIIMKC AAKAGDPEEP CTGYCGPIGA GNYVKMVHNG IEYGDMQLIG EVYDILKNIV
GMGNDEMATL FEDWNSGDLE SYLIEITAKI LARKDDLTDD GYVVDKILDK TGMKGTGRWT
VQEAAEQSVA APLIAASLDS RYISGRKEER VAASKVLQGP SNEMPQVDKD QILSDLQQAL
YCAKVTSYAQ GMGIIQAASD KNEWDVDLSL CAKMWRGGCI IRASLLSKIT AAFEKNKDLQ
NLLVDETFAE EINARQMAWR RVVSLGVASG IATPALSAAL SYFDQYRRDR LPANLIQAQR
DFFGGHTYNR VDRDGTFHCL WDETHKDIGD LTGRTAGEL