Gene PHATRDRAFT_43120 details


Gene Information

Locus tag: PHATRDRAFT_43120
Symbol: GPH
ID: 7196892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 2092452
End bp: 2093611
Gene Length: 1160 bp
Protein Length: 291 aa
Translation table:
GC content: 51%
IMG OID:
Product: phosphoglycolate phosphatase, PGPase
Protein accession: XP_002176905
Protein GI: 219110307
COG category:
COG ID:
TIGRFAM ID:
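
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive (which is consistent with the listed gene length), and that the protein is encoded by one uninterrupted reading frame followed by a stop codon (consistent with the record listing the gene as unspliced). The function names `span_length` and `orf_nucleotides` are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def orf_nucleotides(protein_aa: int) -> int:
    """Nucleotides in an unspliced reading frame: three per residue plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_bp = span_length(2092452, 2093611)
orf_bp = orf_nucleotides(291)

print(gene_bp)           # 1160, matching the Gene Length field
print(orf_bp)            # 876
print(gene_bp - orf_bp)  # 284 bp of the gene span lie outside the reading frame
```

Under these assumptions, the listed gene span is longer than the reading frame alone, so the gene sequence shown in this record would include flanking sequence on either side of the coding region.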


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.691617
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GAGTTTCCCT GCAGAAAGAG GATCCCGATC GAAAGTGAGC CGAAATTTTT TTGAGAGTAC 
CTGCGATCTC TATAAACTAG CAGAATTGAA AGTTGTTGTT GGTGTCGCAA TCGAGACCAC
TGTTTAGCAA GGTAATGCTC CCTTCCTTCA TGATCCCACG TTCATTCCTT CTTTACTGTC
TGTGGGCCCA ACAGACCATT TCATCTTTCC CGCTTCACTC TACAACCCCA CAAACTATCT
CCAGCTTTCG GTCGTTCAGG CTACCTGCAT CCAGTGTCCC TGCTGCGAAC GACGACGCCA
ACAACAGCAT GTACCAAAAT ATCAAAGGCA TCATTTTCGA CGTTGACGGC ACGTTAGCCG
ACTCCTGGAA GCTAGGATAC GACGCAACGG TCGTTATACT CGACAAACAC AATCTCCACC
CTATCACGGA ACAAATTTAT CACGAGCACA CGGTATACTG CACTCCGGAG CGACTCGCCC
GACACGCCGG TCTCGTACCA GGCGACGAAA CGTACGCCGA AGTTGGTGCC AAGCTCGGGA
AGGAGTTTGA CGATTTGTAC GTTGGTCTCG TGTCGTCCCA AACGGCAGGC TTTTACCCTG
GCGTGGCGGA GTGTTTACAG GCCATTCCAT CCGATATTGC CTTCGGGGCT CTGACGAATG
CGGCGGTGAA CTACGCGCAC GCTGTTTTGC AAGTCAACGA TCAAAATAAA AATCTCGTGA
ACCGTTTTGT CTCCATTCAC GGGGCCGACT CAGTGCCGGA GCCCAAACCG TCTCCCGCTG
GTTTGCTTCA AGTATGCCGA GATCTGAATC TGCGACCCGC AGACTGTGTT TACATTGGTG
ATAGCCCGAG TGACGGCAAA GCCGCAGAAG CGGCTGGTAT GGGAGCGATA GCCGTCTTGT
GGGGCAGCCA CAAAGAAGAC ACCTTAAAGC AAGCGCCCTT TACACATTAC TGTCGGACGG
TCCAAGAATT GCAAGCCCTT CTGCCCAAAA CCTCCGCGGC CGTGAGCTAG TGGTGCATTC
GTGGAAACAC AGTCAATGTG CCATCGTGAT CTGCATTTGT GCACGACCAG TGTAGAATAC
CTAGAGAATC TCCACACATT TTTCTGTTGA TTCTCTACCC TAAACCAACT TAATAATGAA
TAAAATTTTT TTGAAGGAGG
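
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks need to be stripped before any calculation. As an illustration, here is a minimal Python sketch of how a GC fraction could be computed from such a blocked sequence. The helper name `gc_fraction` is hypothetical, and the record does not state how its own GC content value was derived, so this is only one plausible way to compute it.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring spaces and line breaks."""
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence, copied as displayed (blocks of ten bases).
first_line = "GAGTTTCCCT GCAGAAAGAG GATCCCGATC GAAAGTGAGC CGAAATTTTT TTGAGAGTAC"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence, with its line breaks left in place, would give the figure for the whole gene span.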
 
Protein sequence
MLPSFMIPRS FLLYCLWAQQ TISSFPLHST TPQTISSFRS FRLPASSVPA ANDDANNSMY 
QNIKGIIFDV DGTLADSWKL GYDATVVILD KHNLHPITEQ IYHEHTVYCT PERLARHAGL
VPGDETYAEV GAKLGKEFDD LYVGLVSSQT AGFYPGVAEC LQAIPSDIAF GALTNAAVNY
AHAVLQVNDQ NKNLVNRFVS IHGADSVPEP KPSPAGLLQV CRDLNLRPAD CVYIGDSPSD
GKAAEAAGMG AIAVLWGSHK EDTLKQAPFT HYCRTVQELQ ALLPKTSAAV S
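
The protein sequence can be related back to the gene sequence by translating the reading frame. The sketch below is a minimal Python illustration, not the database's own method. It assumes the standard genetic code, since the Translation table field of this record is empty. The excerpt is copied from the third line of the gene sequence, starting at the first ATG on that line. The name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (an assumption here), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop an incomplete trailing codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Excerpt from the gene sequence, beginning at the ATG in its third line.
excerpt = "ATGCTC CCTTCCTTCA TGATCCCACG TTCA"
print(translate(excerpt))  # MLPSFMIPRS
```

The output matches the first ten residues of the protein sequence above. The lookup raises a `KeyError` for any character other than A, C, G, or T, so ambiguous bases would need separate handling.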