Gene PHATRDRAFT_48983 details

Gene Information

Locus tag: PHATRDRAFT_48983
Symbol: PGK
ID: 7195255
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011688
Strand:
Start bp: 222740
End bp: 225660
Gene Length: 2921 bp
Protein Length: 448 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002183701
Protein GI: 219126933
COG category:
COG ID:
TIGRFAM ID:
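
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions; the record does not state its coordinate convention, so that is an assumption, and the function name is illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(222740, 225660)
print(gene_length)  # 2921, matching the listed Gene Length

# A 448 aa protein needs 448 codons plus one stop codon.
coding_nt = (448 + 1) * 3
print(coding_nt)  # 1347

# Bases in the gene span that are not part of the coding sequence.
print(gene_length - coding_nt)  # 1574
```

Under that assumption the span equals the listed 2921 bp. The coding sequence accounts for 1347 of those bases, so the remaining 1574 lie outside it, which is consistent with the gene being flagged as spliced.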


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CATCTCTAAG ATCCCGTGGA GCAAATACCG CAACTCCTCC CGTATCATCA TGTTCCGTAT 
GTTGACTTCA ACGGCTTTGC GGCGTTCACC AGTAACCAGC AGCTTGACCT CTTGCTGTAA
AGCAAATGCT TTTGCAGTCC GAATTCGTAG CTTTCACGCT GCCCCAGTGA TCCAAGCCAA
AATGACGGTC GAGCAACTGG CCCAGCAAGT CGATATGAAA GGGACCAATG TTCTCGTGCG
CGTTGATTTG AATGCCCCCC TGGCCACGGT ACGTCCCATC GAATCTTTTG GTTTCTCGGA
TTGTTCGCTT TCGGATATAT ACTCTTGTTC CCAACGCCGT TGATGAATCT CTTTCTGATC
TGCTGGACTT TCACATTAGG ATGATGTAAC GGTGACGGAT GACACACGCC TGCGCGCCAT
TGTTCCAACC ACGAAATTCT TGCTCGAGCA GGGAGCCAAC GTGATTCTCT GCAGCCATTT
TGGCCGACCC AAGGGTGAAA TTATCGAAAC TGGCAAGAAT GGTCGGCTCA ACCCAGTGGT
GAAGCCGCTC GAAGTACTGC TGGGGCAGAC GATTACCAAA CTTGATGATT GCATGGGACC
GGATGTTGAA GCCGCAACGA AAAATTTGGG TGAGGGAAAG GTTGTCTTGT TGGAAAACAC
CCGTTTTTAC AGCGGAGAAA CCAAGAACGA TCCGGAACTC GCAGCCGGGC TCGGAAAACT
TGCTGATTAT TTCGTCATGG ACGCCTTTGG CACGGCTCAC CGAGCACATT CTTCGACTGC
CGGTGTGACC ACCCATATGA AGTTCAACGC TGCTGGAAAA CTTATGGAGA AGGAATTGCA
ATACTTGCAA GGTGCCGTGG AAGAGCCCAA ACGCCCCATG ATGGCTATTG TCGGTGGTGC
CAAAGTATCC ACCAAAATTC CGGTCATCGA ATCGCTTTTG GACAAGTGCG ATGTCATTTT
GATTGGCGGC GGTATGATCT TTACCTTTTA CAAGGCTCTC GGTTACGACA TTGGCGCATC
GCTCGTGGAA GACGACATTG TGGAATTGGC GAGTTCATTG ATGAAAAAGG CCGAAGAAAA
GGGTGTCAAA CTGATTCTGC CTGTTGACGT TGTCTTGGCT GATAAGTTTG ACAATGATGC
AAACTCGGCC GTGGCGAAGG TCACCGACAT TTCCGGAGAC TGGATGGGAC TGGACATTGG
ACCGGAAACG ATCGATTTGT TTCGTAGCGA AATTGCCGAA GCCAATACAA TTGGTACGTC
GTTACGCGTT ATTTGATTGA AAGTGGGTCG GGTTATTGTT TGTTTGCCAT CTCATGTTCG
CTATTGTCTC TTGCTATGAT AGTTTGGAAC GGTCCGATGG GCGTCTTTGA ATTTAGCAAC
TTTGCGGCTG GCACCAACGA TGTTGCCCAG ATGCTCGCCC AAGCCACTGC CGAACGGGGC
GCCGTCACCA TAATTGGTGG TGGCGACTCT GTGGCCGCCG TCAACAAAGC GGGTTTGGGT
GACAAGGTTT CTCACATTTC GACGGGCGGT GGCGCCAGTT TGGAGCTTCT GGAGGGCAAG
GTATTGCCCG GCGTTGCGGC TTTGACAGAA GTGTAAGCCG GGAATGCGTA TAGTCTAGCA
ATATTTATAA TACTGTCGTA AGAAACGAGT CATCAGTTGG ACAAACGGTA TCAGTTAGTG
CAAATCGGGC GGAGCCATCA TACACAATCA GAATTGCTGT ACACGAGATC CGCCATAGAA
AGCCGATAGT GACCTGCAGT ATAACGACTT TAATTTGCGC TCGCGTACAT TGAATCGTAT
CGACACGAAT TTCCCGTGCA GTTCCTTTGG TACCTACACA TGCTGCAAAG TGGAACCATC
CGATCCCTTC GGAAACGTTG ATTCCACCAT GTTTTTGACA CTGCTAAAGC TTCTTTTCGT
ATATTGATAA AGCAATGAGC TATCCATGGC AATGTTCAAA GGACTCGGTA CGCTATCGGC
GGAAGGTGGT ACGGCGGCCC TATCTTTGGC GAGAATGTGT TGGGGGTTGT ACCCCAAGTG
CTCAAAGACT AGTTCCGCCA TATCTACCCG ACTAACAGCA GTCGGGCCAC CCATATTGTA
CACGCCCGTG GGGACGGTAT CACCGTTGCG AAAACTGGCG AGCAGTCCCA GTATTACCGC
TACGACGTCG TGTACAGACA CCACGGACCG AACTTCATCC CGGTAAAAGA CCGTGTCCAC
GCCTTGGCGG GTGGCACAAA AGTGCAAAAA TGTTTCGTGC GCTATTTCGG GAAGAACGGG
GGCACGGGGA CCCAGAATGA TACTGCTACG GAGTATCAAG GTACGACAGT TGTTGCTTTG
CAACAAATGA TGTTCCAAAG CTAGTTTGGT TCTTCCGTAC ACATTACAAG GTTCCGGAGG
AGTGTCCTCT CGGTAAGGTG GTTCTGTTCC ATCGTAGACT TGGTCCGTGG ATAAAGCAAT
AACGTAGGTA TTTCGAACTT CCAACAGGGC ATCCAGAAAG GCTTTTGGAC AGTTACTGTT
ATGGGCGAGC TCGGGTTGTG CTTGGCAGGT CCGGGGACTC GACAAGGCGG CCGTATGGAT
ACAAACGTCC AATATTGGTA TCCTGGCGAA CCAGTCCCGT ACGGCTCTTT GATCGCTTAA
ATCTAGTGCC TGTACGTGTA CCTTGGTAGT CGGAAATTGG GCGGCAGCCG TTGATACTGC
AGCAGCAAAG CCTTCGGCAC GGTGATACAA CGCGTATATT TCGTAGGAGT GTTGTTCTGG
GTGACTCGAT GATTGGAAGA GCGATGCCAA AATATGTTGC CCCAAGTAGC CCGAGGCTCC
TGTCAAGAGA ATTCTGAACG CATTGGAAGA ATTATCACAA CGAGAAGTCT CTACTGGACT
TCTGGATATC GTACCGGTCG TCACCATGGT CAGTCTCGAC G
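
The gene sequence is printed in space-separated blocks of 10 bases, 60 bases per full line. A minimal sketch of how such a listing could be joined back into one contiguous string and checked for length and GC fraction follows. The function names are illustrative, and the input is only the first two lines of the listing, not the whole gene.

```python
def join_blocks(listing: str) -> str:
    """Drop all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(listing.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two lines of the gene sequence above, used as a short excerpt.
excerpt = """
CATCTCTAAG ATCCCGTGGA GCAAATACCG CAACTCCTCC CGTATCATCA TGTTCCGTAT
GTTGACTTCA ACGGCTTTGC GGCGTTCACC AGTAACCAGC AGCTTGACCT CTTGCTGTAA
"""

seq = join_blocks(excerpt)
print(len(seq))  # 120: two full lines of 60 bases
print(f"GC fraction of excerpt: {gc_fraction(seq):.2f}")
```

Run over the full listing, `len(seq)` can be compared with the Gene Length field and `gc_fraction(seq)` with the GC content field.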
 
Protein sequence
MFRMLTSTAL RRSPVTSSLT SCCKANAFAV RIRSFHAAPV IQAKMTVEQL AQQVDMKGTN 
VLVRVDLNAP LATDDVTVTD DTRLRAIVPT TKFLLEQGAN VILCSHFGRP KGEIIETGKN
GRLNPVVKPL EVLLGQTITK LDDCMGPDVE AATKNLGEGK VVLLENTRFY SGETKNDPEL
AAGLGKLADY FVMDAFGTAH RAHSSTAGVT THMKFNAAGK LMEKELQYLQ GAVEEPKRPM
MAIVGGAKVS TKIPVIESLL DKCDVILIGG GMIFTFYKAL GYDIGASLVE DDIVELASSL
MKKAEEKGVK LILPVDVVLA DKFDNDANSA VAKVTDISGD WMGLDIGPET IDLFRSEIAE
ANTIVWNGPM GVFEFSNFAA GTNDVAQMLA QATAERGAVT IIGGGDSVAA VNKAGLGDKV
SHISTGGGAS LELLEGKVLP GVAALTEV