Gene PHATRDRAFT_55018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_55018 
SymbolPEPCK1 
ID7195549 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011689 
Strand
Start bp387907 
End bp390142 
Gene Length2236 bp 
Protein Length612 aa 
Translation table 
GC content49% 
IMG OID 
Productphosphoenolpyruvate carboxykinase 
Protein accessionXP_002183984 
Protein GI219127525 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAAACGTTCC CTGATCGCTT ATAAAAAGGA TTCGAAAGAT CAGATCGATC AGGTTTCATC 
CTTTGGCGTG TTTTTCTCAG CTCAATACTT CCCCCCTGCT TACGTTAGCA CACGAGTAGT
CCTCGTTGTC AAATTATCAT GTTGTTGACT ACTGGAGCTG CCCGCGTCTT TTTGCGCTCG
GCGGCCGTCA GCAAATCTGC CGTCAAGACG TTTGCTGCCC GTGCCGTGTT AGGCTCGGGT
CGCTTAAGCT CGCCTTCGTC GTTGGTACGT GCCTCAGCGC TCTTCTTCTG CTCTCCATTA
GCAATGGCTT GGTCACCGCT TGAGGATGCA TTCGTTTTTC TTTTTTTTTT CTCATGGATA
ATGATGACCA TCTTTTGTTG CCTCACTGTT TACAGTACCA CGGATGCTCA GTCCAGACTT
TTACTAGTCT CCCCAACGAC GCTTCCACCA GTTGCAAGGA AGGACGGGAA GCTTACAACG
TTTCGCAGAC ACACAAGGGC ACGGACGCCT GTCTCAAAGT CGGTATTGAC AAACTCGGGA
TCACAGGCCC TTCCACAATC TACCGCAACT TGAACTACGA CGAAATCTTT GAGCACGAAG
TCAAGAACGG CGAAGGCGTT GTGGCCAAGG CCGAATACGG TGACACATTT TGCGTCGATA
CCGGAAAATT CACCGGTCGT TCCCCCAAGG ACAAGTGGAT TGTATTGAAT AAGGGATCCG
AGACCGAAGC CAACATTGAC TGGAATAGTA TCAATCAGGC AACCAAGCCG GAAGTATTCG
ATGAGCTCTA CGACAAAGCC GTCGACTATT TTAATCAACG CGAGTCGTGC TACGTCGCGG
ATGTCTATTG TGGAGCCAAT CCATCCACAC GCAAGAAGAT CCGCTTCTTG TTCGACAAGG
CCTGGCAGCA GCACTTTGTA ACGAATATGT TTATTCGTCC GTCAGACGAG GCGGAGCTAG
ATGGGTTCGA TCCTGACTTT ACCGTTATTA ATTGCTGCGC ACAGGTAGAT GACGACTGGG
AACGCCATGG TCTCCACAGC GACACTGCTG TAGTCTTCAA CATAGAAAAG AAGACGGCCG
TTGTTTTCGG AACCTGGTAC GGTGGAGAAA ATAAGAAGGG CATCTTTTCA CTCATGAACT
ACTGGTTGCC TATGCAAGGA CATTTGCCCA TGCACTGTTC CGCCAATGTT GGCAAGGAAG
GTGACGTGGC CTTGTTTTTC GGGCTCAGCG GAACTGGAAA GACCACTCTG TCAGCCGACC
CGCACCGCGC TTTAATTGGT GACGATGAGC ATGGATGGTA CGTAGGAGGA CTGCTGTTTT
TGAAAGCAAT GCAATCGTAG AAATGCATTC TGACGAATCT TACGTTTTCT CTATGGACAG
GGATCACGAT GGCATCTTCA ACTTTGAAGG TGGCTGCTAC GCTAAGACAA TAAACTTGTC
TGAAGCGACC GAGCCAGATA TTTACCGAGC CATCCACAAG GACGCTCTTC TGGAGAACGT
GGCTATTCGG GACGATGGAA CCCCTGACTA CTCGAATGTC TCCAAGACGG AGAACGGACG
TGTCTCGTAT CCAATTTTCA ACATTCCTGG GTATCACAAG GAGCAAATGG CTGGGCATCC
TAGTAACATC ATCTTTTTGT CCTGTGATGC ATTTGGCGTA ATGCCTCCTG TGGCTCGTCT
GTCTTCCGGG CAGGCCATGT ACCACTTTTT ATCTGGATAC ACAGCCAAAG TGGCCGGAAC
GGAACGTGGA ATCACGGAAC CCTCAGCCAC ATTCTCGACT TGCTTTGGTG CTGCGTTTAT
GACTATGCAC CCGACCGTGT ATGCTGATTT GTTGCAAGAA AAACTTGACA AACATGGATC
CCATGCCTAT CTGGTTAATT CCGGATGGTC TGGAGGTGCC TATGGTACCG GGAAGCGTAT
GAGTATCAAA ACGACGCGCA CATGCATTGA TGCAATTCTA GATGGATCCA TTCACGATGC
AGAATTTCAA GTGGATCCTA TCTTTGGCTA CGAAGTACCC AAAAGCCTTC CCGGACTGGA
CGATCTTCTC TTGGATCCCA AGTCGACTTG GGATAACCAG GACGCCTACG ACGAAACAGC
AGCGAAACTT GCCAAGATGT ACTCCGACAA CTTCAAGCAG TATGAAGGAA AGGGGTCCAT
TGACTACACC AAATTCGGAC CCAAGATATA ATTACGGAAC AATAAACCGG TCTATAAATG
GGAGCCTAAT CTATGA
 
Protein sequence
MLLTTGAARV FLRSAAVSKS AVKTFAARAV LGSGRLSSPS SLYHGCSVQT FTSLPNDAST 
SCKEGREAYN VSQTHKGTDA CLKVGIDKLG ITGPSTIYRN LNYDEIFEHE VKNGEGVVAK
AEYGDTFCVD TGKFTGRSPK DKWIVLNKGS ETEANIDWNS INQATKPEVF DELYDKAVDY
FNQRESCYVA DVYCGANPST RKKIRFLFDK AWQQHFVTNM FIRPSDEAEL DGFDPDFTVI
NCCAQVDDDW ERHGLHSDTA VVFNIEKKTA VVFGTWYGGE NKKGIFSLMN YWLPMQGHLP
MHCSANVGKE GDVALFFGLS GTGKTTLSAD PHRALIGDDE HGWDHDGIFN FEGGCYAKTI
NLSEATEPDI YRAIHKDALL ENVAIRDDGT PDYSNVSKTE NGRVSYPIFN IPGYHKEQMA
GHPSNIIFLS CDAFGVMPPV ARLSSGQAMY HFLSGYTAKV AGTERGITEP SATFSTCFGA
AFMTMHPTVY ADLLQEKLDK HGSHAYLVNS GWSGGAYGTG KRMSIKTTRT CIDAILDGSI
HDAEFQVDPI FGYEVPKSLP GLDDLLLDPK STWDNQDAYD ETAAKLAKMY SDNFKQYEGK
GSIDYTKFGP KI