Gene PHATRDRAFT_45997 details

Gene Information

Locus tag: PHATRDRAFT_45997
Symbol: PK4a
ID: 7201060
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011676
Strand:
Start bp: 912856
End bp: 914986
Gene Length: 2131 bp
Protein Length: 535 aa
Translation table:
GC content: 52%
IMG OID:
Product: kinase pyruvate kinase 4a
Protein accession: XP_002180140
Protein GI: 219118746
COG category:
COG ID:
TIGRFAM ID:
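
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumption)."""
    return (protein_length_aa + 1) * 3


length = gene_length(912856, 914986)
coding = coding_nucleotides(535)

print(length)           # 2131, matching the listed Gene Length
print(coding)           # 1608
print(length - coding)  # 523 bases not accounted for by codons
```

Under these assumptions the 523 remaining bases are non-coding, which is consistent with the record marking the gene as spliced.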


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.01235
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GCAACGGGAT CTTTTCCAAG ATTGTTACTG ACAGTGATTG TGTAATAGCT CCACACTTTC 
TCTCCTATCC AACAATCGCT TGTCCTTTCG TCGTTTGATA GTTGTACACA ACAACGTATA
ACAAGTATCC AGAAATATCT AGTGTTGTTA CAACGTGTGC AGTAGTGACC ATAACATGCT
TTCGAGCACG AGTACCATTC CCAAGCTAGA CGGCGAGGTC GTCACGCTCA GCGTCATCAA
GAAGCCCACC GAAACCAAGA AGAGACGCAC CAAGATTATT TGTACCTTGG TAAGTATTGG
GTCCTCTGTC GGAGACTCAG TTGACGGAAG TGACCAAGCT GTATTTCATG GGCACATATG
CTGCCGGACT GTGTTGTGCT GTCGTGTATC TCAACGGGCG TTATCGTTCG TATCTCGTCC
TTACCAGTAC GGGCGCTCGA CTCGATCACT CACTGCTGCC TGCTATGTTC CAGGGACCTG
CTTGTTGGAG CGAAGAAGGC CTCGGCCAGC TCATGGACGC CGGCATGAAT GTCGCTCGCT
TCAACTTTTC CCACGGTGAT CACGAAGGAC ACGGAAAAGT CCTCGAACGT TTGCGCAAGG
TTGCCAAGGA AAAGAAGCGC AACATTGGTA CGTACGAACC AACCACGAAC ATTCGTCTTT
TGAGCGTGTA CGCGTGTCTT TTTGTAGCTC TACCGGACGA AACAAATGGA GCTGTAGCGT
GCTTCTGTAG GCGCCTGGTT TACCATCTTT CCTCACACCC CATTCTTGAT TCCTTTCCTC
ATCGGCCACA GCGGTGCTCT TGGATACCAA GGGTCCGGAA ATTCGTACGG GATTTTTTGC
CGACGGCATC GACAAGATTA ACCTGTCCAA GGGAGACACG ATCGTACTGA CCACGGACTA
TGACTTCAAG GGCGATAGCA AGCGTTTGGC GTGCAGTTAC CCCACACTAG CCAAGTCCGT
TACCCAGGGA CAAGCCATTC TTATTGCCGA CGGATCACTC GTTTTGACCG TCTTGAGCAT
CGACACGGCT AATAACGAAG TGCAGTGTCG CGTCGAGAAC AACGCTTCCA TTGGCGAACG
CAAAAACATG AATTTGCCCG GAGTTGTCGT CGATTTACCC ACCTTCACCG AACGTGACGT
CAACGATATC GTCAATTTTG GTATCAAGAG CAAGGTAGAC TTTATCGCTG CTTCTTTTGT
TCGCAAGGGA AGTGACGTGA CCAACCTGCG CAAGCTCCTC GCCGACAATG GCGGTCCACA
GATCAAAATT ATTTGTAAAA TTGAGAATCA AGAAGGCCTC GAGAACTACG GAGACATTCT
GGAGCACACG GATGCCATCA TGGTGGCCCG CGGTGATCTC GGTATGGAAA TTCCTTCGTC
CAAGGTATTT CTGGCGCAAA AGTACATGAT TCGCGAAGCC AACGTTGCGG GCAAGCCCGT
TGTCACTGCC ACGCAAATGC TCGAAAGTAT GGTGACCAAC CCGCGTCCTA CGCGTGCCGA
ATGTTCCGAC GTGGCCAACG CCGTTTACGA CGGCACCGAC GCCGTTATGC TGTCGGGAGA
AACCGCCAAC GGTCCACATT TTGAAAAGGC CGTGCTGGTC ATGGCGCGTA CGTGTTGCGA
AGCCGAGTCG TCCCGCAACT ACAACCTGTT GTTCCAGTCG GTCCGCAACT CAATCGTCAT
TGCGCGCGGT GGCTTGTCTA CCGGGGAATC CATGGCCAGC AGTGCCGTCA AGTCGGCCCT
CGACATTGAA GCCAAGTTGA TTGTGGTCAT GAGTGAAACG GGCAAGATGG GCAACTACGT
GGCCAAATTT CGTCCGGGCT TGAGTGTCCT GTGCATGACC CCCAACGAAA CGGCCGCGCG
GCAGGCTAGT GGATTGCTGT TGGGCATGCA CACGGTCGTG GTGGATTCGT TGGAAAAATC
GGAAGAGTTG GTGGAAGAAC TCAATTACGA ATTGGTGCAA TCCAACTTTC TCAAACCCGG
CGACAAGATG GTTGTCATTG CCGGACGCAT GGCCGGCATG AAGGAACAGT TGCGCATTGT
GACGTTGGAC GAGGGGAAGT CGTATGGTCA CATTGTCTCC GGCACGAGCT TCTTCTTTGA
ACGCACACGT CTGTTGGACT TTAACGACTA A
 
Protein sequence
MLSSTSTIPK LDGEVVTLSV IKKPTETKKR RTKIICTLGP ACWSEEGLGQ LMDAGMNVAR 
FNFSHGDHEG HGKVLERLRK VAKEKKRNIA VLLDTKGPEI RTGFFADGID KINLSKGDTI
VLTTDYDFKG DSKRLACSYP TLAKSVTQGQ AILIADGSLV LTVLSIDTAN NEVQCRVENN
ASIGERKNMN LPGVVVDLPT FTERDVNDIV NFGIKSKVDF IAASFVRKGS DVTNLRKLLA
DNGGPQIKII CKIENQEGLE NYGDILEHTD AIMVARGDLG MEIPSSKVFL AQKYMIREAN
VAGKPVVTAT QMLESMVTNP RPTRAECSDV ANAVYDGTDA VMLSGETANG PHFEKAVLVM
ARTCCEAESS RNYNLLFQSV RNSIVIARGG LSTGESMASS AVKSALDIEA KLIVVMSETG
KMGNYVAKFR PGLSVLCMTP NETAARQASG LLLGMHTVVV DSLEKSEELV EELNYELVQS
NFLKPGDKMV VIAGRMAGMK EQLRIVTLDE GKSYGHIVSG TSFFFERTRL LDFND
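
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before measuring length or base composition. The following is a minimal sketch of that cleanup; the helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First two ten-base blocks of the gene sequence listed above.
example = "GCAACGGGAT CTTTTCCAAG"
seq = clean_sequence(example)

print(len(seq), gc_fraction(seq))  # 20 0.5
```

Pasting the full gene sequence into `example` gives values to compare against the listed Gene Length and GC content.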