Gene PHATRDRAFT_22404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_22404 
SymbolPK1 
ID7203591 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp329868 
End bp331930 
Gene Length2063 bp 
Protein Length591 aa 
Translation table 
GC content52% 
IMG OID 
Productpyruvate kinase 1 
Protein accessionXP_002182818 
Protein GI219125083 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.529919 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCCTCTGTC CCAAGGAGAC AAAAAAGTAT CTTTGATAGT TGGAATTGGT ACCAACTGTG 
CTACAACAAA AGCTTTTACA ATCATGAAAC TTTCTCTTCT CGCGTTGACG TTCGCTTTAG
GCCATGCGTT CGTTCCTCCT TCCTTCTTGG CGTCGCCGTC GTCTCGTAAG GTACTGTCAT
CCTCGCGATC GGCGTCGGTA GCGGCCAACG CTGCGGATGT GTTGGCAAAG ACAACATCTT
CTTCCAGCAC ACCCAGTTCT TTGATGCCCA AGGAAACAAC AGTGGCAGCG GTTCCCAAGG
TCGCGCAGCG TTGGCGCAAG TCGACGAAAC AAGTCGTCAC GCTGGGACCG GCTTCGAGCA
ACAAGGAAAT GATTGAAAAG TTGTTTCTCG CCGGGGCCGA TGTCTTTCGT CTCAACTTTT
CCCACGGATC GCAAGAACAA AAGAAAGAAC TCCTCATCAT GATTCGGGAA GTGGAAGAAA
AATACTCCCA TCCTATCGGT ATTTTGGGAG ATTTGCAGGG TCCGAAATTG CGGGTACGTA
TCTGCCCCAA CGCACGCTGG TAGACAGAGA AATCTTGGGG TTTGGCTTCA ATCTGACGGT
ACACACCATT CTTTTTGTTC CTCTCTTCTT TGACAAATGA TCGCTTTTTA GGTTGGTGAA
TTCTCCAAGC CGGAGGGTGA GTTCTTGGAA CTCGGTCAGA GCTTTCGCCT CGATCTGGAC
AACGCCAAGG GTGACAACAA ACGCGTCCAG CTCCCCCACC CCGAAATCAT CAAGGCATCC
GAGCTCGGGC ACGCCTTGCT CGTGGATGAC GGTAAGGTCA AGCTTGTCGT TACGGCCAAG
GGCGATGACT ACCTCGAATG CCGCGTCGAT GTCGCCGGGA TGATCAAAGA TCGTAAGGGA
GTCAACACGC CCGATTCGGT GCTCGAAATC AGTCCTCTCA CACCCAAGGA TCGCAGTGAC
TTGGAGTACA TGCTTGGTAT TGGCGTCGAC TGGGTTGCGC TATCTTTTGT ACAGACTCCG
GCGGATATGG TGGAGATCCA CGCCTTGATC GACGAAAAAC TCCCTTCCGG ACAATTCAAG
CCCGCCGTCA TGGCCAAGAT TGAAAAGCCC AGTTGCTTCT ACGACGACAA TCTGCAACGC
ATTGTCGGAC TCTGCAACGG CATCATGGTT GCCCGTGGTG ATTTGGGTGT GGAGTGCCCT
CCCGAAGACG TGCCCTTGCT ACAAAAGGAA ATCATCGACG AATGCCGCAA TCAAGGACGT
CCCGTGATTG TAGCTACACA AATGCTCGAA TCCATGATTG AAGTACCAAC ACCGACCCGT
GCGGAAGCCA GTGATGTGGC CACGGCTATT TACGATGGCG CCGATGCGAT CATGCTCAGT
GCCGAATCCG CCGCCGGAAA GTTCCCGGAA GAATCCGTCG CTATGCAACA GCGCATCATC
AACCGCGTCG AGGGTGACAA GCACTACCGT TCTTACTTGA AACAAAACGA GCCCGATCCG
GAGAATACCC CGACCGATGC TATTATCACG GCGGCACGTC AAGTCGCGAA GACTATCGGT
GCCAAATCGA TTGTCTGCTT TTCACTACGA GGTTCGACCG TTCTGCGAGC CTCCAAATCT
CGTCCGGGTG TTCCAATTCT AGCCCTGTGT CCGTTCAAAG AAACTTCAAG ACAGCTAGCT
CTCAGCTGGG GCGTTTACTC CGATCTACCC AAGGCCGGCT CGTACGGATA CACCGTTTCG
GAAGAAGACA TGTTCAACTA CGACCGACCC ATGGTGGAAA AGAGCACGGA TGACTTCGAC
CTGGTCCTTA AAAATGCCTG CCGTGCGGCG TTGAAGAAAG GATTGGTCAG CGATCCGGAC
GATCTGCTTG TTGTGACGGC TGGCCTTCCT TTCGGTACCC CGGGAGCGGC AAACATCATT
CGTGTGGTCC CTGCCGCCGG CCCCAGTTGT TGGGACGGCG TTTGCCGTGT CGATTAAGGC
TAAGCTCGCG CCAAAGTTCT AGAAAGTGTT CATTTTTTCG CAGTCTATAA CCATATTTAA
AAATTAAAAG CAAACGTTTG TGT
 
Protein sequence
MKLSLLALTF ALGHAFVPPS FLASPSSRKV LSSSRSASVA ANAADVLAKT TSSSSTPSSL 
MPKETTVAAV PKVAQRWRKS TKQVVTLGPA SSNKEMIEKL FLAGADVFRL NFSHGSQEQK
KELLIMIREV EEKYSHPIGI LGDLQGPKLR VGEFSKPEGE FLELGQSFRL DLDNAKGDNK
RVQLPHPEII KASELGHALL VDDGKVKLVV TAKGDDYLEC RVDVAGMIKD RKGVNTPDSV
LEISPLTPKD RSDLEYMLGI GVDWVALSFV QTPADMVEIH ALIDEKLPSG QFKPAVMAKI
EKPSCFYDDN LQRIVGLCNG IMVARGDLGV ECPPEDVPLL QKEIIDECRN QGRPVIVATQ
MLESMIEVPT PTRAEASDVA TAIYDGADAI MLSAESAAGK FPEESVAMQQ RIINRVEGDK
HYRSYLKQNE PDPENTPTDA IITAARQVAK TIGAKSIVCF SLRGSTVLRA SKSRPGVPIL
ALCPFKETSR QLALSWGVYS DLPKAGSYGY TVSEEDMFNY DRPMVEKSTD DFDLVLKNAC
RAALKKGLVS DPDDLLVVTA GLPFGTPGAA NIIRVVPAAG PSCWDGVCRV D