Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_22404 |
Symbol | PK1 |
ID | 7203591 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 329868 |
End bp | 331930 |
Gene Length | 2063 bp |
Protein Length | 591 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | pyruvate kinase 1 |
Protein accession | XP_002182818 |
Protein GI | 219125083 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.529919 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCCTCTGTC CCAAGGAGAC AAAAAAGTAT CTTTGATAGT TGGAATTGGT ACCAACTGTG CTACAACAAA AGCTTTTACA ATCATGAAAC TTTCTCTTCT CGCGTTGACG TTCGCTTTAG GCCATGCGTT CGTTCCTCCT TCCTTCTTGG CGTCGCCGTC GTCTCGTAAG GTACTGTCAT CCTCGCGATC GGCGTCGGTA GCGGCCAACG CTGCGGATGT GTTGGCAAAG ACAACATCTT CTTCCAGCAC ACCCAGTTCT TTGATGCCCA AGGAAACAAC AGTGGCAGCG GTTCCCAAGG TCGCGCAGCG TTGGCGCAAG TCGACGAAAC AAGTCGTCAC GCTGGGACCG GCTTCGAGCA ACAAGGAAAT GATTGAAAAG TTGTTTCTCG CCGGGGCCGA TGTCTTTCGT CTCAACTTTT CCCACGGATC GCAAGAACAA AAGAAAGAAC TCCTCATCAT GATTCGGGAA GTGGAAGAAA AATACTCCCA TCCTATCGGT ATTTTGGGAG ATTTGCAGGG TCCGAAATTG CGGGTACGTA TCTGCCCCAA CGCACGCTGG TAGACAGAGA AATCTTGGGG TTTGGCTTCA ATCTGACGGT ACACACCATT CTTTTTGTTC CTCTCTTCTT TGACAAATGA TCGCTTTTTA GGTTGGTGAA TTCTCCAAGC CGGAGGGTGA GTTCTTGGAA CTCGGTCAGA GCTTTCGCCT CGATCTGGAC AACGCCAAGG GTGACAACAA ACGCGTCCAG CTCCCCCACC CCGAAATCAT CAAGGCATCC GAGCTCGGGC ACGCCTTGCT CGTGGATGAC GGTAAGGTCA AGCTTGTCGT TACGGCCAAG GGCGATGACT ACCTCGAATG CCGCGTCGAT GTCGCCGGGA TGATCAAAGA TCGTAAGGGA GTCAACACGC CCGATTCGGT GCTCGAAATC AGTCCTCTCA CACCCAAGGA TCGCAGTGAC TTGGAGTACA TGCTTGGTAT TGGCGTCGAC TGGGTTGCGC TATCTTTTGT ACAGACTCCG GCGGATATGG TGGAGATCCA CGCCTTGATC GACGAAAAAC TCCCTTCCGG ACAATTCAAG CCCGCCGTCA TGGCCAAGAT TGAAAAGCCC AGTTGCTTCT ACGACGACAA TCTGCAACGC ATTGTCGGAC TCTGCAACGG CATCATGGTT GCCCGTGGTG ATTTGGGTGT GGAGTGCCCT CCCGAAGACG TGCCCTTGCT ACAAAAGGAA ATCATCGACG AATGCCGCAA TCAAGGACGT CCCGTGATTG TAGCTACACA AATGCTCGAA TCCATGATTG AAGTACCAAC ACCGACCCGT GCGGAAGCCA GTGATGTGGC CACGGCTATT TACGATGGCG CCGATGCGAT CATGCTCAGT GCCGAATCCG CCGCCGGAAA GTTCCCGGAA GAATCCGTCG CTATGCAACA GCGCATCATC AACCGCGTCG AGGGTGACAA GCACTACCGT TCTTACTTGA AACAAAACGA GCCCGATCCG GAGAATACCC CGACCGATGC TATTATCACG GCGGCACGTC AAGTCGCGAA GACTATCGGT GCCAAATCGA TTGTCTGCTT TTCACTACGA GGTTCGACCG TTCTGCGAGC CTCCAAATCT CGTCCGGGTG TTCCAATTCT AGCCCTGTGT CCGTTCAAAG AAACTTCAAG ACAGCTAGCT CTCAGCTGGG GCGTTTACTC CGATCTACCC AAGGCCGGCT CGTACGGATA CACCGTTTCG GAAGAAGACA TGTTCAACTA CGACCGACCC ATGGTGGAAA AGAGCACGGA TGACTTCGAC CTGGTCCTTA AAAATGCCTG CCGTGCGGCG TTGAAGAAAG GATTGGTCAG CGATCCGGAC GATCTGCTTG TTGTGACGGC TGGCCTTCCT TTCGGTACCC CGGGAGCGGC AAACATCATT CGTGTGGTCC CTGCCGCCGG CCCCAGTTGT TGGGACGGCG TTTGCCGTGT CGATTAAGGC TAAGCTCGCG CCAAAGTTCT AGAAAGTGTT CATTTTTTCG CAGTCTATAA CCATATTTAA AAATTAAAAG CAAACGTTTG TGT
|
Protein sequence | MKLSLLALTF ALGHAFVPPS FLASPSSRKV LSSSRSASVA ANAADVLAKT TSSSSTPSSL MPKETTVAAV PKVAQRWRKS TKQVVTLGPA SSNKEMIEKL FLAGADVFRL NFSHGSQEQK KELLIMIREV EEKYSHPIGI LGDLQGPKLR VGEFSKPEGE FLELGQSFRL DLDNAKGDNK RVQLPHPEII KASELGHALL VDDGKVKLVV TAKGDDYLEC RVDVAGMIKD RKGVNTPDSV LEISPLTPKD RSDLEYMLGI GVDWVALSFV QTPADMVEIH ALIDEKLPSG QFKPAVMAKI EKPSCFYDDN LQRIVGLCNG IMVARGDLGV ECPPEDVPLL QKEIIDECRN QGRPVIVATQ MLESMIEVPT PTRAEASDVA TAIYDGADAI MLSAESAAGK FPEESVAMQQ RIINRVEGDK HYRSYLKQNE PDPENTPTDA IITAARQVAK TIGAKSIVCF SLRGSTVLRA SKSRPGVPIL ALCPFKETSR QLALSWGVYS DLPKAGSYGY TVSEEDMFNY DRPMVEKSTD DFDLVLKNAC RAALKKGLVS DPDDLLVVTA GLPFGTPGAA NIIRVVPAAG PSCWDGVCRV D
|
| |