Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45997 |
Symbol | PK4a |
ID | 7201060 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 912856 |
End bp | 914986 |
Gene Length | 2131 bp |
Protein Length | 535 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | kinase pyruvate kinase 4a |
Protein accession | XP_002180140 |
Protein GI | 219118746 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.01235 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCAACGGGAT CTTTTCCAAG ATTGTTACTG ACAGTGATTG TGTAATAGCT CCACACTTTC TCTCCTATCC AACAATCGCT TGTCCTTTCG TCGTTTGATA GTTGTACACA ACAACGTATA ACAAGTATCC AGAAATATCT AGTGTTGTTA CAACGTGTGC AGTAGTGACC ATAACATGCT TTCGAGCACG AGTACCATTC CCAAGCTAGA CGGCGAGGTC GTCACGCTCA GCGTCATCAA GAAGCCCACC GAAACCAAGA AGAGACGCAC CAAGATTATT TGTACCTTGG TAAGTATTGG GTCCTCTGTC GGAGACTCAG TTGACGGAAG TGACCAAGCT GTATTTCATG GGCACATATG CTGCCGGACT GTGTTGTGCT GTCGTGTATC TCAACGGGCG TTATCGTTCG TATCTCGTCC TTACCAGTAC GGGCGCTCGA CTCGATCACT CACTGCTGCC TGCTATGTTC CAGGGACCTG CTTGTTGGAG CGAAGAAGGC CTCGGCCAGC TCATGGACGC CGGCATGAAT GTCGCTCGCT TCAACTTTTC CCACGGTGAT CACGAAGGAC ACGGAAAAGT CCTCGAACGT TTGCGCAAGG TTGCCAAGGA AAAGAAGCGC AACATTGGTA CGTACGAACC AACCACGAAC ATTCGTCTTT TGAGCGTGTA CGCGTGTCTT TTTGTAGCTC TACCGGACGA AACAAATGGA GCTGTAGCGT GCTTCTGTAG GCGCCTGGTT TACCATCTTT CCTCACACCC CATTCTTGAT TCCTTTCCTC ATCGGCCACA GCGGTGCTCT TGGATACCAA GGGTCCGGAA ATTCGTACGG GATTTTTTGC CGACGGCATC GACAAGATTA ACCTGTCCAA GGGAGACACG ATCGTACTGA CCACGGACTA TGACTTCAAG GGCGATAGCA AGCGTTTGGC GTGCAGTTAC CCCACACTAG CCAAGTCCGT TACCCAGGGA CAAGCCATTC TTATTGCCGA CGGATCACTC GTTTTGACCG TCTTGAGCAT CGACACGGCT AATAACGAAG TGCAGTGTCG CGTCGAGAAC AACGCTTCCA TTGGCGAACG CAAAAACATG AATTTGCCCG GAGTTGTCGT CGATTTACCC ACCTTCACCG AACGTGACGT CAACGATATC GTCAATTTTG GTATCAAGAG CAAGGTAGAC TTTATCGCTG CTTCTTTTGT TCGCAAGGGA AGTGACGTGA CCAACCTGCG CAAGCTCCTC GCCGACAATG GCGGTCCACA GATCAAAATT ATTTGTAAAA TTGAGAATCA AGAAGGCCTC GAGAACTACG GAGACATTCT GGAGCACACG GATGCCATCA TGGTGGCCCG CGGTGATCTC GGTATGGAAA TTCCTTCGTC CAAGGTATTT CTGGCGCAAA AGTACATGAT TCGCGAAGCC AACGTTGCGG GCAAGCCCGT TGTCACTGCC ACGCAAATGC TCGAAAGTAT GGTGACCAAC CCGCGTCCTA CGCGTGCCGA ATGTTCCGAC GTGGCCAACG CCGTTTACGA CGGCACCGAC GCCGTTATGC TGTCGGGAGA AACCGCCAAC GGTCCACATT TTGAAAAGGC CGTGCTGGTC ATGGCGCGTA CGTGTTGCGA AGCCGAGTCG TCCCGCAACT ACAACCTGTT GTTCCAGTCG GTCCGCAACT CAATCGTCAT TGCGCGCGGT GGCTTGTCTA CCGGGGAATC CATGGCCAGC AGTGCCGTCA AGTCGGCCCT CGACATTGAA GCCAAGTTGA TTGTGGTCAT GAGTGAAACG GGCAAGATGG GCAACTACGT GGCCAAATTT CGTCCGGGCT TGAGTGTCCT GTGCATGACC CCCAACGAAA CGGCCGCGCG GCAGGCTAGT GGATTGCTGT TGGGCATGCA CACGGTCGTG GTGGATTCGT TGGAAAAATC GGAAGAGTTG GTGGAAGAAC TCAATTACGA ATTGGTGCAA TCCAACTTTC TCAAACCCGG CGACAAGATG GTTGTCATTG CCGGACGCAT GGCCGGCATG AAGGAACAGT TGCGCATTGT GACGTTGGAC GAGGGGAAGT CGTATGGTCA CATTGTCTCC GGCACGAGCT TCTTCTTTGA ACGCACACGT CTGTTGGACT TTAACGACTA A
|
Protein sequence | MLSSTSTIPK LDGEVVTLSV IKKPTETKKR RTKIICTLGP ACWSEEGLGQ LMDAGMNVAR FNFSHGDHEG HGKVLERLRK VAKEKKRNIA VLLDTKGPEI RTGFFADGID KINLSKGDTI VLTTDYDFKG DSKRLACSYP TLAKSVTQGQ AILIADGSLV LTVLSIDTAN NEVQCRVENN ASIGERKNMN LPGVVVDLPT FTERDVNDIV NFGIKSKVDF IAASFVRKGS DVTNLRKLLA DNGGPQIKII CKIENQEGLE NYGDILEHTD AIMVARGDLG MEIPSSKVFL AQKYMIREAN VAGKPVVTAT QMLESMVTNP RPTRAECSDV ANAVYDGTDA VMLSGETANG PHFEKAVLVM ARTCCEAESS RNYNLLFQSV RNSIVIARGG LSTGESMASS AVKSALDIEA KLIVVMSETG KMGNYVAKFR PGLSVLCMTP NETAARQASG LLLGMHTVVV DSLEKSEELV EELNYELVQS NFLKPGDKMV VIAGRMAGMK EQLRIVTLDE GKSYGHIVSG TSFFFERTRL LDFND
|
| |