Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0744 |
Symbol | pgk |
ID | 5054931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 661636 |
End bp | 662871 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640468302 |
Product | phosphoglycerate kinase |
Protein accession | YP_001152982 |
Protein GI | 145590980 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0126] 3-phosphoglycerate kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.333208 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAACA TGTTACTTAA TGAAGTTCTT AACCAGTTGC CAAATATAAA TAAATGCTTA GAAAAAGGAA AAAAATTAAT TATAAGAATA GACATAAACT CGCCAATTAT AAACGGTAAA ATTATTGACG ATTACAGAAT ACGCGCCCAT TCATACACGC TTAGGCTTGC CTCAGACGCC GGGGCGAGGA CCGTGGTGCT GGCACACCAG GGCAGGCCGG GGCAAGACGA CTTCACATCT TTAGAGGTGC ACAAGCCCTA CATTGAGAAG TACTTAGAGA GGCCCATAAA ATTCGTCGAC GACATCATAG GGCCTGAGGC ACGGAGACAG ATTAAGGAGC TGAAAGACGG CGAGATCTTA CTGCTAGAAA ACGTGAGGAT GTTGTCGGAG GAGGTCATCG AAAAGATCCC AGAGGCCCAG GCAGAGACCA TGTTGGTGAA GAAGCTGGCG CCGCTGGCGG ACTACTACGT CTTCGACGGA TTTGCCGTGG CTCACAGATC CCAGCCCAGC GTCGTGGGGT TCCCCATGGT GATGCCCTCC TGTATGGGCC CCGTCTTCGA GAAGGAGCTG AGAGCGCTGA GCGTGGTGTT CGAGAAGCGT GGAAAAGGAG TAGTCCTCTT GGCAGGGGGG GCCAAGATCC CAGATACTAT AAAAGCCGTG GAACAGCTAC TCAAAAACGG CTTTGTGGAA AAGGTGGCCT TCGGCGGCTT GGTGGGCTTC ATCTTCACCG TGGCAAAACA CGGAGTTTTG AACGCGGCCT TAAAACAGGA GGTGGAAAAA GGCGGGTACC TCCCCTATGT GGAAAGAGCG CGCCAGCTAC TAAGCAAATA CGGAGAGAAG ATAGAAGTGC CGGTCGACTT TGCGGTTAGC CAGAACGGGA GGATCGACGT CGACGCCTTC TCCCTAGCGC AACAACCGCT AGACATAGGC AAATCCACAA CGATACGATA CAAGGAAGTC ATCGACCAGG CGGAGGTGGT CATATTCAGC GGCCCAATGG GCTATGTAGA AGACGAGAGG TTCGCCACAG GTACGTTGGA GTTGCTAAGA GCCGCCGCCA AGAAGAAGCT CATCCTCGGC GGAGGGCACA CCATACTGGC CGCCGAAAAG GCCGGAGTAA TCGACAAGGC CTTCCACGTC TCGACGGGAG GCCGCGCCTT CATATCAACA ATCGGCGGCG AGGAAATGCC CGCCGTGAAA GCGTTATTAA CCTCGGCCGC GAAGTTTAGG CTATGA
|
Protein sequence | MSNMLLNEVL NQLPNINKCL EKGKKLIIRI DINSPIINGK IIDDYRIRAH SYTLRLASDA GARTVVLAHQ GRPGQDDFTS LEVHKPYIEK YLERPIKFVD DIIGPEARRQ IKELKDGEIL LLENVRMLSE EVIEKIPEAQ AETMLVKKLA PLADYYVFDG FAVAHRSQPS VVGFPMVMPS CMGPVFEKEL RALSVVFEKR GKGVVLLAGG AKIPDTIKAV EQLLKNGFVE KVAFGGLVGF IFTVAKHGVL NAALKQEVEK GGYLPYVERA RQLLSKYGEK IEVPVDFAVS QNGRIDVDAF SLAQQPLDIG KSTTIRYKEV IDQAEVVIFS GPMGYVEDER FATGTLELLR AAAKKKLILG GGHTILAAEK AGVIDKAFHV STGGRAFIST IGGEEMPAVK ALLTSAAKFR L
|
| |