Gene Pars_0744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0744 
Symbolpgk 
ID5054931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp661636 
End bp662871 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content53% 
IMG OID640468302 
Productphosphoglycerate kinase 
Protein accessionYP_001152982 
Protein GI145590980 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0126] 3-phosphoglycerate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.333208 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAACA TGTTACTTAA TGAAGTTCTT AACCAGTTGC CAAATATAAA TAAATGCTTA 
GAAAAAGGAA AAAAATTAAT TATAAGAATA GACATAAACT CGCCAATTAT AAACGGTAAA
ATTATTGACG ATTACAGAAT ACGCGCCCAT TCATACACGC TTAGGCTTGC CTCAGACGCC
GGGGCGAGGA CCGTGGTGCT GGCACACCAG GGCAGGCCGG GGCAAGACGA CTTCACATCT
TTAGAGGTGC ACAAGCCCTA CATTGAGAAG TACTTAGAGA GGCCCATAAA ATTCGTCGAC
GACATCATAG GGCCTGAGGC ACGGAGACAG ATTAAGGAGC TGAAAGACGG CGAGATCTTA
CTGCTAGAAA ACGTGAGGAT GTTGTCGGAG GAGGTCATCG AAAAGATCCC AGAGGCCCAG
GCAGAGACCA TGTTGGTGAA GAAGCTGGCG CCGCTGGCGG ACTACTACGT CTTCGACGGA
TTTGCCGTGG CTCACAGATC CCAGCCCAGC GTCGTGGGGT TCCCCATGGT GATGCCCTCC
TGTATGGGCC CCGTCTTCGA GAAGGAGCTG AGAGCGCTGA GCGTGGTGTT CGAGAAGCGT
GGAAAAGGAG TAGTCCTCTT GGCAGGGGGG GCCAAGATCC CAGATACTAT AAAAGCCGTG
GAACAGCTAC TCAAAAACGG CTTTGTGGAA AAGGTGGCCT TCGGCGGCTT GGTGGGCTTC
ATCTTCACCG TGGCAAAACA CGGAGTTTTG AACGCGGCCT TAAAACAGGA GGTGGAAAAA
GGCGGGTACC TCCCCTATGT GGAAAGAGCG CGCCAGCTAC TAAGCAAATA CGGAGAGAAG
ATAGAAGTGC CGGTCGACTT TGCGGTTAGC CAGAACGGGA GGATCGACGT CGACGCCTTC
TCCCTAGCGC AACAACCGCT AGACATAGGC AAATCCACAA CGATACGATA CAAGGAAGTC
ATCGACCAGG CGGAGGTGGT CATATTCAGC GGCCCAATGG GCTATGTAGA AGACGAGAGG
TTCGCCACAG GTACGTTGGA GTTGCTAAGA GCCGCCGCCA AGAAGAAGCT CATCCTCGGC
GGAGGGCACA CCATACTGGC CGCCGAAAAG GCCGGAGTAA TCGACAAGGC CTTCCACGTC
TCGACGGGAG GCCGCGCCTT CATATCAACA ATCGGCGGCG AGGAAATGCC CGCCGTGAAA
GCGTTATTAA CCTCGGCCGC GAAGTTTAGG CTATGA
 
Protein sequence
MSNMLLNEVL NQLPNINKCL EKGKKLIIRI DINSPIINGK IIDDYRIRAH SYTLRLASDA 
GARTVVLAHQ GRPGQDDFTS LEVHKPYIEK YLERPIKFVD DIIGPEARRQ IKELKDGEIL
LLENVRMLSE EVIEKIPEAQ AETMLVKKLA PLADYYVFDG FAVAHRSQPS VVGFPMVMPS
CMGPVFEKEL RALSVVFEKR GKGVVLLAGG AKIPDTIKAV EQLLKNGFVE KVAFGGLVGF
IFTVAKHGVL NAALKQEVEK GGYLPYVERA RQLLSKYGEK IEVPVDFAVS QNGRIDVDAF
SLAQQPLDIG KSTTIRYKEV IDQAEVVIFS GPMGYVEDER FATGTLELLR AAAKKKLILG
GGHTILAAEK AGVIDKAFHV STGGRAFIST IGGEEMPAVK ALLTSAAKFR L