Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1653 |
Symbol | pgk |
ID | 5104858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1594661 |
End bp | 1595884 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507544 |
Product | phosphoglycerate kinase |
Protein accession | YP_001191732 |
Protein GI | 146304416 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0126] 3-phosphoglycerate kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.973793 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATTC CCACACTGGA TGATCTCAAC TTCAGCAACA GCAAGGTTCT TGTGAGGATT GACATAAACT CGCCCGTTGA TTCAAAGACG GGCAAACTTC TAGACGATTC CAGGATTAAG GCTCACGTTG AGACTATTCG TGAACTTGTG AGCCGAGGGA ATGGGGTGGT CCTGGTCTCG CATCAGGGAA GACCAGGCGA CAATGACTTT ACGGATCTCG AGGAACACTC GAGACTACTA CAGAAACATC TCGATATGGA AGTTGAGTTT GTGGGGGACG TCATGGGGCC CTATGCCAGG GAGAGGATAA GGAACATGAA ATTAGGGTCC GTCCTTCTTC TAGATAATGT CAGGTTCGTA TCGGAGGAGT TAATAGAGGC CTCTCCCCTT CAACACTCGA GGAGCTTTCT GGTTCGCAGA CTTCAACCAC TCTTCCAGGG TTACGTGAAC GACGCCTTTG CTACAGCCCA CAGAAGTCAG GCTAGCCTCG TGGGTTTCCC ACTTGTCCTA AGATCCAGTG CAGGAAGGGT AATGGAGAAG GAGGTATCAG CTCTAGCCAA GGTCTTCAAC GAGGGAGAGG AGCCCAAGGT CTTCATCATG GGTGGCGGGA AGGTTCTGGA TAGCCTCAGG ATAATTGAGA ATCTCGTGAA AAGGAGACTA GCTGATAGGA TATTGACCGG CGGGCTCATT GCTGAGCTCT TCGCGGTTGC CAAGGGACTA GACCTGGGGA AAGACAATCT TCAGGTTCTT GAGAAGGCCG GAGTTCTTAG CTTAGTTCCA AGGGCGAGAA AACTTCTCTT GTCTGGAGCT CCCGTGGAGA TTCCCGTAGA CTTCAAGGTG GAGAAGGGAG GTCAGGTATA CGAAGAACCT GCAAACAAGG TCACAGCAGT AATCAAGGAC GTAGGCTCAA CTACAGTTGG GATATATTCG TCATTTATAA GGGACGCGAA GATCATTGTC ATGAGGGGAC CAATGGGGGT AATAGAGGAC GAAAGGTTCA GGGAATCTAG TAGAACGCTT CTCAAGAACT CACTGGAGAG TCCCGGTTAC CTCATTGTCG GGGGTGGTCA CATGATCAGC CTCCTTGGGG GAATAGGACC CCTAGATAGT TCCAAGGTTC ACGTATCCAC AGGAGGTGGA GCTCTCCTCC TTTTCCTTGC AGGCGAAAGG TTACCGGCCC TTGAGGCCCT AGACATTTCA GCCAGGGAGG TGCTCAAGAA ATGA
|
Protein sequence | MNIPTLDDLN FSNSKVLVRI DINSPVDSKT GKLLDDSRIK AHVETIRELV SRGNGVVLVS HQGRPGDNDF TDLEEHSRLL QKHLDMEVEF VGDVMGPYAR ERIRNMKLGS VLLLDNVRFV SEELIEASPL QHSRSFLVRR LQPLFQGYVN DAFATAHRSQ ASLVGFPLVL RSSAGRVMEK EVSALAKVFN EGEEPKVFIM GGGKVLDSLR IIENLVKRRL ADRILTGGLI AELFAVAKGL DLGKDNLQVL EKAGVLSLVP RARKLLLSGA PVEIPVDFKV EKGGQVYEEP ANKVTAVIKD VGSTTVGIYS SFIRDAKIIV MRGPMGVIED ERFRESSRTL LKNSLESPGY LIVGGGHMIS LLGGIGPLDS SKVHVSTGGG ALLLFLAGER LPALEALDIS AREVLKK
|
| |