Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0773 |
Symbol | |
ID | 4601095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 723975 |
End bp | 725204 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639773549 |
Product | phosphoglycerate kinase |
Protein accession | YP_920178 |
Protein GI | 119719683 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0126] 3-phosphoglycerate kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.1233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGAAG GACTGGGTAT AAAGACCCTC GACGACGTGG ACGTTCGCGG GAAGACTGTC GGCGTTCGCG TCGACTTTAA CTCCCCGGTA GACCCCCAGA CGAAGAGGCT TCTGGACGAC ACGCGTATAC GCGCCCATGC CGAGACGACC ATAAGGGAGC TCGTAGAGAA GAAAGCGAAG GTCGTGGTTC TGTCGCACCA GGGCCGTAAG GGAGACCCGG ACTTCACCAG CCTACGCGAG CACGCGGAGG TTCTCTCGAG GCTTGTCCCG GCGCGCGTAA AGTTCGTAGA CGACATCTTC GGCGAAAAGG CCGTAAAGGA GATCAAAGCT CTATCGCCGG GAGAGGTTCT GGTACTTGAG AACGTGAGGA TGTGGGACGG AGAAGCGAAG AACGCGTCCC CGGAGGAGCA CGCGAAGACC CCCCTCGTAC AAGCGCTGGC ACCGCTCCTG GAGGTCTACG TGGTGGACGC TTTCTCCGCT GCGCATAGGC CGCATGCATC CCTGGTCGGC TTTGCACCCG TAGTGAAGCA CTTCGTGGCG GGACGGGTGA TGGAGCGGGA GCTCCAAGCT CTCTACAGGG TGAGGAACAA CCCGGAGCGC CCATGCGTCT ACGTGATCGG TGGCGCTAAG GCGGAGGACA CCGCCGAGAT CATTTCGAGC GTCCTTGGAA ACAACATCGC CGACAAGGTG CTCACAGGAG GGTTAGCCGC GAACCTCCTC CTACACTCCT CCGGGAAGAA GATCGGAGAC GTTAACGTCT CCGTGCTGAA GGAGAAGGGA TTCCTGGACC TCGAGCCGGA GCTAAAGAAG CTCCTCGACA AGTACGGGGA CAGGATAGTT CTCCCGGTCG ACTTAGCGGT CGAGGAGAGC GGGAAGAGGG TGGAGGTCGG AGTAGACTCC GTACCGAACC TCCCGATAAA GGACATAGGC TCTAGGACGG CGGAGGAGTA TGCCAGGATA ATCTCGGAAG CTAGAACAGT CGTAATGAAC GGCCCGATGG GGGTCTTCGA GGACGAAAAG TTCAGCCTGG GCACGAAGAA GGTCTTCGAG GCTATGGCGT CCTCCAAGGC GTTCACGCTT ATAGGCGGAG GTCACACGAT AGCGGCCGCT TCTAAGCTGG GCTTCGCGGA CAAGCTATCC CACGTGAGCA CGGGTGGCGG AGCGCTAATC GAGTACCTTA TCAAGGGCAC TCTCCCCGTG ATAGAGGTTC TAAAGAAGTA CTCGAAGTAG
|
Protein sequence | MIEGLGIKTL DDVDVRGKTV GVRVDFNSPV DPQTKRLLDD TRIRAHAETT IRELVEKKAK VVVLSHQGRK GDPDFTSLRE HAEVLSRLVP ARVKFVDDIF GEKAVKEIKA LSPGEVLVLE NVRMWDGEAK NASPEEHAKT PLVQALAPLL EVYVVDAFSA AHRPHASLVG FAPVVKHFVA GRVMERELQA LYRVRNNPER PCVYVIGGAK AEDTAEIISS VLGNNIADKV LTGGLAANLL LHSSGKKIGD VNVSVLKEKG FLDLEPELKK LLDKYGDRIV LPVDLAVEES GKRVEVGVDS VPNLPIKDIG SRTAEEYARI ISEARTVVMN GPMGVFEDEK FSLGTKKVFE AMASSKAFTL IGGGHTIAAA SKLGFADKLS HVSTGGGALI EYLIKGTLPV IEVLKKYSK
|
| |