Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0757 |
Symbol | |
ID | 4601078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 708350 |
End bp | 709378 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639773533 |
Product | glyceraldehyde-3-phosphate dehydrogenase, type I |
Protein accession | YP_920162 |
Protein GI | 119719667 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase |
TIGRFAM ID | [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAC GCGTCGGTAT TAACGGGTTT GGACGCATAG GGAGAAACTT TCTGAGAGCG GCCCTAAAGA ACGAGAAGTT CTTCGACAAG TTCGAGATCG TGGCAATCAA CGACCTCGGA TCCCCGAAAA TGCTGGCATA CCTGCTGAAG TACGACTCGG TATTCGGACG CCTACCCAAC AAGGTCGAGG TTAAGGACAA CAAGCTCGTT GTAGACGGCA TGGAGATGCT TGTGCTAAAC GAGGCTAGCC CCGAGAAGCT ACCCTGGAAA GATCTCGGAG TAGACGTCGC GCTCGAAGCC ACGGGTAGGT TCACAGACAG GGAGAAGGCC GCCTTGCACC TGAAGGCAGG CGCGAAGAAG GTTGTCGTCA CAGCGCCTTC CAAGGGTGCC GACGTGACCA TAGTGATGGG CGTTAACCAC CACATGTACG ACCCGAAGAA GCACGAAGTC ATCTCCAATG CTTCGTGTAC TACGAACTGC CTCGCCCCAG TAGTGTACGT GCTTCTCAAA AACTTCGGCC TAGAGTGGGG ATTCATGACC ACTGTACACG CCTACACAAA CGACCAGAGA GTACTGGACC TCATACACCC GGAGGACTTT AGGAGGACGC GCGCCGCCGC GTTGAACATA ATCCCGACGA CGACTGGGGC TGCAAGGGCG CTACACCTGG TAATCCCCGA GGTAAAGGGC AAGCTGGACG GTATGGCGAT GCGCGTCCCG GTTGCTGACG GCTCAGTCGT GGACCTCGTG GCTCAAATAG GGAGGGAGAT AACCAAGGAA GAGCTCGACG CGGCCTTCAA GGAGGCTGCC GAGACATACC TGAAGGGTAT CCTGGAGTAC GTCGACGAGC CGATAGTGTC CTCGGACATA GTCGGCAACC CACACTCCTC CATCTACGAT GCCCAGGCAA GCATGGTGCT CGGGGGCAAG AGCAACAAGG TAAAGGTGGT CGCTTGGTAC GACAACGAGT GGGGCTTCAG CAACAGGCTC GTCGACTTGC TACTGTACAT GTCCGAGAAA GGAATCTAA
|
Protein sequence | MKIRVGINGF GRIGRNFLRA ALKNEKFFDK FEIVAINDLG SPKMLAYLLK YDSVFGRLPN KVEVKDNKLV VDGMEMLVLN EASPEKLPWK DLGVDVALEA TGRFTDREKA ALHLKAGAKK VVVTAPSKGA DVTIVMGVNH HMYDPKKHEV ISNASCTTNC LAPVVYVLLK NFGLEWGFMT TVHAYTNDQR VLDLIHPEDF RRTRAAALNI IPTTTGAARA LHLVIPEVKG KLDGMAMRVP VADGSVVDLV AQIGREITKE ELDAAFKEAA ETYLKGILEY VDEPIVSSDI VGNPHSSIYD AQASMVLGGK SNKVKVVAWY DNEWGFSNRL VDLLLYMSEK GI
|
| |