Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0764 |
Symbol | |
ID | 4601086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 716841 |
End bp | 717881 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639773540 |
Product | hypothetical protein |
Protein accession | YP_920169 |
Protein GI | 119719674 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG3425] 3-hydroxy-3-methylglutaryl CoA synthase |
TIGRFAM ID | [TIGR00748] hydroxymethylglutaryl-CoA synthase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.128662 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGGAA TAGCGTCCAT ACTAGGCTAC GGTGCGTACA TCCCTGTCTA CAGGATCGAG AGCGGGGAGA TATCGAGGGT GCACACTAAG GGCGGAGAGA AGGCTCCCGT GAAGCAGAAG AGCGTGCCCG GGCCCGACGA GGACTCGCTA ACGATGGCCT ACGAGGCGTC TAAGAACGCC TTGAAACGAG CTAGACTAGA CCCCCGGGAG GTTCAGGCTT TGTACATCGG GTCGGAGAGC CCGCCCTACG CCGTCAAGCC GTCCGCGACT GTTGTGGCGG AGGCTCTTGG GCTGTCGAGG AAGCTCTACG GGATAGACAT GGAGTTCGCG TGCAAGGCGG GGACTACCGC GCTTATAAGC GTGGCAGGGT TAGTCAAGTC GGGTATAATC ACGTACGGGT TGGCGGTGGG CACGGACACG GCGCAGGGAA GACCCGGCGA TGAGCTCGAG TACACCGCGG GTGCCGGGGC GGCGGCGTTC GTCGTGTCTC CTGTGCGCAG CGACGCGGTG GCAACGATCG AGCACGTCTA CTCCTACGTT ACCGATACGC CCGACTTCTG GCGGAGGGAG GGGGAGAGGT TCCCGATGCA CACGTTCCGC TTCACCGGCG AGCCCGCGTA CTTCCACCAT ATAGTCAGCG CGGCGAAGGG GCTCTTCGAG GAGACGGGGC TTAAGCCCTC GGACTTCGCC TACGCTGTTT TCCACCAGCC GAACGTAAAG TTCCCGCAGA GGGTGGGGGC GATGCTGGGC TTCAAGCCTG AGCAGCTAAA GCTGGGCTTG CTCTCGGGCG AGATCGGCAA CACCTACGCA GCCGCCTCGC TGATAGGGTT GACCAACGTT TTGGACCACG CTAAGCCTGG CGAGAGAATA CTTGTGGTAT CCTTCGGTAG CGGCGCTGGC TCGGACGCGC TGAGTATAGT GGTCGAGGAG GGTATCGAGG ACCGCCGCGG GCTGGCGCCT CTGACGATGC AGTACGTGAA GAGGGCCAAA CTAGTCGATT ACGCCGTTTA CCTTAAGTAC AAGGACTTCA TAAAGAGGTG A
|
Protein sequence | MPGIASILGY GAYIPVYRIE SGEISRVHTK GGEKAPVKQK SVPGPDEDSL TMAYEASKNA LKRARLDPRE VQALYIGSES PPYAVKPSAT VVAEALGLSR KLYGIDMEFA CKAGTTALIS VAGLVKSGII TYGLAVGTDT AQGRPGDELE YTAGAGAAAF VVSPVRSDAV ATIEHVYSYV TDTPDFWRRE GERFPMHTFR FTGEPAYFHH IVSAAKGLFE ETGLKPSDFA YAVFHQPNVK FPQRVGAMLG FKPEQLKLGL LSGEIGNTYA AASLIGLTNV LDHAKPGERI LVVSFGSGAG SDALSIVVEE GIEDRRGLAP LTMQYVKRAK LVDYAVYLKY KDFIKR
|
| |