Gene Tpen_0764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0764 
Symbol 
ID4601086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp716841 
End bp717881 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content61% 
IMG OID639773540 
Producthypothetical protein 
Protein accessionYP_920169 
Protein GI119719674 
COG category[I] Lipid transport and metabolism 
COG ID[COG3425] 3-hydroxy-3-methylglutaryl CoA synthase 
TIGRFAM ID[TIGR00748] hydroxymethylglutaryl-CoA synthase, putative 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.128662 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGGAA TAGCGTCCAT ACTAGGCTAC GGTGCGTACA TCCCTGTCTA CAGGATCGAG 
AGCGGGGAGA TATCGAGGGT GCACACTAAG GGCGGAGAGA AGGCTCCCGT GAAGCAGAAG
AGCGTGCCCG GGCCCGACGA GGACTCGCTA ACGATGGCCT ACGAGGCGTC TAAGAACGCC
TTGAAACGAG CTAGACTAGA CCCCCGGGAG GTTCAGGCTT TGTACATCGG GTCGGAGAGC
CCGCCCTACG CCGTCAAGCC GTCCGCGACT GTTGTGGCGG AGGCTCTTGG GCTGTCGAGG
AAGCTCTACG GGATAGACAT GGAGTTCGCG TGCAAGGCGG GGACTACCGC GCTTATAAGC
GTGGCAGGGT TAGTCAAGTC GGGTATAATC ACGTACGGGT TGGCGGTGGG CACGGACACG
GCGCAGGGAA GACCCGGCGA TGAGCTCGAG TACACCGCGG GTGCCGGGGC GGCGGCGTTC
GTCGTGTCTC CTGTGCGCAG CGACGCGGTG GCAACGATCG AGCACGTCTA CTCCTACGTT
ACCGATACGC CCGACTTCTG GCGGAGGGAG GGGGAGAGGT TCCCGATGCA CACGTTCCGC
TTCACCGGCG AGCCCGCGTA CTTCCACCAT ATAGTCAGCG CGGCGAAGGG GCTCTTCGAG
GAGACGGGGC TTAAGCCCTC GGACTTCGCC TACGCTGTTT TCCACCAGCC GAACGTAAAG
TTCCCGCAGA GGGTGGGGGC GATGCTGGGC TTCAAGCCTG AGCAGCTAAA GCTGGGCTTG
CTCTCGGGCG AGATCGGCAA CACCTACGCA GCCGCCTCGC TGATAGGGTT GACCAACGTT
TTGGACCACG CTAAGCCTGG CGAGAGAATA CTTGTGGTAT CCTTCGGTAG CGGCGCTGGC
TCGGACGCGC TGAGTATAGT GGTCGAGGAG GGTATCGAGG ACCGCCGCGG GCTGGCGCCT
CTGACGATGC AGTACGTGAA GAGGGCCAAA CTAGTCGATT ACGCCGTTTA CCTTAAGTAC
AAGGACTTCA TAAAGAGGTG A
 
Protein sequence
MPGIASILGY GAYIPVYRIE SGEISRVHTK GGEKAPVKQK SVPGPDEDSL TMAYEASKNA 
LKRARLDPRE VQALYIGSES PPYAVKPSAT VVAEALGLSR KLYGIDMEFA CKAGTTALIS
VAGLVKSGII TYGLAVGTDT AQGRPGDELE YTAGAGAAAF VVSPVRSDAV ATIEHVYSYV
TDTPDFWRRE GERFPMHTFR FTGEPAYFHH IVSAAKGLFE ETGLKPSDFA YAVFHQPNVK
FPQRVGAMLG FKPEQLKLGL LSGEIGNTYA AASLIGLTNV LDHAKPGERI LVVSFGSGAG
SDALSIVVEE GIEDRRGLAP LTMQYVKRAK LVDYAVYLKY KDFIKR