Gene Tpen_0041 details


Gene Information

Locus tag: Tpen_0041
Symbol:
ID: 4600982
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 29704
End bp: 30792
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 58%
IMG OID: 639772794
Product: radical SAM domain-containing protein
Protein accession: YP_919454
Protein GI: 119718959
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1180] Pyruvate-formate lyase-activating enzyme
TIGRFAM ID:
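
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the numbers in this record. The function and variable names are illustrative and not part of the database.

```python
# Values copied from the record above; names are illustrative.
start_bp = 29704
end_bp = 30792


def gene_length(start, end):
    """Length in bp of a feature with inclusive, 1-based coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1089 362
```

Both results agree with the Gene Length and Protein Length fields.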


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.821324
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGATA AGTTGCTGGG TAAACCGTTC GTCAGGGAGG CGGCATTCTG GGAGCCGGTT 
CAGGGGAAGC CGGGCTACGT GAAGTGCAAT CTCTGCAACA GGAGGTGCGT GATAGCCCCC
GGTAGGTTCG GGGTTTGCGG TGTGAGGAAG AATATCGACG GCAAGCTTTA CACGCTGGTC
TACGGCCTCT TGACAGCCGC GAATCTAGAC CCTATCGAGA AGAAGCCTCT CTCCCACTTC
TACCCGGGTA GCGCGGTGTT CTCGGTGTCC ACGCCCGGCT GCAACTTTTT CTGCCAGTTC
TGCCAGAACT GGGAGATAAG CCAGAGCAGG CTGGAGAGAG GGCTCTACGG GCACTACTAC
CCCCCGGAGG ACGTCGTAAG GGAGGCTAAG AGGCTGCAGG CGGACGGGAT CTCGTACACC
TACAACGAGC CAACGATATT CTACGAGTTC ATGCTGGACA CTGCGCGCCT AGCGAAGAAG
GAGGGCCTCT TCAACACGAT GGTTACGAAC GGCTACATAT CCCCGGAGGC CCTCGACGAG
CTGGCGCCCT ACCTGGACGC CGCCACCGTG GACTTCAAGG GAGGAGGCGA CCCGGAGTTT
TACAGAAAGT TTATGGGGGT GCCAGACCCA AGCCCCATCT ACGACACGTT GCTCAGAATG
AAGGAGAAGG GGATCCACGT AGAGATAACG AACCTTGTGG TTCCAATAGT GGGCGACGAC
GAGGAGAAGC TGAGGTCCCT GGCTAGGTGG GTAGCGGAGA ACTTGGGCGA CGAGACGCCC
TTCCACCTCC TGAGGTTCTA CCCCCACTAC AAGATGATCG ACTACCCGCC GACGGAGGTC
GGGGACCTCG AAAAGCTCGC GGGGGTGGCG AGGGAGGAGG GGCTCAAGTA CGTCTACATA
GGGAACGTGT GGGGGCACCC CCTCGAGAAC ACTTACTGCC CGAAGTGCGG CCACAGGGTC
ATAGAGAGGA GGGGCTTCTT CATAGTGAAG TGGGATTTAA CGGAGGACAA CAGGTGCCCG
GTGTGCGGCG CGAAGATAAA CATAAAGGGG AGCTACAGGA AAAGAAGCTG GGACGTCTTC
TTCTACTAG
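
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute the G+C fraction that the GC content field summarises. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, as printed in this record.
first_row = "ATGAGCGATA AGTTGCTGGG TAAACCGTTC GTCAGGGAGG CGGCATTCTG GGAGCCGGTT"
print(len(clean_sequence(first_row)))
print(round(100 * gc_content(first_row)))
```

Passing the complete gene sequence instead of `first_row` gives the value to compare against the GC content field.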
 
Protein sequence
MSDKLLGKPF VREAAFWEPV QGKPGYVKCN LCNRRCVIAP GRFGVCGVRK NIDGKLYTLV 
YGLLTAANLD PIEKKPLSHF YPGSAVFSVS TPGCNFFCQF CQNWEISQSR LERGLYGHYY
PPEDVVREAK RLQADGISYT YNEPTIFYEF MLDTARLAKK EGLFNTMVTN GYISPEALDE
LAPYLDAATV DFKGGGDPEF YRKFMGVPDP SPIYDTLLRM KEKGIHVEIT NLVVPIVGDD
EEKLRSLARW VAENLGDETP FHLLRFYPHY KMIDYPPTEV GDLEKLAGVA REEGLKYVYI
GNVWGHPLEN TYCPKCGHRV IERRGFFIVK WDLTEDNRCP VCGAKINIKG SYRKRSWDVF
FY
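
The protein sequence can be re-derived from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the usual TCAG-ordered amino-acid string. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, which does not matter here because the gene begins with ATG. The function name and the choice to stop at the first stop codon are assumptions made for illustration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, with each codon position ordered T, C, A, G.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGAGCGATA AGTTGCTG"))  # MSDKLL
print(translate("TTCTACTAG"))            # FY
```

The two sample inputs are the first 18 bases and the last 9 bases of the gene sequence; their translations match the start (MSDKLL) and the end (FY) of the protein sequence above.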