Gene Tpen_1584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1584 
Symbol 
ID4600548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1535505 
End bp1536575 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content60% 
IMG OID639774357 
ProductATPase 
Protein accessionYP_920982 
Protein GI119720487 
COG category[R] General function prediction only 
COG ID[COG1672] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTATTCG ACCCTAGGCC GAAAGAGAGG AAGGAGGATC TTTACGACTT TGAAAGGGAG 
CTGGGAGAAC TTGAGAAGTT CGTCGGGGAG CCTCTGGTCG TGGTTTCGGG GCTGAGGAGG
ACTGGGAAGA CCTCGCTCAT CCTAACGGCT CTCAACGAGC TGGGAGAGGT GTACGTGTAC
GTCGACCTGA GGGAGGGGGC TTCGTCCGCC GCCGGGATCT ACAAACTTTT GTCGAGGAGC
TTCGGCGAGA CCTACCGGAG GCTGGGGGGA GGGGTAGGGG AGCTTTTCCG AAGGGTTTTG
TCGAGAGTTA GGGGTGTAAG CTTCATGGGG CTCGAGGTCT CCCTGGACTG GTCTCCAAGG
GGGAGGCCCT TGCTGAGCGA GCTCTTCGAC GCGCTGGACG AGCTCGGGGA GAGGCTCGGG
AGGAAGGTCG TGGTCGTGCT AGACGAGTTT CAGAGGGCTA GGACGGGCTT CGGGCTCGCG
TTGCAAAACG CTGTGGCCCA CTCCTACGAC TTCAACAGGA ATGTTTCCTT CGTGCTGTCA
GGCTCTGAGA TGGGCGTGCT CTACGGGGTT CTCGGCAACC CCGAGGGGCC TCTCTACGGG
AGGGCTTACG TGGAGGTTAG GACGAGGAAG CTCTCGGCGG AGGAGTCGCT GGACTTTCTC
AGGAGGGGCT TCGAGGAGGC AGGCCTCGAG GTTGAGGGGA GGGAGCTGGA GAGAGCTGTG
AACGAGCTGG ACGGCATAAT AGGGTGGCTT ACGTACTACG GCTACCTGAG GCTACGCGGG
AAAGGCTCCC TCGAGGAGAT AGTTAGCGAG GCCACAGCCC TCGCTAGGAG GGAGCTCGAA
GAATTCCTCT CCACGCGTAT GAGCAGGAGG TACAGGCTGG TGATGAGGTT GCTCGCGAGA
GGCGTGAAGG AATGGAGCGA GCTGAAGAGA GAGCTGGAGA GGGCCGAGGG TAGAGAGGTC
AGCGACAGGG CGCTATACGA GGTTCTCCAG CACCTGAAGA AGCACTCCTT GATAGACGAG
AACAACGAGT ATACGGACCC GGTGAACAGG CGAGCCGCGC TGGAGCTATA G
 
Protein sequence
MLFDPRPKER KEDLYDFERE LGELEKFVGE PLVVVSGLRR TGKTSLILTA LNELGEVYVY 
VDLREGASSA AGIYKLLSRS FGETYRRLGG GVGELFRRVL SRVRGVSFMG LEVSLDWSPR
GRPLLSELFD ALDELGERLG RKVVVVLDEF QRARTGFGLA LQNAVAHSYD FNRNVSFVLS
GSEMGVLYGV LGNPEGPLYG RAYVEVRTRK LSAEESLDFL RRGFEEAGLE VEGRELERAV
NELDGIIGWL TYYGYLRLRG KGSLEEIVSE ATALARRELE EFLSTRMSRR YRLVMRLLAR
GVKEWSELKR ELERAEGREV SDRALYEVLQ HLKKHSLIDE NNEYTDPVNR RAALEL