Gene Tpen_0982 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0982 
Symbol 
ID4600456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp928490 
End bp931495 
Gene Length3006 bp 
Protein Length1001 aa 
Translation table11 
GC content51% 
IMG OID639773760 
Producthypothetical protein 
Protein accessionYP_920385 
Protein GI119719890 
COG category[L] Replication, recombination and repair 
COG ID[COG1743] Adenine-specific DNA methylase containing a Zn-ribbon 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGAGG AAAAGCCTAC TCTACTAGAG TCGCCGAGCT TCCCGATAGA AAGTATTAAC 
AAGGCTTCGA AGTCTGAGAA GACGGGTGGA GGGAGGCCCC CTTACTGGGA GATGGTTTTC
TGGTGGACCA GGAAGCCTCT TGCCGGGGCA AGAGCAATAA TAGCGGCTTC GCTACTATCA
CAGGACGACT ACCCAGAAAG CTACAACTTC CTAAAAGACC TCTTCCCCTG TATGGACAAG
AGGACTCCTC ATTCTTGCAA CCCTAACCAA AGACTCGTAG AGAAACTCAA GGGAAAGAAG
CTCCTAGACC CCTTCGCCGG CTTTGGCTCA ATACCCCTAG AGGCTGCAAG GCTAGGCCTC
GACGTAACCG CGGTCGAGCT ACTGCCGACA GCTTACGTGT TCCTCAAGGC TGTATTGGAA
TATCCAAAGG AGTACGGCAA AAGGCTTATC GAGATAAGCG GGAAAGAGGT CGAGAGCCTC
GGCTTAAGAG ACGCTGTCAG GCGGTTCAAC GGCTCGGCAA AGATAATAGA GACGGGCAGG
TACAAGGTGC CGCTACTAAT ATACGACGTG GCCAGGTGGG GCAGGTGGGT AACCGAGGAG
CTTAAAAAAG ACCCAGACTT CAAAGAACTC TACGACGAAG ACGTCGCAGT ATACATAGGT
ACATGGGAAA TTAAATGCCC CGTCTGCGGG CGCTACACCC CACTTGTAGG CAACTGGTGG
CTTGCCAGAG TAAAGTCGAA ACGCGGCTAC GAGAGGCTAG CCTGGATGCA ATGGAGAGAC
GGAGAAATAG AGGTTGTAGA CCTCAACGAA GCATGCAAGA AGACCGGAAG AAGCTCATGC
AACGAGCTTC TCGCCAAGGT ACAAGGCAAA GATGAAGAGT CCGGTGCTAG GGTAGAATGG
AACGGCCAGG TATACGTTGT CCCCTCAAAG AATATCAACG CTAAGTTGGA AGAAGCTCAA
TGCCTTTACT GTAGGGCAAA AATCGACCAC CGCGTAAAAG AAAACAGAAT ATTGAAACCT
GTGAAAAATA AGAAAAAAGA AGGAGAATGG TACGTAAAAT GGGCTCTTCA ACGCTGGAAC
AGCCTCCTAG AAGATTATCT CTTTGGGAAG GTAAGCTTGG AGGAACTGAG AAACGCGCCC
GCCAGGCCCA GGATACTGGT CAAAGTCAGG GTCACAGACG GGGACCTCGA GTTCGAGCCT
GCAACACGAG AGGACACAGA GAAGTTATGG AAAGCCCTCG AAAAACTGAA GCAAAAGTGG
AAAGAACCGG ATGTACCATC AGAAGAGTTA TGGAAGTATA CTGCAAGTGG CGGAGGCGCG
CTGAGCATAT GGACATGGGG CTTTGACAAA TTCTACAAAC TTTTCAATCC TAGGCAGTTG
CTGACATTGG TTAAGCTCGT CAGGCTAGTG AGGGAGGCCG GGAAGAGCGT CGAGGAGGAA
AAGCTGAAGG AGGGCTGGAG CAAGGAAGAC TCCTTTAGGT ACGCCGAAGC GATAACAACA
TACCTGGCAA TAGCATTATG TAAACAAATA AATTATGACA GTATTGTAAC ATCTACAGAG
CCTGTACAGA AATTCATCCG AGAAACATTA GCGTTTAGAG GCATTGCTAT GACATGGAAC
TGGGTAGAGG AGTTACCTGT AGCAGACGTT CTGGGTTCAT ATATAAGGTC GTTAAATTCC
AGTGTTGGTA GCTTATCTTA TCTTGTTTCT GCTGTGTATG GTAGCCCTAG CAGGGTTAAG
GTTTTGCTCG ACGACGCTAC AACTCTGGAC ATGCTTGTGG GCGAGAAGTT CGACCTGATA
GTCACGGATC CGCCTTACGC CGACGACGTG CCGTATACGG AGCTGAGCGA CTTTTACTAC
GTGTGGCTCA AGAGAGCTTT AAGCGATGTC TCGGGCGGGA AGCTTATTCC CAGATTTCTG
CCGGAAGCCT TCTTTGACGA GTTCGGAGAG GAGATAAAGA CTCAGTGGGA GACCTTTGCT
ACGAGAGAGG TCAGCGAGAA TACTGAGCGT TGGAAATACT TTAAGCTGAA CATTTCCTTC
AGTGAACTTC TGGCTAGGGC TTTTGCTAAC GTTACAAGGT TCCTTGATGA AAAGGGTCTG
CTGGTAACGT ACTATGTTGC TAAGAAGCCG GAGGCTTGGG TAGCCTTGAT AGATGCGCTC
TGGCATATCA ATGGTATGAG GGTTGTAGTT GCTTACCCCG TGGTTACGGA GGCTGAAGAA
AATGTGGTAG CAAGGGGAAG GGCGGCAGTG ATGGGAGGCT ATGTTATGGG ATGGCGGAGG
AGGGAGGTGG AGAAGCCCTT GGATCTCTCG AGTGAGAAAG AGGCTGTTGT GACTACCGTC
TCAGAGCGTC TAGGTAATTA CTTAAAGGCC ATAGATGTGA AGGAAGGTGC TACGGCGTGG
GTTTATGCTT ATCTTGCGGC TCTTAGCTAT TTAACTTCTT TCTACCCCGT AAAGGATGGG
GGCGTGGAGC TAGATGCTGA GGGTGTTGTT AGCCATGCGA TGGCGTTGTC CTTTGAAGCT
ATGTTGAGGA AAGCTGGTGT AAACCTGCAT GACCCGGCGG CGCTGGCATA CCTTGCGCTG
AGAGTTGTGG AGGATGAGAA GGGTAGGGTT GACAGCGACG TGCTTTCTCG GGTGGCGTTA
GGGCTTGGGA TTAGAGACGT GGAGCTCGTT AAACTGGGGC TTGTCAGGGA GGTTCGGAGC
GGAGGGCCTA AGGTGGCTAA GCGTAAGGTG TTCGAGGTTA TGGCGCCCAG GAACGAGACG
GTCGACGAGG TTAGGCGCGT GTTGTACCCG TTGCGGGGGA AAGCTCCTGT GCTGGAGTGT
TTTAGGAATC TTCAGCTCTC GGTGCTAGCG AGAACCCAGG TATCCTGTGA TCAGCGGGCT
AGGGAGGAGG CGAAGGAGCT TGCAAAAGCT ATTGTAAGGC TTAGCGGGAT GGGTCTTATT
GACGAGGAGG ATCCAGATGT TAGGCTTTCT AGGGCTGTGT TGGGTTTTGA GTGGTGGGAG
CAATGA
 
Protein sequence
MPEEKPTLLE SPSFPIESIN KASKSEKTGG GRPPYWEMVF WWTRKPLAGA RAIIAASLLS 
QDDYPESYNF LKDLFPCMDK RTPHSCNPNQ RLVEKLKGKK LLDPFAGFGS IPLEAARLGL
DVTAVELLPT AYVFLKAVLE YPKEYGKRLI EISGKEVESL GLRDAVRRFN GSAKIIETGR
YKVPLLIYDV ARWGRWVTEE LKKDPDFKEL YDEDVAVYIG TWEIKCPVCG RYTPLVGNWW
LARVKSKRGY ERLAWMQWRD GEIEVVDLNE ACKKTGRSSC NELLAKVQGK DEESGARVEW
NGQVYVVPSK NINAKLEEAQ CLYCRAKIDH RVKENRILKP VKNKKKEGEW YVKWALQRWN
SLLEDYLFGK VSLEELRNAP ARPRILVKVR VTDGDLEFEP ATREDTEKLW KALEKLKQKW
KEPDVPSEEL WKYTASGGGA LSIWTWGFDK FYKLFNPRQL LTLVKLVRLV REAGKSVEEE
KLKEGWSKED SFRYAEAITT YLAIALCKQI NYDSIVTSTE PVQKFIRETL AFRGIAMTWN
WVEELPVADV LGSYIRSLNS SVGSLSYLVS AVYGSPSRVK VLLDDATTLD MLVGEKFDLI
VTDPPYADDV PYTELSDFYY VWLKRALSDV SGGKLIPRFL PEAFFDEFGE EIKTQWETFA
TREVSENTER WKYFKLNISF SELLARAFAN VTRFLDEKGL LVTYYVAKKP EAWVALIDAL
WHINGMRVVV AYPVVTEAEE NVVARGRAAV MGGYVMGWRR REVEKPLDLS SEKEAVVTTV
SERLGNYLKA IDVKEGATAW VYAYLAALSY LTSFYPVKDG GVELDAEGVV SHAMALSFEA
MLRKAGVNLH DPAALAYLAL RVVEDEKGRV DSDVLSRVAL GLGIRDVELV KLGLVREVRS
GGPKVAKRKV FEVMAPRNET VDEVRRVLYP LRGKAPVLEC FRNLQLSVLA RTQVSCDQRA
REEAKELAKA IVRLSGMGLI DEEDPDVRLS RAVLGFEWWE Q