Gene Information |
Locus tag | Tpen_0982 |
Symbol | |
ID | 4600456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 928490 |
End bp | 931495 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639773760 |
Product | hypothetical protein |
Protein accession | YP_920385 |
Protein GI | 119719890 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
| |
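The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the terminal stop codon while Protein Length does not; the record does not state either convention, but both are consistent with its numbers. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp < 3 or gene_len_bp % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Tpen_0982.
length = gene_length_bp(928490, 931495)
print(length, protein_length_aa(length))
```

With the values from this record the sketch gives 3006 bp and 1001 aa, which matches the Gene Length and Protein Length rows.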
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGAGG AAAAGCCTAC TCTACTAGAG TCGCCGAGCT TCCCGATAGA AAGTATTAAC AAGGCTTCGA AGTCTGAGAA GACGGGTGGA GGGAGGCCCC CTTACTGGGA GATGGTTTTC TGGTGGACCA GGAAGCCTCT TGCCGGGGCA AGAGCAATAA TAGCGGCTTC GCTACTATCA CAGGACGACT ACCCAGAAAG CTACAACTTC CTAAAAGACC TCTTCCCCTG TATGGACAAG AGGACTCCTC ATTCTTGCAA CCCTAACCAA AGACTCGTAG AGAAACTCAA GGGAAAGAAG CTCCTAGACC CCTTCGCCGG CTTTGGCTCA ATACCCCTAG AGGCTGCAAG GCTAGGCCTC GACGTAACCG CGGTCGAGCT ACTGCCGACA GCTTACGTGT TCCTCAAGGC TGTATTGGAA TATCCAAAGG AGTACGGCAA AAGGCTTATC GAGATAAGCG GGAAAGAGGT CGAGAGCCTC GGCTTAAGAG ACGCTGTCAG GCGGTTCAAC GGCTCGGCAA AGATAATAGA GACGGGCAGG TACAAGGTGC CGCTACTAAT ATACGACGTG GCCAGGTGGG GCAGGTGGGT AACCGAGGAG CTTAAAAAAG ACCCAGACTT CAAAGAACTC TACGACGAAG ACGTCGCAGT ATACATAGGT ACATGGGAAA TTAAATGCCC CGTCTGCGGG CGCTACACCC CACTTGTAGG CAACTGGTGG CTTGCCAGAG TAAAGTCGAA ACGCGGCTAC GAGAGGCTAG CCTGGATGCA ATGGAGAGAC GGAGAAATAG AGGTTGTAGA CCTCAACGAA GCATGCAAGA AGACCGGAAG AAGCTCATGC AACGAGCTTC TCGCCAAGGT ACAAGGCAAA GATGAAGAGT CCGGTGCTAG GGTAGAATGG AACGGCCAGG TATACGTTGT CCCCTCAAAG AATATCAACG CTAAGTTGGA AGAAGCTCAA TGCCTTTACT GTAGGGCAAA AATCGACCAC CGCGTAAAAG AAAACAGAAT ATTGAAACCT GTGAAAAATA AGAAAAAAGA AGGAGAATGG TACGTAAAAT GGGCTCTTCA ACGCTGGAAC AGCCTCCTAG AAGATTATCT CTTTGGGAAG GTAAGCTTGG AGGAACTGAG AAACGCGCCC GCCAGGCCCA GGATACTGGT CAAAGTCAGG GTCACAGACG GGGACCTCGA GTTCGAGCCT GCAACACGAG AGGACACAGA GAAGTTATGG AAAGCCCTCG AAAAACTGAA GCAAAAGTGG AAAGAACCGG ATGTACCATC AGAAGAGTTA TGGAAGTATA CTGCAAGTGG CGGAGGCGCG CTGAGCATAT GGACATGGGG CTTTGACAAA TTCTACAAAC TTTTCAATCC TAGGCAGTTG CTGACATTGG TTAAGCTCGT CAGGCTAGTG AGGGAGGCCG GGAAGAGCGT CGAGGAGGAA AAGCTGAAGG AGGGCTGGAG CAAGGAAGAC TCCTTTAGGT ACGCCGAAGC GATAACAACA TACCTGGCAA TAGCATTATG TAAACAAATA AATTATGACA GTATTGTAAC ATCTACAGAG CCTGTACAGA AATTCATCCG AGAAACATTA GCGTTTAGAG GCATTGCTAT GACATGGAAC TGGGTAGAGG AGTTACCTGT AGCAGACGTT CTGGGTTCAT ATATAAGGTC GTTAAATTCC AGTGTTGGTA GCTTATCTTA TCTTGTTTCT GCTGTGTATG GTAGCCCTAG CAGGGTTAAG GTTTTGCTCG ACGACGCTAC AACTCTGGAC ATGCTTGTGG GCGAGAAGTT CGACCTGATA 
GTCACGGATC CGCCTTACGC CGACGACGTG CCGTATACGG AGCTGAGCGA CTTTTACTAC GTGTGGCTCA AGAGAGCTTT AAGCGATGTC TCGGGCGGGA AGCTTATTCC CAGATTTCTG CCGGAAGCCT TCTTTGACGA GTTCGGAGAG GAGATAAAGA CTCAGTGGGA GACCTTTGCT ACGAGAGAGG TCAGCGAGAA TACTGAGCGT TGGAAATACT TTAAGCTGAA CATTTCCTTC AGTGAACTTC TGGCTAGGGC TTTTGCTAAC GTTACAAGGT TCCTTGATGA AAAGGGTCTG CTGGTAACGT ACTATGTTGC TAAGAAGCCG GAGGCTTGGG TAGCCTTGAT AGATGCGCTC TGGCATATCA ATGGTATGAG GGTTGTAGTT GCTTACCCCG TGGTTACGGA GGCTGAAGAA AATGTGGTAG CAAGGGGAAG GGCGGCAGTG ATGGGAGGCT ATGTTATGGG ATGGCGGAGG AGGGAGGTGG AGAAGCCCTT GGATCTCTCG AGTGAGAAAG AGGCTGTTGT GACTACCGTC TCAGAGCGTC TAGGTAATTA CTTAAAGGCC ATAGATGTGA AGGAAGGTGC TACGGCGTGG GTTTATGCTT ATCTTGCGGC TCTTAGCTAT TTAACTTCTT TCTACCCCGT AAAGGATGGG GGCGTGGAGC TAGATGCTGA GGGTGTTGTT AGCCATGCGA TGGCGTTGTC CTTTGAAGCT ATGTTGAGGA AAGCTGGTGT AAACCTGCAT GACCCGGCGG CGCTGGCATA CCTTGCGCTG AGAGTTGTGG AGGATGAGAA GGGTAGGGTT GACAGCGACG TGCTTTCTCG GGTGGCGTTA GGGCTTGGGA TTAGAGACGT GGAGCTCGTT AAACTGGGGC TTGTCAGGGA GGTTCGGAGC GGAGGGCCTA AGGTGGCTAA GCGTAAGGTG TTCGAGGTTA TGGCGCCCAG GAACGAGACG GTCGACGAGG TTAGGCGCGT GTTGTACCCG TTGCGGGGGA AAGCTCCTGT GCTGGAGTGT TTTAGGAATC TTCAGCTCTC GGTGCTAGCG AGAACCCAGG TATCCTGTGA TCAGCGGGCT AGGGAGGAGG CGAAGGAGCT TGCAAAAGCT ATTGTAAGGC TTAGCGGGAT GGGTCTTATT GACGAGGAGG ATCCAGATGT TAGGCTTTCT AGGGCTGTGT TGGGTTTTGA GTGGTGGGAG CAATGA
|
Protein sequence | MPEEKPTLLE SPSFPIESIN KASKSEKTGG GRPPYWEMVF WWTRKPLAGA RAIIAASLLS QDDYPESYNF LKDLFPCMDK RTPHSCNPNQ RLVEKLKGKK LLDPFAGFGS IPLEAARLGL DVTAVELLPT AYVFLKAVLE YPKEYGKRLI EISGKEVESL GLRDAVRRFN GSAKIIETGR YKVPLLIYDV ARWGRWVTEE LKKDPDFKEL YDEDVAVYIG TWEIKCPVCG RYTPLVGNWW LARVKSKRGY ERLAWMQWRD GEIEVVDLNE ACKKTGRSSC NELLAKVQGK DEESGARVEW NGQVYVVPSK NINAKLEEAQ CLYCRAKIDH RVKENRILKP VKNKKKEGEW YVKWALQRWN SLLEDYLFGK VSLEELRNAP ARPRILVKVR VTDGDLEFEP ATREDTEKLW KALEKLKQKW KEPDVPSEEL WKYTASGGGA LSIWTWGFDK FYKLFNPRQL LTLVKLVRLV REAGKSVEEE KLKEGWSKED SFRYAEAITT YLAIALCKQI NYDSIVTSTE PVQKFIRETL AFRGIAMTWN WVEELPVADV LGSYIRSLNS SVGSLSYLVS AVYGSPSRVK VLLDDATTLD MLVGEKFDLI VTDPPYADDV PYTELSDFYY VWLKRALSDV SGGKLIPRFL PEAFFDEFGE EIKTQWETFA TREVSENTER WKYFKLNISF SELLARAFAN VTRFLDEKGL LVTYYVAKKP EAWVALIDAL WHINGMRVVV AYPVVTEAEE NVVARGRAAV MGGYVMGWRR REVEKPLDLS SEKEAVVTTV SERLGNYLKA IDVKEGATAW VYAYLAALSY LTSFYPVKDG GVELDAEGVV SHAMALSFEA MLRKAGVNLH DPAALAYLAL RVVEDEKGRV DSDVLSRVAL GLGIRDVELV KLGLVREVRS GGPKVAKRKV FEVMAPRNET VDEVRRVLYP LRGKAPVLEC FRNLQLSVLA RTQVSCDQRA REEAKELAKA IVRLSGMGLI DEEDPDVRLS RAVLGFEWWE Q
|
| |
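The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling, that joins the blocks, computes a GC fraction, and translates a CDS up to the first stop codon. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (to my knowledge the two tables differ only in which start codons are permitted); all function names are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: translation table 11 uses these same amino-acid assignments.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks between the 10-character blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an unambiguous DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon is ignored; ambiguous bases raise KeyError.
    """
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence above.
prefix = join_blocks("ATGCCGGAGG AAAAGCCTAC")
print(translate(prefix))  # MPEEKP
```

The translated prefix agrees with the first six residues of the protein sequence in this record. Running the same functions on the full gene sequence would allow the GC content and protein sequence rows to be cross-checked, though that check is not performed here.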