Gene Tpen_1284 details


Gene Information

Locus tag: Tpen_1284
Symbol:
ID: 4600592
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1222478
End bp: 1225090
Gene Length: 2613 bp
Protein Length: 870 aa
Translation table: 11
GC content: 60%
IMG OID: 639774060
Product: ankyrin
Protein accession: YP_920685
Protein GI: 119720190
COG category: [R] General function prediction only
COG ID: [COG0666] FOG: Ankyrin repeat
TIGRFAM ID:
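
The coordinate and length fields above are related by simple arithmetic, which the following Python sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if includes_stop else codons


length = gene_length(1222478, 1225090)
print(length, protein_length(length))
```

Under these assumptions the sketch gives 2613 bp and 870 aa for the coordinates in this record, in agreement with the Gene Length and Protein Length fields.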


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.226002
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCGGGCC CGGAGGGGAA CCGCGCGGGA ACAGTCGAAG AACTCTTCAG GGCAGTCTGC 
TCGGGGGACG CGAAGAGGGT GAAGGCCCTC CTAGAGGGGG GCGTCGACCC GAACGCCGCG
GGCCCCGCGG GGCTTGCTCC GCTACACTGC GCCGCCATCT TCGGGCACGC CGAGGCCGCA
AGGCTACTCC TGGAGCGAGG CGCAGACCCA AATGTGAAGG ACAAGATCAC ATGGGATGTA
CTCAGCTCCG AGCTGGGTCG CAAAGGGAGG ACACCGCTAC ACTGGGCCGC CGTGTACGGG
CACTTCGTCG TAGCCGAGGT ACTGCTCGAC CGCGGCGCAG ACCCGAACGC CACCGACGAG
GAAGGCAATA CTCCATTGCA CCTGGCGGCG CTACTAGGGT TTGCAGACAT AGCCAGGCTA
CTGCTGGATA GAGGCGCAGA CGTGAACGCG AAGAACAGCT CCGGGAAGAC ACCGCTTCAC
TACGCCGCTG AGCAAGGAAG CGCAGAAGTC GCAAAGTTGC TACTCGAGCG GGGCGCCGAT
CCGGGAGCTA CGGACACGTA TGGCAATACA CCGCTACACC TGGCTGTTAG GAGCATAGAA
GTCAGTAAGC TACTCTTAGA GAGAGGGGCT GACGTAAACG CCAGGAACAA CGAGGGACGA
ACGCCGCTAC ACCGCGCAGC TATGGAGGGA AGCGCCGAGG TGGTGAAGTT TTTGCTCGAG
CGCGGCGCAG ACCCGTGCGC TGTGGACGCC TTTGGAAATA CCCCCCTACA CTTAGCCTTT
AAAAACATGG AGGTTGCCAA GCTACTCTTA GAGAAAGGTG CAGACCCCAA CGCGAAGAAT
AGCTCAGGGA TGACGCCGCT ACACTTCGCC GCAGGACTGG GAAAAGTCGA AGTCGTCGAG
CTACTCCTAG AGCACGGAGC AGATGTGGAC GCTAAGGATA ACGATGGGCT TACACCGTTA
GCCTATGCGG CTCACCGCCA GGATATGTAT ATACGCGCAG ATGCGCTCAC AGCGTTGAAG
GTCGTCGGGC TACTCCTGGA GCGGGGCGCA GACCCCAGCC TCATCGGCTC GGATAGCTAC
ACGCTACTCC ACAAAGCGGC CTTTTGGTGC TACGCAAAGG TTGTCAGGCT CCTCCTAGAG
AAAGGTCTGG ATGCTAACGC AAAGGACGAG TACGGAAGAA CTCCGCTACA CTGGGCCGCG
GAGCGGGGAT GTCCGGAGGT AGTAGAACTC CTGCTCGAGC ACGGGGCAGA CCCCAACGCA
AGGAATGACT CCGGGATGAC ACCGCTACAC CTAGCGGCGA CTGTGAAAGA CACTGAGGCT
GCGAAGCTTC TGCTAGAGCA TGGGGCGGAT CCGAACGCCG AGGAGTATGG AGGCTCAACT
CCGCTGGCTA TTATCTCCTC CTTTTTCTGT TACGATGATA ACATTACGGA CTGGCTTACA
GGCGAGCACA AAGCACTCGA GTTTATCAGG CTACTGTTGG AGCACGGAGC AGAGCCTGGC
AACGGTCTTC ACGCGGCGGT AAGGTGCGGG CGCCCGGAGT GCGTTAAGAA GTTGCTTGAG
TGGGGCGTGA ACCCCAATAC CAGGGACAAC GACGGCAACA CGTTGCTACA TGCCGCCGCC
TGGAATGGGG ACGTGGAGGT TATCGAGATT CTGCTGGAGA GGGGCGCAGA CATTAATGCC
AGGAACAAGT TCGGGGAAAC ACCGCTACAC GTAGCCGCGG AGCGAGGAAA CTTCGAGGCA
GTGAAGTTGC TACTAGAGAG GGGCGCGGAG GTAAACGCGG ACGCGCTCTG CTACGCGGCA
AGGAGTTGTC GCTGGGACGT CTTCACGCTG CTCCTCGAGA GAGGCGCAGA CATTAACGCG
AGGGACTGGT TTGACAGGAC TCCGCTACAC GGCGCCGCCG GGTGCAGGGA CGCCGGGATT
GCGAGGTTCC TCATCGAGAG AGGGGCAGAC ATTAACGCGA GAACCAAGGA CGGAGAAACA
CCGCTACATA AAGCCACGTC CAGCGGGAAC GTCGAGGCGG TAAGACTGCT GTTGGAGCAC
GGCGCCGACG TAGACGCCAG GAACGATTTC GGAGGGACAC CACTGCACCA CGCCGCCGCC
CGGGGGCACC TGGAGATAGT CAGGCTCCTG CTCAAGCACG GCGCAGACTC CAACGCAAGG
AACAGTCACG GGGAGACACC GCTACACTAC GTAGCTGAAC ACGCAGATAT GTGTAGCAAG
AACGCATGGG ATAATTGCTT GAGGATCGCC GAGCTTCTCT TGATACACGG AGCAGACGTA
AACGCCAGGG ACTCCCGGGA CCAGACGCCG CTCCACATAG CCGTGTTTTT CGGCTCCCGC
GAGCACCTCG AAGTCGCGAG GTGGCTCCTG GAGCACGGGG CGGACCCCAA CGCGAGAGAC
TGGGAAGGCA ACACCCCACT ACACTACGTA ATTGAGCATT CCTTCTGGAG GGAACGGCGA
GAAGCTATCG AGCTACTATT AGAGCACGGG GCGGATCCGA GTATAAGGAA CTCCGAGGGG
CTGAGCCCGC TACAGCTCGC TGTGATCAAA GGAGACACCG ACGCTTTCGC GCTGCTCTCA
GGGTACATGT TCAGGTTCAG GAAAACTCGG TAA
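
The sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any analysis. A minimal Python sketch of computing the GC fraction from text in this layout follows; `gc_content` is a hypothetical helper, and the sketch does not claim to reproduce how the GC content field of this record was derived.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace and case."""
    dna = "".join(seq.split()).upper()  # drop the block spacing and line breaks
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First two ten-base blocks of the gene sequence, as an illustration.
print(gc_content("GTGCCGGGCC CGGAGGGGAA"))
```

Passing the full gene sequence as one string (line breaks included) gives a value that can be compared with the GC content field.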
 
Protein sequence
MPGPEGNRAG TVEELFRAVC SGDAKRVKAL LEGGVDPNAA GPAGLAPLHC AAIFGHAEAA 
RLLLERGADP NVKDKITWDV LSSELGRKGR TPLHWAAVYG HFVVAEVLLD RGADPNATDE
EGNTPLHLAA LLGFADIARL LLDRGADVNA KNSSGKTPLH YAAEQGSAEV AKLLLERGAD
PGATDTYGNT PLHLAVRSIE VSKLLLERGA DVNARNNEGR TPLHRAAMEG SAEVVKFLLE
RGADPCAVDA FGNTPLHLAF KNMEVAKLLL EKGADPNAKN SSGMTPLHFA AGLGKVEVVE
LLLEHGADVD AKDNDGLTPL AYAAHRQDMY IRADALTALK VVGLLLERGA DPSLIGSDSY
TLLHKAAFWC YAKVVRLLLE KGLDANAKDE YGRTPLHWAA ERGCPEVVEL LLEHGADPNA
RNDSGMTPLH LAATVKDTEA AKLLLEHGAD PNAEEYGGST PLAIISSFFC YDDNITDWLT
GEHKALEFIR LLLEHGAEPG NGLHAAVRCG RPECVKKLLE WGVNPNTRDN DGNTLLHAAA
WNGDVEVIEI LLERGADINA RNKFGETPLH VAAERGNFEA VKLLLERGAE VNADALCYAA
RSCRWDVFTL LLERGADINA RDWFDRTPLH GAAGCRDAGI ARFLIERGAD INARTKDGET
PLHKATSSGN VEAVRLLLEH GADVDARNDF GGTPLHHAAA RGHLEIVRLL LKHGADSNAR
NSHGETPLHY VAEHADMCSK NAWDNCLRIA ELLLIHGADV NARDSRDQTP LHIAVFFGSR
EHLEVARWLL EHGADPNARD WEGNTPLHYV IEHSFWRERR EAIELLLEHG ADPSIRNSEG
LSPLQLAVIK GDTDAFALLS GYMFRFRKTR
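
The protein sequence can be checked against the gene sequence by translating it codon by codon. The Python sketch below uses the standard codon assignments, which translation table 11 shares for elongation. It assumes that an ATG, GTG, or TTG start codon is read as methionine; table 11 permits further start codons that are omitted here. `translate_cds` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Assumption: only the three most common start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, ignoring whitespace, up to the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence.
print(translate_cds("GTGCCGGGCC CGGAGGGGAA CCGCGCGGGA"))
```

For these first thirty bases the sketch yields MPGPEGNRAG, the first block of the protein sequence; note that the initial GTG is read as M rather than V.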