Gene Tpen_0931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0931 
Symbol 
ID4600754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp879501 
End bp881120 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content63% 
IMG OID639773710 
Productaminopeptidase 
Protein accessionYP_920335 
Protein GI119719840 
COG category[R] General function prediction only 
COG ID[COG4882] Predicted aminopeptidase, Iap family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0562565 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAATAC GGGAAATCGC GTACGAGCTG ACATCAGCGT TGCCGCGCGG CGACGTGGTA 
GCGGGGAGCA GGGAGGAGCA CGAATCCCTG GAAGCAGTTG CAGGGTACCT AGACGGGCTC
TCGGCGCAGG TGCACTACTT CCCATGCTCC ACGTGGGTCG AGAAGGGGGT CGAGCTGTCC
GGGGGAGGCT TACGCGTTAA GGCTGTCGCC ATGCCCGGCT CCCCCTCGGG CGAGCTCTAC
CTCGAGCCCG TGTACCTGGG CGAGAGGGTT CTCCCGGAGG AGTGGGAGGG GGTAGACCTC
GAGGGCAAGA TAGCTGTCGT CAAGATGTAC GGAAAGGTCG ACGAAGCGGC CTGGCAGTAC
GTACAGGCGG TGGCGAGGGG CGCCGAGGCT GTCGTCTTCG TAGACCCCTT CCCGGATAGA
AGGAGGAGAA TAGTCGTCAC CGCTACCCCC GATTACCGCT TCGGCCCAGG CACCCCGCCT
CCAGTGCCCG CCGTTGCTGT CTCGCTGGAG GACGGGTTGA GGCTTGCGAG GGTCTCGGGC
AGGGGGGAGA AGCTATACCT CCGCGTTGAG ACAGCCTTCG ACCACTCCGC GAGGACAGGC
GTCGTGGTGG CCGGGAACGC CGGGGGCCCG CTGTTCACGG CGCACGTCGA TAAGTGGCTC
TCGGGGTTCA CGGACAACGT TCTCGGGGTA GCGCTGGTCG TAGCTCTCTC CAGGGCCTTC
GGGGAGAGCG CGGGCTACGC GGTTTTCGGG GCCGAGGAGT ACGGCGCGCC CGGGCATTCC
CCGTGGTACT GGATATGGGG TTCCCGCTCC TACGCCGACT TCTTGGAGAG GAGGGGGGAG
CTCGACTCCC TGGGAGTAGT CGTCAACCTC GACGTGCTCG GGGGAGCTGG GATCACGGTG
TCAGCCTCGG GGCCCGACTT CCAGGGAGGG CTCGGGAAGG CTCTGGGAGA AGGCTACAGG
TACACCTCGG ACCAGGTGAT ATTCGACAGC TTCAGCTTCA CCATGAAGGG TGTAGCGGCG
GCGACTCTGC ACACCTTCCA GGACGTGCTC CCCGTGTACC ACACGGACCT CGACGAGCCC
CAGAGGGTTG ACTGGGAGAA GGTCTTAGAA GCCTACACGC TGGCAGAGAG GGCGGGCGAA
GCTTTCCTTC GGGAGGAGTG GGGGTTGCTG GAATACTCCC TCCTCAAGCG CGTAGCTCTG
GAGAAGCTCG AGAAGGTCTA CTTCCTCGAG GAGGCCAGGA GGGTCGCCGG GCTCCTAGAG
GGGGTGAACA TTAGGGACGA GCACGACGCG CGCCATCTCA GGCGGCTTTT CACGCGCCCA
CTGCACAGGG GGAGGTACGG CGAAGTGTTC TCAGAGGTCG AAGCCGTTTA CCCATACATC
CTCGACGCCG TGGAAGACCT CCTAGTGCTG AAGAGGGCTG TCGAAGAGGG CTCCCGGAGC
GTGCCGGCGA GGATTTTCTT CACCAGGATC GTACCTGGAT GGGAGGAGGT TTTAGTAGAC
CTAGAGCCCC CGGGGAAGAG GCACGGAGGA GTGTTGAAAG AGTACTACGA GTCGTTCCTG
CGCGCGGTTC GACGTAGCCT TGAATCGATG GAGGAAGCAC TTCTAGAGCT TAAGAGGTAG
 
Protein sequence
MKIREIAYEL TSALPRGDVV AGSREEHESL EAVAGYLDGL SAQVHYFPCS TWVEKGVELS 
GGGLRVKAVA MPGSPSGELY LEPVYLGERV LPEEWEGVDL EGKIAVVKMY GKVDEAAWQY
VQAVARGAEA VVFVDPFPDR RRRIVVTATP DYRFGPGTPP PVPAVAVSLE DGLRLARVSG
RGEKLYLRVE TAFDHSARTG VVVAGNAGGP LFTAHVDKWL SGFTDNVLGV ALVVALSRAF
GESAGYAVFG AEEYGAPGHS PWYWIWGSRS YADFLERRGE LDSLGVVVNL DVLGGAGITV
SASGPDFQGG LGKALGEGYR YTSDQVIFDS FSFTMKGVAA ATLHTFQDVL PVYHTDLDEP
QRVDWEKVLE AYTLAERAGE AFLREEWGLL EYSLLKRVAL EKLEKVYFLE EARRVAGLLE
GVNIRDEHDA RHLRRLFTRP LHRGRYGEVF SEVEAVYPYI LDAVEDLLVL KRAVEEGSRS
VPARIFFTRI VPGWEEVLVD LEPPGKRHGG VLKEYYESFL RAVRRSLESM EEALLELKR