Gene Information |
Locus tag | Tpen_0931 |
Symbol | |
ID | 4600754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 879501 |
End bp | 881120 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639773710 |
Product | aminopeptidase |
Protein accession | YP_920335 |
Protein GI | 119719840 |
COG category | [R] General function prediction only |
COG ID | [COG4882] Predicted aminopeptidase, Iap family |
TIGRFAM ID | |
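
The coordinate and length fields above are consistent with one another, and the relationship can be sketched in a few lines of Python. This is only an illustrative check, not part of the record itself. It assumes that `Start bp` and `End bp` are 1-based inclusive coordinates and that the gene length counts the terminal stop codon; both assumptions agree with the values listed here. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS.

    Assumes the CDS length includes one terminal stop codon,
    which is not translated into an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Tpen_0931.
start_bp, end_bp = 879501, 881120

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                   # 1620
print(protein_length(length_bp))   # 539
```

Both printed values match the `Gene Length` and `Protein Length` fields of the record.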
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0562565 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAATAC GGGAAATCGC GTACGAGCTG ACATCAGCGT TGCCGCGCGG CGACGTGGTA GCGGGGAGCA GGGAGGAGCA CGAATCCCTG GAAGCAGTTG CAGGGTACCT AGACGGGCTC TCGGCGCAGG TGCACTACTT CCCATGCTCC ACGTGGGTCG AGAAGGGGGT CGAGCTGTCC GGGGGAGGCT TACGCGTTAA GGCTGTCGCC ATGCCCGGCT CCCCCTCGGG CGAGCTCTAC CTCGAGCCCG TGTACCTGGG CGAGAGGGTT CTCCCGGAGG AGTGGGAGGG GGTAGACCTC GAGGGCAAGA TAGCTGTCGT CAAGATGTAC GGAAAGGTCG ACGAAGCGGC CTGGCAGTAC GTACAGGCGG TGGCGAGGGG CGCCGAGGCT GTCGTCTTCG TAGACCCCTT CCCGGATAGA AGGAGGAGAA TAGTCGTCAC CGCTACCCCC GATTACCGCT TCGGCCCAGG CACCCCGCCT CCAGTGCCCG CCGTTGCTGT CTCGCTGGAG GACGGGTTGA GGCTTGCGAG GGTCTCGGGC AGGGGGGAGA AGCTATACCT CCGCGTTGAG ACAGCCTTCG ACCACTCCGC GAGGACAGGC GTCGTGGTGG CCGGGAACGC CGGGGGCCCG CTGTTCACGG CGCACGTCGA TAAGTGGCTC TCGGGGTTCA CGGACAACGT TCTCGGGGTA GCGCTGGTCG TAGCTCTCTC CAGGGCCTTC GGGGAGAGCG CGGGCTACGC GGTTTTCGGG GCCGAGGAGT ACGGCGCGCC CGGGCATTCC CCGTGGTACT GGATATGGGG TTCCCGCTCC TACGCCGACT TCTTGGAGAG GAGGGGGGAG CTCGACTCCC TGGGAGTAGT CGTCAACCTC GACGTGCTCG GGGGAGCTGG GATCACGGTG TCAGCCTCGG GGCCCGACTT CCAGGGAGGG CTCGGGAAGG CTCTGGGAGA AGGCTACAGG TACACCTCGG ACCAGGTGAT ATTCGACAGC TTCAGCTTCA CCATGAAGGG TGTAGCGGCG GCGACTCTGC ACACCTTCCA GGACGTGCTC CCCGTGTACC ACACGGACCT CGACGAGCCC CAGAGGGTTG ACTGGGAGAA GGTCTTAGAA GCCTACACGC TGGCAGAGAG GGCGGGCGAA GCTTTCCTTC GGGAGGAGTG GGGGTTGCTG GAATACTCCC TCCTCAAGCG CGTAGCTCTG GAGAAGCTCG AGAAGGTCTA CTTCCTCGAG GAGGCCAGGA GGGTCGCCGG GCTCCTAGAG GGGGTGAACA TTAGGGACGA GCACGACGCG CGCCATCTCA GGCGGCTTTT CACGCGCCCA CTGCACAGGG GGAGGTACGG CGAAGTGTTC TCAGAGGTCG AAGCCGTTTA CCCATACATC CTCGACGCCG TGGAAGACCT CCTAGTGCTG AAGAGGGCTG TCGAAGAGGG CTCCCGGAGC GTGCCGGCGA GGATTTTCTT CACCAGGATC GTACCTGGAT GGGAGGAGGT TTTAGTAGAC CTAGAGCCCC CGGGGAAGAG GCACGGAGGA GTGTTGAAAG AGTACTACGA GTCGTTCCTG CGCGCGGTTC GACGTAGCCT TGAATCGATG GAGGAAGCAC TTCTAGAGCT TAAGAGGTAG
Protein sequence | MKIREIAYEL TSALPRGDVV AGSREEHESL EAVAGYLDGL SAQVHYFPCS TWVEKGVELS GGGLRVKAVA MPGSPSGELY LEPVYLGERV LPEEWEGVDL EGKIAVVKMY GKVDEAAWQY VQAVARGAEA VVFVDPFPDR RRRIVVTATP DYRFGPGTPP PVPAVAVSLE DGLRLARVSG RGEKLYLRVE TAFDHSARTG VVVAGNAGGP LFTAHVDKWL SGFTDNVLGV ALVVALSRAF GESAGYAVFG AEEYGAPGHS PWYWIWGSRS YADFLERRGE LDSLGVVVNL DVLGGAGITV SASGPDFQGG LGKALGEGYR YTSDQVIFDS FSFTMKGVAA ATLHTFQDVL PVYHTDLDEP QRVDWEKVLE AYTLAERAGE AFLREEWGLL EYSLLKRVAL EKLEKVYFLE EARRVAGLLE GVNIRDEHDA RHLRRLFTRP LHRGRYGEVF SEVEAVYPYI LDAVEDLLVL KRAVEEGSRS VPARIFFTRI VPGWEEVLVD LEPPGKRHGG VLKEYYESFL RAVRRSLESM EEALLELKR