Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0452 |
Symbol | |
ID | 4601855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 411602 |
End bp | 412612 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639773219 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_919864 |
Protein GI | 119719369 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTGTTCG GCATGCGCGC GTTAAAGGTG CTAGGCATAG AGTCTACGGC TCACACGTTT GGAGTCGGTA TAGCTACTTC TTCTGGAGAT ATTCTGGTCA ACGTTAATCA CACGTATGTT CCGCGACATG GAGGCATAAA GCCGACGGAG GCCGCCGAGC ACCATAGCAG GGTTGCTCCC AAAGTCCTCT CCGAGGCGCT CCAGAAAGCG GGTATCAGTG TTGAAGAGGT GGACGCAGTC GCGGTCGCGC TGGGCCCGGG TATGGGTCCC TGCCTAAGGG TCGGGGCTAC GCTTGCAAGG TACCTGGCCT TAAAGTTTGG TAAGCCGCTA GTACCGGTTA ACCACGCAAT AGCCCACTTA GAGATTTCTA GGCTGACTAC GGGGCTGGAG GACCCCGTGT TCGTCTACGT TGCCGGTGGA AACACGATGG TCACGACTTT CAACGAGGGT AGATACCGGG TATTCGGCGA GACTCTAGAT ATACCGCTCG GAAACTGCCT CGACACGTTT GCCAGAGAAG TGGGGCTGGG GTTTCCCGGG GTTCCGCGAG TAGAGGAGCT GGCGCTTAAA GGGCGGGAGT ACATACCCTT ACCGTACACG GTCAAGGGGC AGGACGTATC TTACTCGGGG TTGCTCACCC ACGCTCTCTC CCTGTACAGA TCCGGAAGAG CACGGTTAGA GGACGTCTGC TACAGCCTCG TCGAAACCGC CTATTCGATG CTGGTCGAAG TCGCGGAGAG AGCCTTAGCG CACACCGGTA AGAGCCAGCT CGTCCTCACG GGCGGCGTTG CCAGGAGCAG GATACTCCTG GAGAAGCTAA GGAGAATGGT CGAGGATAGA GGCGGAGTGC TCGGGGTTGT CCCGCCTGAG TACGCAGGAG ACAACGGCGC CATGATAGCG TATACAGGCG CGCTGGCTTT TTCCCACGGT GTCAGAGTGC CGGTTGAGGA AAGCCGTATA CAGCCCTACT GGAGGGTGGA CGAGGTCGTC ATTCCGTGGC GATCCAGGTG A
|
Protein sequence | MLFGMRALKV LGIESTAHTF GVGIATSSGD ILVNVNHTYV PRHGGIKPTE AAEHHSRVAP KVLSEALQKA GISVEEVDAV AVALGPGMGP CLRVGATLAR YLALKFGKPL VPVNHAIAHL EISRLTTGLE DPVFVYVAGG NTMVTTFNEG RYRVFGETLD IPLGNCLDTF AREVGLGFPG VPRVEELALK GREYIPLPYT VKGQDVSYSG LLTHALSLYR SGRARLEDVC YSLVETAYSM LVEVAERALA HTGKSQLVLT GGVARSRILL EKLRRMVEDR GGVLGVVPPE YAGDNGAMIA YTGALAFSHG VRVPVEESRI QPYWRVDEVV IPWRSR
|
| |