Gene Information |
Locus tag | Tpen_0043 |
Symbol | |
ID | 4600578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 31935 |
End bp | 33326 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639772796 |
Product | peptidase U62, modulator of DNA gyrase |
Protein accession | YP_919456 |
Protein GI | 119718961 |
COG category | [R] General function prediction only |
COG ID | [COG0312] Predicted Zn-dependent proteases and their inactivated homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGAAG ACGACGTGAA GCTGGTACTC GGGAAAGCGG GTTCTCTGGG CGTACAGTTC GCCGAGGTCC GGTTCGAGGA GAGCACCCGC GAGATCATCT CCTACGTTAA CGGCAGGATA GCTTCTATGA GCAACCAGCG CCTGAGGGGA GTCGGCGTAA GGGTGCTCTA CAACGGCAAC TTCGGCTTCG CATCCACCTT CGACTTGAGC AGGGATGGGT TGCTCTCGGC GCTCGAAGAG GCCTTCAAGG CGGCTAGGGC TCTGGGGCCC GGCAACAAGA GGCTAGCGGA GGTGGAGGCC TCCAAGAGGC GGTACAGCAT AGAGGGCGTG AAGAGGCACC CGGCGCGCGC AGAGCTCGCG GAGAAGCTCG ACCTCGTCAA GAGGGCTTAC GAGACCCTAA AGTCCGGGGG CGCCTCCTCG GCTGTCGTGA GGTACGGGGC GTACTACGGG AGGGTGGAGG TCTATACGAG CGAGGGGCTG GAGGTGTCCT CGGAGAGGCT CGTTACGGGC GTCTCGGCTA CCGCCGTGCT TAGGGAGAAC GGCAGGGTGG GCGACGGGAC CGAGACTTAC GGAGCGTCGA AGGGCCTCGA GGCGTTCACG GGGGACCGCG CGCCCGAGGC TATAGCCGAG AAGGCACTCG AGGTCGCTAG GGCCGCGCTG AACGCGTCTA GGCCTCCCGC CGGGCTACAG ACCGTCATCA CCCGCCCCGA GCTCACGGGG GTGTTCGCCC ACGAGAGCTT CGGGCACTTG ACGGAAGGCG ACGGGGTGTT CGCCGGGTCC AGCCCGCTGG TCGGGAGGCT GGGAGAGGTA CTCGCAAGCG AGCAGGTAAC AATAGTCGAC TCCGGGTTCG ACGAGAGGGG AGGCTACGTG TTGCTCGCCG ACGACGAGGG GGTTCCCACG GAGCGCACGA TCCTAGTGGA GAAGGGGGTG CTCAAGGGCT ACCTGCACAG CAGGGAGAGC GCGGCACTCA CGGGGATGAA GCCTACAGGC AACGGGCGCG CGCAGAGCTT TGCCCACGAC GTGATCGTCC GCATGCGCAA CACGTTCTTC GAGGCCGGCG ACTGGACCGA GGAAGAGATA ATCAGGGAGA CTAGGCACGG CATACTGCTG GACAAGCCGG CGGGCGGGCA GGTGGAGGAG GACGGCACGT TCACGTTCAA CGCCAGGATA GGGTATATAG TGGAGAACGG GGAGCTGAAG CAGCCAGTCA GGGACGTCGT ACTGGCGGGG AACATACTCG AGATGCTGAA GTACGTAGAT GCCGCCGGTA AAAACGTAGA AATATCCACG AGCCCCTTCG GCGGTTGCGG CAAGTGGGGA CAGATGGTCC ACGTAGGCGA CGGTGGACCG ACGCTCAGAG TTTCCCGCTT GCTCGTGGGT GGTGAGAGAT GA
|
Protein sequence | MHEDDVKLVL GKAGSLGVQF AEVRFEESTR EIISYVNGRI ASMSNQRLRG VGVRVLYNGN FGFASTFDLS RDGLLSALEE AFKAARALGP GNKRLAEVEA SKRRYSIEGV KRHPARAELA EKLDLVKRAY ETLKSGGASS AVVRYGAYYG RVEVYTSEGL EVSSERLVTG VSATAVLREN GRVGDGTETY GASKGLEAFT GDRAPEAIAE KALEVARAAL NASRPPAGLQ TVITRPELTG VFAHESFGHL TEGDGVFAGS SPLVGRLGEV LASEQVTIVD SGFDERGGYV LLADDEGVPT ERTILVEKGV LKGYLHSRES AALTGMKPTG NGRAQSFAHD VIVRMRNTFF EAGDWTEEEI IRETRHGILL DKPAGGQVEE DGTFTFNARI GYIVENGELK QPVRDVVLAG NILEMLKYVD AAGKNVEIST SPFGGCGKWG QMVHVGDGGP TLRVSRLLVG GER
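
The length fields in this record can be cross-checked against the coordinates and sequences with a little arithmetic. The sketch below is illustrative only and not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence ends with a stop codon that is not counted in the protein length. Under those assumptions, 33326 - 31935 + 1 gives 1392 bp, and 1392 / 3 - 1 gives 463 aa, matching the values listed above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that does not appear in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


def ungroup(seq: str) -> str:
    """Remove the spaces that split the displayed sequence into blocks of 10."""
    return "".join(seq.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    s = ungroup(seq).upper()
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


if __name__ == "__main__":
    length = gene_length(31935, 33326)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

To check the GC content field, paste the full Gene sequence value into `gc_fraction` and compare the result, rounded to a whole percent, with the listed value.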