Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_05840 |
Symbol | |
ID | 7313556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 639099 |
End bp | 640319 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643611014 |
Product | peptidase U32 |
Protein accession | YP_002508336 |
Protein GI | 220931428 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0000158903 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAAG TTGAACTACT GGCTCCGGCA GGAAATCTAG AAAAATTAAA ACTGGCTATT TTATATGGTG CCGATGCTGT TTACTGTGGT GGTTTGCGTT TTGGACTTCG TTATGGGGCT GATAACTTTA CTCCTGAAGA ACTGGAAGAA GGAACCAGAT TTGTCCATAA CCACGGAGGC AAAATTTATA TAACGGTAAA TATATATCCC CATAACAATG ATCTGGCAAA GCTGCCAGAC TATTTACATA AACTGGAAGA AATCGGGGTT GATGGTTTAA TCGTGTCAGA TCCCGGTGTA ATTGAATTTA TAAACAGGGA AAAAATAGAG ATACCTCTCC ATCTCAGTAC TCAGGCCAAT ACTGTTAACT GGGCCAGTGC CAGCTTCTGG CATAAACAGG GTATTGAAAG AATAATTCTG GCCCGGGAAT TGAGCCGTGA GGAAATAAAG GAAATTAGAG ACAGAACCAG TATTTCCCTG GAAATGTTCG TTCATGGTTC AATGTGTATT TCCTATTCTG GTCGGTGTTT ATTGAGCAAT TATATGGTCG GGCGTGATGC CAACCGGGGG AAATGTGCCC ATCCCTGTCG CTGGAAGTAT CATCTGGTTG AGGAACAGCG ACCCGGGGAA TACTATCCAG TATATGAGAA TGAACAGGGA ACTTTTATTA TGAATTCAAA AGACCTCTGC CTTATTGAGT ATTTACCAGA CGTTATTTCA ACCGGGGTAG ATAGTTTAAA GATAGAGGGA CGAATGAAGA GCCTTCATTA TGTAGCTACT GTAACCAGGG TTTACCGTAA GGCCATAGAC TCTTATTATC ATGATCCTGA AAATTTTAAG GTTAAACCTG AGTGGCTTGA TGAGCTAAAG AAAGTAAGCC ACCGGGGTTA TACAACAGGG TTTTTTATCT CTCCTCCGAC TGGAGAAGAC CATAATTATA ATTCTTCAGT ATATATAAGG GATCATGACT TTATGGGGAT TATCAGGGAT TATGACAAAA AGAAAAATGA GGCTGTAGTT GAAGTCAGGC ATAAATTCTT TAAAGGTGAC AGGGTTGAGG TGATGGGACC GGATACAACT AATTTTGAAA CAACTGTAAA TTATATAATC AATGAAAACG GGGAAGAAGT GGATGAAGCT CCCCATCCCA GGGAGCTAAT ACGGATACCG GTAACCCATA AGGTTAAACC CTATTACCTT GTAAGGAGGA AAAAGTCATG A
|
Protein sequence | MKKVELLAPA GNLEKLKLAI LYGADAVYCG GLRFGLRYGA DNFTPEELEE GTRFVHNHGG KIYITVNIYP HNNDLAKLPD YLHKLEEIGV DGLIVSDPGV IEFINREKIE IPLHLSTQAN TVNWASASFW HKQGIERIIL ARELSREEIK EIRDRTSISL EMFVHGSMCI SYSGRCLLSN YMVGRDANRG KCAHPCRWKY HLVEEQRPGE YYPVYENEQG TFIMNSKDLC LIEYLPDVIS TGVDSLKIEG RMKSLHYVAT VTRVYRKAID SYYHDPENFK VKPEWLDELK KVSHRGYTTG FFISPPTGED HNYNSSVYIR DHDFMGIIRD YDKKKNEAVV EVRHKFFKGD RVEVMGPDTT NFETTVNYII NENGEEVDEA PHPRELIRIP VTHKVKPYYL VRRKKS
|
| |