Gene Information |
Locus tag | Hore_14790 |
Symbol | |
ID | 7312672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 1573374 |
End bp | 1575377 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643611920 |
Product | peptidase U32 |
Protein accession | YP_002509223 |
Protein GI | 220932315 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
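The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table for Hore_14790.
length_bp = gene_length(1573374, 1575377)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2004 667
```

The result agrees with the reported Gene Length (2004 bp) and Protein Length (667 aa).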
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGATAA ATAACGTGGA ACTACTGGCT CCTGCCGGTA AGTGGGAGGC CCTGGAAACG GTAATTGAAG CCGGGGCAGA TGCTGTTTAC CTGGGTGGTA AAAAGCATAA TATGAGGCTT CTCAGGACCG GTTTCAACTT CAGTAATGAA GAATTAAAAG AGGCTGTTAA CCTGGCCCAC AGCCGGGGAG TAAAGATATA TGTAACTGTT AATAATTTAC AGGGCGATGA GGAACTTGAT GAGATAGCAC CATATCTTCA GTATCTGGCT GAAATTGAGG TTGATGCTTT TATAGTTCAG GACCTGGGAT TACTCTATCT TATTAATGAG CTGGGGCTTG AGGTTCCTGT ACACTCCAGT GTAATGATGA ATGTTCATAA CATAGATATG GCCCGGTATT TACATGACCA TGGAGTCAGA AGGTTTATAG TTAGCCGGGA ATTATCCTTT GACCAGGTGA GGCATATGAC CCGGGAAACC GGCTTTGAAT ATGAATATTT TATACATGGA GATATGTGTT TTTCTCAGAG TGGGCAGTGT TTATTGAGTG GTATGGTTTT TGGGAACAGC AGCAACAGGG GGCGTTGTCT AAAGCCCTGT CGCTGGCCCT ATAGCCTGGC CCGGTATAAA AACGGTTATT TTGAAGAGGG GGTAGAGGTT AAGGCTGACG GTCCTTACTT CCTGGCGGTA AAGGATATGT GTGTTTTCCG GCATATACCC CAGTTGATCA GGTCGGGAAT TGTCTCCTTT AAGATAGAGG GCAGAATGAA GCCGGCCAGT CAGCTTAAGA GAATTGTATC TGCTTACCGG ACAGCTATCG ACAGGTACCT TGATGATCCG GTGGGTTATA CGGTTGATGA GGATATTTAC CGTGACCTCT ATGATCACCG GGTCCGGGAT TTCAGTACCT GTTTTGCCCT TAAGAATCCG GGGTCAGATG GAATCGGTTA TACCGGGGAG AGGGAACCCA AGTTTTTTAG TGAAGCCCGG GAGGAGAAAA AATTAAATCC TGACATGGAT CTTGATATGG ATCTTAAACT GGATATAGAT AACCCCACGG ATTATAGTTC AGGGGCAAAG GAGGTTGTGG CCCTTCCTTT GCTATCGATT AAGGTTAACG GTCTTGAGGA GGCTGAAGCT GCTTTAAAGT CAGGGGTAGA CCGCATCTAT ATAGGTGGTG AAACTCCGTC CTGGAAACCT CCCTGTGGCC AGGAGGTTAT AAACAAGGTC CTTGATAAAG CCGAAAAAGC AGGGGTGGAA GTGGTAGTAA CTACCCCCAG AATTACTTTC TCTGATGAAA TGGAAGAATA TATTGAGTTG TTGAAGGGAT TAGATCTGGA GCAAACCGGT GGAGTTATGG CCGGTAACCT GGGGATGATC AGAGCCCTGA ATGAGTATTT TGATACCAGG GTTATGGCAG ATTTTGGAGT TAATGCGTTT AACACCAGGG CTCTCAGTAT TTTAAAAGAA TCCGGGGTGG TTCAGGTTAC AAATCAACTG GAATCTTCCC TGAAGCAGAT TTTAAAGATG GCTTCAGGTA CAGATATGGA CCTCGAGCTT ATCGGTCATG GCCATCTTCC CTTTATGGTA TCTGACCACT GTCTCCTTTC TGAACTTCTG GAAGGGAAGA CTCCGGAGGA TCAGTGTTCT GCCCCCTGTC GGGGTGAGAG ATATGGACTT GTCAATGATA AAAAGAGGGT TTACCCTGTT ATGACCGATC AGTATTGCCG GACCCATCTT TATCTCAGTA AAGAGCTTGC TCTCCTCCCA TTTCTGGATA GAATATTATT ATCAGGTATA 
AAGAGTTTCA GGATTGAGGC CGGACTTTAT AATGCTGCAA AGGTAGAGGC TGTAGTTGAT ATCTATAAAC GGGCCTTTAT AGCTATTAAA AATGGTCGCT GGTCACAGGA AAAAACATCG TTATATAATG AGCTTAAGGG GTTAAGTGAT ACCGGTTATA CCCTGGCAGC CTACGAGAAA GGGGTCCTGG GGACCGGTAC TTAA
|
Protein sequence | MMINNVELLA PAGKWEALET VIEAGADAVY LGGKKHNMRL LRTGFNFSNE ELKEAVNLAH SRGVKIYVTV NNLQGDEELD EIAPYLQYLA EIEVDAFIVQ DLGLLYLINE LGLEVPVHSS VMMNVHNIDM ARYLHDHGVR RFIVSRELSF DQVRHMTRET GFEYEYFIHG DMCFSQSGQC LLSGMVFGNS SNRGRCLKPC RWPYSLARYK NGYFEEGVEV KADGPYFLAV KDMCVFRHIP QLIRSGIVSF KIEGRMKPAS QLKRIVSAYR TAIDRYLDDP VGYTVDEDIY RDLYDHRVRD FSTCFALKNP GSDGIGYTGE REPKFFSEAR EEKKLNPDMD LDMDLKLDID NPTDYSSGAK EVVALPLLSI KVNGLEEAEA ALKSGVDRIY IGGETPSWKP PCGQEVINKV LDKAEKAGVE VVVTTPRITF SDEMEEYIEL LKGLDLEQTG GVMAGNLGMI RALNEYFDTR VMADFGVNAF NTRALSILKE SGVVQVTNQL ESSLKQILKM ASGTDMDLEL IGHGHLPFMV SDHCLLSELL EGKTPEDQCS APCRGERYGL VNDKKRVYPV MTDQYCRTHL YLSKELALLP FLDRILLSGI KSFRIEAGLY NAAKVEAVVD IYKRAFIAIK NGRWSQEKTS LYNELKGLSD TGYTLAAYEK GVLGTGT
|
| |
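The Gene sequence field is printed in blocks of 10 bases separated by spaces, and the record lists translation table 11. The sketch below shows one way to strip that formatting, translate the sequence, and compute GC content. It is an illustration under stated assumptions rather than the database's own method: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is the standard set of amino acid assignments, which table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch does not model). Although the gene is on the minus strand, the listed sequence begins with ATG and its first codons match the listed protein, so it appears to be given in coding orientation and no reverse complement is applied here.

```python
# Standard codon assignments in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 until the first stop codon or the end of the sequence."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the Gene sequence field.
first_blocks = "ATGATGATAA ATAACGTGGA ACTACTGGCT"
print(translate(first_blocks))  # MMINNVELLA
```

The translated prefix matches the first block of the Protein sequence field. Passing the full Gene sequence field to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.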