Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_01980 |
Symbol | |
ID | 7312517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 203209 |
End bp | 204225 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643610621 |
Product | putative metalloendopeptidase, glycoprotease family |
Protein accession | YP_002507955 |
Protein GI | 220931047 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.160038 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAATA AAAATGTTAT TATTCTGGCA ATAGAGACAT CCTGTGACGA GACTGCAGCT GCAGTTGTGA AAAATGGGGT TGAAGTTCTG TCCAATATTG TGGCCTCTCA GGTAGACTGG CACCGTAAAT ATGGTGGGGT TGTCCCTGAA ATAGCATCCC GGAAACACCT GGAATTTATT AATCCTGTGG TCAAAGAAGC CCTTGACAGG GCGGGTCTGA CCTTCAAGGA TTTAGATGCA GTAGCCTGTA CCTATGGACC AGGATTGGTA GGTGGCCTTT TAGTTGGGTT ATCGGCGGCA AAGGCAATTG CTTATGCTAC CGGTAAACCA TTTATCGGAG TTAATCATAT TGCCGGACAT ATTTACGCCA ATTTCATATC CCATAATGAT ATTGAACTCC CGGCTGCCTG TCTTACTATT TCTGGTGGTC ATACGGATCT TTTATATTTT AAAAATCGCG GGGAGTATAA AATACTGGGG CGGACCCGTG ATGATGCGGC CGGTGAAGCC TTTGATAAAA CAGCCCGGGT CCTGAAACTG GGATATCCCG GGGGGCCGGC CATTGAAAAG GCAGCCAGGG ATGGAAATCC CCGTGCTGTA GATTTTCCCC GCCCTTTTCT GGAAAAGGAT ACTTTTGATT TCAGTTTTAG TGGTTTAAAG ACAGCTGTAA TCAACTATAT TCATAATAGA AAACAGCGTG GCCAGGAGAT AAATGTCAAT GATGTGGCAG CAGGCTTTCA ACAGGCTGTT ATTGATGTTT TGGTCAGTAA AGTTATAAAA GTGACAAAAA AATTACCCGT AAAGAGTGTT ATCCTGTCAG GTGGGGTGGC GGCCAATCGG TCTTTAAGAT CACAGCTTCA AAAAGAATTA AATTACAGGA ATATCCCCCT TTATTACCCC AGGCTCGAGT ACTGTACCGA TAATGCAGCA ATGATAGGAG TAGTGGCTTA TTACCAGTAC CTGAAGGGTG ATTTTAACGA TTTAAGTTTA AATGCAGAAG CCAATTTAAA ATTATAA
|
Protein sequence | MKNKNVIILA IETSCDETAA AVVKNGVEVL SNIVASQVDW HRKYGGVVPE IASRKHLEFI NPVVKEALDR AGLTFKDLDA VACTYGPGLV GGLLVGLSAA KAIAYATGKP FIGVNHIAGH IYANFISHND IELPAACLTI SGGHTDLLYF KNRGEYKILG RTRDDAAGEA FDKTARVLKL GYPGGPAIEK AARDGNPRAV DFPRPFLEKD TFDFSFSGLK TAVINYIHNR KQRGQEINVN DVAAGFQQAV IDVLVSKVIK VTKKLPVKSV ILSGGVAANR SLRSQLQKEL NYRNIPLYYP RLEYCTDNAA MIGVVAYYQY LKGDFNDLSL NAEANLKL
|
| |