Gene Hore_01980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_01980 
Symbol 
ID7312517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp203209 
End bp204225 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content43% 
IMG OID643610621 
Productputative metalloendopeptidase, glycoprotease family 
Protein accessionYP_002507955 
Protein GI220931047 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.160038 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATA AAAATGTTAT TATTCTGGCA ATAGAGACAT CCTGTGACGA GACTGCAGCT 
GCAGTTGTGA AAAATGGGGT TGAAGTTCTG TCCAATATTG TGGCCTCTCA GGTAGACTGG
CACCGTAAAT ATGGTGGGGT TGTCCCTGAA ATAGCATCCC GGAAACACCT GGAATTTATT
AATCCTGTGG TCAAAGAAGC CCTTGACAGG GCGGGTCTGA CCTTCAAGGA TTTAGATGCA
GTAGCCTGTA CCTATGGACC AGGATTGGTA GGTGGCCTTT TAGTTGGGTT ATCGGCGGCA
AAGGCAATTG CTTATGCTAC CGGTAAACCA TTTATCGGAG TTAATCATAT TGCCGGACAT
ATTTACGCCA ATTTCATATC CCATAATGAT ATTGAACTCC CGGCTGCCTG TCTTACTATT
TCTGGTGGTC ATACGGATCT TTTATATTTT AAAAATCGCG GGGAGTATAA AATACTGGGG
CGGACCCGTG ATGATGCGGC CGGTGAAGCC TTTGATAAAA CAGCCCGGGT CCTGAAACTG
GGATATCCCG GGGGGCCGGC CATTGAAAAG GCAGCCAGGG ATGGAAATCC CCGTGCTGTA
GATTTTCCCC GCCCTTTTCT GGAAAAGGAT ACTTTTGATT TCAGTTTTAG TGGTTTAAAG
ACAGCTGTAA TCAACTATAT TCATAATAGA AAACAGCGTG GCCAGGAGAT AAATGTCAAT
GATGTGGCAG CAGGCTTTCA ACAGGCTGTT ATTGATGTTT TGGTCAGTAA AGTTATAAAA
GTGACAAAAA AATTACCCGT AAAGAGTGTT ATCCTGTCAG GTGGGGTGGC GGCCAATCGG
TCTTTAAGAT CACAGCTTCA AAAAGAATTA AATTACAGGA ATATCCCCCT TTATTACCCC
AGGCTCGAGT ACTGTACCGA TAATGCAGCA ATGATAGGAG TAGTGGCTTA TTACCAGTAC
CTGAAGGGTG ATTTTAACGA TTTAAGTTTA AATGCAGAAG CCAATTTAAA ATTATAA
 
Protein sequence
MKNKNVIILA IETSCDETAA AVVKNGVEVL SNIVASQVDW HRKYGGVVPE IASRKHLEFI 
NPVVKEALDR AGLTFKDLDA VACTYGPGLV GGLLVGLSAA KAIAYATGKP FIGVNHIAGH
IYANFISHND IELPAACLTI SGGHTDLLYF KNRGEYKILG RTRDDAAGEA FDKTARVLKL
GYPGGPAIEK AARDGNPRAV DFPRPFLEKD TFDFSFSGLK TAVINYIHNR KQRGQEINVN
DVAAGFQQAV IDVLVSKVIK VTKKLPVKSV ILSGGVAANR SLRSQLQKEL NYRNIPLYYP
RLEYCTDNAA MIGVVAYYQY LKGDFNDLSL NAEANLKL