Gene Hore_14920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_14920 
Symbol 
ID7313083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1589869 
End bp1591545 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content46% 
IMG OID643611933 
ProductSporulation protease LonB 
Protein accessionYP_002509236 
Protein GI220932328 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR00764] lon-related putative ATP-dependent protease
[TIGR02902] ATP-dependent protease LonB 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value0.989264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATCAAA ATCTCTTAAA TGTTTTTACG ATTATTCAGT TTTTCTTTGC GGTGGTTATC 
GGTCTCTACT TCTGGAATAT GCTTCGTAAC CAGCAGGGTA CTAAACGTGC TGTGGTTAAG
GAATCAAGAA AGGAATTAAA TAAATTGAGG GAGATGAGAA GGATATCCCT CACTGAACCC
CTGGCCGAAA AAACAAGGCC AACCGGGTTT GAGGACATAA TAGGACAAAA AGAGGGGATT
GAGGCCCTGA AAGCAGCACT GTGTGGGCCC AATCCCCAGC ATGTTATATT ATATGGACCG
CCAGGTGTCG GTAAAACCGC AGCAGCCAGG CTGGTCCTGG AAACGGCCAA ACGAAATCCC
CTGTCACCCT TCAAACGTGA TGCTAAATTT ATTGAAATAG ACGGGACTAC GGCCCGTTTT
GATGAAAGAA GTATTGCCGA TCCCCTGATA GGTTCGGTCC ATGACCCGAT TTATCAGGGA
GCGGGTCCCA TGGGGATGGC CGGTATACCC CAGCCGAAAC CGGGGGCAGT TACCAAGGCC
CACGGAGGAG TTCTTTTTAT AGATGAAATC GGGGAACTAC ATCCCATTCA GATGAACAAG
CTTTTAAAGG TACTTGAAGA TAGAAAGGTT TTTCTTGATA GTGCCTATTA TAGTGAAGAG
GATACCAATA TACCTCAGTA TATCCATGAT ATCTTTCAAA ACGGTCTTCC GGCTGATTTC
AGGTTGATTG GTGCGACAAC AAGACAGCCT GATGAAATTC CGCCGGCTAT CAGGTCCAGG
TGTCTTGAGA TCTTCTTCAG GGAACTGAGT ACAGACGAAA TAAGGAAGAT TGCAGTGCGG
GCTGTTAAAA AGATCAGGTT TAAAATTGAG GATAAGGCCC TTGACCTTAT TGAGAAATAT
GCCAAGAACG GGCGGGAAAC GGTCAATATG GTTCAGCTGG CCGGTGGTAT TGCTATTGCC
GGGCACCGTC AGGAAATAAA GGCTCATGAT ATTGAAAAGG TTTTAAATAA CGGCCAGTAT
TCTCCCCGAC TTATTAAAAA GATTCATGGC TTTCCCCAGA TAGGCGTTGT AAACGGCCTG
GCCGTAAGGG GTCCCAATAT CGGGATGTTA CTGGAGATTG AGGTTGCCGC CATAAAAAAA
AGGTCTTCTC CGGGTCAGAT TAAGATAACA GGTGTTATTG AGGAAGAGGA GATAGGGTCA
ATGGGGCATA CGGTCCGCCG GAAAAGTATG GCCCGGGAGT CTGCCGAAAA TGCCCTGACC
GTGTTACGCC GGATGATGCC GGTTGATCCC CATAATTATG ATATCCATGT TAATTTCCCG
GGAGGGATTC CGGTCGATGG CCCTTCAGCC GGGGTTGCCA TGTCGGTGGC CATTTATTCA
GCTATAACCA AAAAACCAGT TGATAACCAT ATTGCCATGA CCGGTGAGGT TTCAATCAGG
GGTCTTGTTA AACCAGTTGG TGGAATCGCT GCCAAAATTG AGGCTGCCAG CAAAGCTGGA
GCCAGAAAAG TGTTAATACC CAGGGAAAAC TGGCAGAACC TCTTTGAACT CAGGGATGAT
ATTGAAATTA TTCCGATAGA GACCCTGGAA GAGGCTATTG AGAAATCGGT GGCTATAAAA
GAAGATGAGA AAATAAAATT AATTAAAGCA GATAGTTTAA TGACTGTTCC CCAATAA
 
Protein sequence
MDQNLLNVFT IIQFFFAVVI GLYFWNMLRN QQGTKRAVVK ESRKELNKLR EMRRISLTEP 
LAEKTRPTGF EDIIGQKEGI EALKAALCGP NPQHVILYGP PGVGKTAAAR LVLETAKRNP
LSPFKRDAKF IEIDGTTARF DERSIADPLI GSVHDPIYQG AGPMGMAGIP QPKPGAVTKA
HGGVLFIDEI GELHPIQMNK LLKVLEDRKV FLDSAYYSEE DTNIPQYIHD IFQNGLPADF
RLIGATTRQP DEIPPAIRSR CLEIFFRELS TDEIRKIAVR AVKKIRFKIE DKALDLIEKY
AKNGRETVNM VQLAGGIAIA GHRQEIKAHD IEKVLNNGQY SPRLIKKIHG FPQIGVVNGL
AVRGPNIGML LEIEVAAIKK RSSPGQIKIT GVIEEEEIGS MGHTVRRKSM ARESAENALT
VLRRMMPVDP HNYDIHVNFP GGIPVDGPSA GVAMSVAIYS AITKKPVDNH IAMTGEVSIR
GLVKPVGGIA AKIEAASKAG ARKVLIPREN WQNLFELRDD IEIIPIETLE EAIEKSVAIK
EDEKIKLIKA DSLMTVPQ