Gene Hore_14790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_14790 
Symbol 
ID7312672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1573374 
End bp1575377 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content45% 
IMG OID643611920 
Productpeptidase U32 
Protein accessionYP_002509223 
Protein GI220932315 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGATAA ATAACGTGGA ACTACTGGCT CCTGCCGGTA AGTGGGAGGC CCTGGAAACG 
GTAATTGAAG CCGGGGCAGA TGCTGTTTAC CTGGGTGGTA AAAAGCATAA TATGAGGCTT
CTCAGGACCG GTTTCAACTT CAGTAATGAA GAATTAAAAG AGGCTGTTAA CCTGGCCCAC
AGCCGGGGAG TAAAGATATA TGTAACTGTT AATAATTTAC AGGGCGATGA GGAACTTGAT
GAGATAGCAC CATATCTTCA GTATCTGGCT GAAATTGAGG TTGATGCTTT TATAGTTCAG
GACCTGGGAT TACTCTATCT TATTAATGAG CTGGGGCTTG AGGTTCCTGT ACACTCCAGT
GTAATGATGA ATGTTCATAA CATAGATATG GCCCGGTATT TACATGACCA TGGAGTCAGA
AGGTTTATAG TTAGCCGGGA ATTATCCTTT GACCAGGTGA GGCATATGAC CCGGGAAACC
GGCTTTGAAT ATGAATATTT TATACATGGA GATATGTGTT TTTCTCAGAG TGGGCAGTGT
TTATTGAGTG GTATGGTTTT TGGGAACAGC AGCAACAGGG GGCGTTGTCT AAAGCCCTGT
CGCTGGCCCT ATAGCCTGGC CCGGTATAAA AACGGTTATT TTGAAGAGGG GGTAGAGGTT
AAGGCTGACG GTCCTTACTT CCTGGCGGTA AAGGATATGT GTGTTTTCCG GCATATACCC
CAGTTGATCA GGTCGGGAAT TGTCTCCTTT AAGATAGAGG GCAGAATGAA GCCGGCCAGT
CAGCTTAAGA GAATTGTATC TGCTTACCGG ACAGCTATCG ACAGGTACCT TGATGATCCG
GTGGGTTATA CGGTTGATGA GGATATTTAC CGTGACCTCT ATGATCACCG GGTCCGGGAT
TTCAGTACCT GTTTTGCCCT TAAGAATCCG GGGTCAGATG GAATCGGTTA TACCGGGGAG
AGGGAACCCA AGTTTTTTAG TGAAGCCCGG GAGGAGAAAA AATTAAATCC TGACATGGAT
CTTGATATGG ATCTTAAACT GGATATAGAT AACCCCACGG ATTATAGTTC AGGGGCAAAG
GAGGTTGTGG CCCTTCCTTT GCTATCGATT AAGGTTAACG GTCTTGAGGA GGCTGAAGCT
GCTTTAAAGT CAGGGGTAGA CCGCATCTAT ATAGGTGGTG AAACTCCGTC CTGGAAACCT
CCCTGTGGCC AGGAGGTTAT AAACAAGGTC CTTGATAAAG CCGAAAAAGC AGGGGTGGAA
GTGGTAGTAA CTACCCCCAG AATTACTTTC TCTGATGAAA TGGAAGAATA TATTGAGTTG
TTGAAGGGAT TAGATCTGGA GCAAACCGGT GGAGTTATGG CCGGTAACCT GGGGATGATC
AGAGCCCTGA ATGAGTATTT TGATACCAGG GTTATGGCAG ATTTTGGAGT TAATGCGTTT
AACACCAGGG CTCTCAGTAT TTTAAAAGAA TCCGGGGTGG TTCAGGTTAC AAATCAACTG
GAATCTTCCC TGAAGCAGAT TTTAAAGATG GCTTCAGGTA CAGATATGGA CCTCGAGCTT
ATCGGTCATG GCCATCTTCC CTTTATGGTA TCTGACCACT GTCTCCTTTC TGAACTTCTG
GAAGGGAAGA CTCCGGAGGA TCAGTGTTCT GCCCCCTGTC GGGGTGAGAG ATATGGACTT
GTCAATGATA AAAAGAGGGT TTACCCTGTT ATGACCGATC AGTATTGCCG GACCCATCTT
TATCTCAGTA AAGAGCTTGC TCTCCTCCCA TTTCTGGATA GAATATTATT ATCAGGTATA
AAGAGTTTCA GGATTGAGGC CGGACTTTAT AATGCTGCAA AGGTAGAGGC TGTAGTTGAT
ATCTATAAAC GGGCCTTTAT AGCTATTAAA AATGGTCGCT GGTCACAGGA AAAAACATCG
TTATATAATG AGCTTAAGGG GTTAAGTGAT ACCGGTTATA CCCTGGCAGC CTACGAGAAA
GGGGTCCTGG GGACCGGTAC TTAA
 
Protein sequence
MMINNVELLA PAGKWEALET VIEAGADAVY LGGKKHNMRL LRTGFNFSNE ELKEAVNLAH 
SRGVKIYVTV NNLQGDEELD EIAPYLQYLA EIEVDAFIVQ DLGLLYLINE LGLEVPVHSS
VMMNVHNIDM ARYLHDHGVR RFIVSRELSF DQVRHMTRET GFEYEYFIHG DMCFSQSGQC
LLSGMVFGNS SNRGRCLKPC RWPYSLARYK NGYFEEGVEV KADGPYFLAV KDMCVFRHIP
QLIRSGIVSF KIEGRMKPAS QLKRIVSAYR TAIDRYLDDP VGYTVDEDIY RDLYDHRVRD
FSTCFALKNP GSDGIGYTGE REPKFFSEAR EEKKLNPDMD LDMDLKLDID NPTDYSSGAK
EVVALPLLSI KVNGLEEAEA ALKSGVDRIY IGGETPSWKP PCGQEVINKV LDKAEKAGVE
VVVTTPRITF SDEMEEYIEL LKGLDLEQTG GVMAGNLGMI RALNEYFDTR VMADFGVNAF
NTRALSILKE SGVVQVTNQL ESSLKQILKM ASGTDMDLEL IGHGHLPFMV SDHCLLSELL
EGKTPEDQCS APCRGERYGL VNDKKRVYPV MTDQYCRTHL YLSKELALLP FLDRILLSGI
KSFRIEAGLY NAAKVEAVVD IYKRAFIAIK NGRWSQEKTS LYNELKGLSD TGYTLAAYEK
GVLGTGT