Gene Hore_23310 details


Gene Information

Locus tag: Hore_23310
Symbol:
ID: 7314214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2550942
End bp: 2552861
Gene Length: 1920 bp
Protein Length: 639 aa
Translation table: 11
GC content: 43%
IMG OID: 643612783
Product: Sporulation protease LonC
Protein accession: YP_002510071
Protein GI: 220933163
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1067] Predicted ATP-dependent protease
TIGRFAM ID: [TIGR00764] lon-related putative ATP-dependent protease
            [TIGR02903] ATP-dependent protease, Lon family
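
The gene and protein lengths listed above follow from the coordinates by simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2550942, 2552861)
print(length)                     # 1920
print(protein_length_aa(length))  # 639
```

Both values agree with the Gene Length and Protein Length fields of this record.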


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCTTCT TGGATGGCCT TTTTAGTAAA GATGATAATA AAATAAAAAA GGCTGATAAA 
AAAGAAAAAG AACTTAAAGT ATTGTACAAA AAAGCCAATG ATTATTATGG TAAGGAACAG
TTCATACTTA AAGCAGGGAA GGTTAATGCC CTGGATTTAA TAAGTTCCAG TAAATTATTT
GATAAACTGA CAGCCCTGGA GAGAATAATT TATGAAGACC CTACCATCCA GGTGGGAAAG
GGAAATTTTG AGGAACGAAT TTTAAAATTA GAGGATAAGA TTGCTGACAT GCTGGCCGAG
CGCTCCGTTG AAAAGGATAT TGAAGAACAG ATTGCCGAGA GGATGGAGAA GAGGCAGCGT
GAATATATTA AGGAGATTAA AAAGGAAATA GTCAATGACG ATCCTCCAGA TAATCCGGAA
ACATTAAGAA GACTGGCCCG GCTTGAAAAG CTGGATGCCA GGAGTTTAAA TAAATCTGTC
ATTGATCTGG TTAGGCCCAA ATCACTTGAT GAGATAGTGG GTCAGCAGCG GGCTTTAAAG
GCCCTTGTTT CCAAAATTGC TTCTCCCTAC CCCCAGCACG TAATCCTATA TGGACCCCCG
GGGGTGGGTA AAACTACGGC TGCCCGACTG GCCCTGGAAG AGGCCAAGAA GAGGCAAAAT
ACCCCTTTTT ACGGGGATTC AAAATTCGTT GAAGTTGATG GGGCGACCCT GAGGTGGGAC
CCGAGGGAAG TTACCAATCC CTTGCTGGGC TCAGTTCATG ACCCCATATA CCAGGGTGCC
AAAAAAGTTC TGGCCGAGGG TGGAGTCCCG GAACCCAAGA CAGGGCTGGT AACCGAAGCC
CATGCTGGTA TCCTCTTTAT CGATGAAATA GGGGAACTGG ATCCAATGCT TCAGAACAAA
CTGTTGAAAG TAATGGAGGA TAAAAGGGTT AAGTTTGAAT CTTCCTATTA TGATAAAAAT
GATGAAAATA TCCCCTTATA TATTAAAAAG CTCTTTGAAG AAGGGGCTCC GGCTGATTTT
ATTTTAATCG GAGCTACCAC CAGAAGTCCC AGTAAAATAA ATCCTGCTTT CCGCTCCCGG
TGTGCAGAGG TATTCTTTAA TCCCCTGTCC CGGGAAGATA TACAGCAAAT TGTTATTAAT
GCTGTCAAAA AACTCACGGT AAAAATTGAG GATGAAATAC CGGAGATAAT AAGTGAATAT
ACGACTGAGG GGAGAACAGC TATAAACCTA CTGATAGATG CCTACAGCCT TGTCCTCTAT
GAAAATGAAG GAGCTGACGA GCAGGAGCTT ATAATTACCA GGGATAAGCT CTTTGAAGCT
ATCCAGAACC GGCGAATGAT TCCCCACAAC AAGATTAAGT CCAGTGAAAA ATCAGAAATT
GGAAAGGTCT TTGGACTGGG GGTCAATGGT TACCTTGGTA CAGTTATTGA AATTGAAGCC
GTTGCCTTTA CAGCCGAGGA AAAGGGTAAT GGTAAGCTGA GATTTAATGA AACCGCCGGT
AAAATGGCTA AAGATTCCCT TTTTAATGCT GCAGCTGTAA TCAGAAAAAT AACCGGGAAG
AAAATGAAGG ATTATGACCT CCATGTCAAT ATTGTCGGGG GAGGTAATGT AGATGGTCCT
TCGGCCGGTA TTGCGATGCT GCTGGCTTTA ATAAGTGCTA TTGAGGAAGT ACCCTTAAAA
CAGGATATAG CTGTGACCGG TGAGGTTTCA ATCAGGGGTA ATATTAAACC GGTCAGTGGT
ATCAGGGAAA AGATATATGC CGCCGAACAG GCAGGAATGA GAGAGGTTTT GGTTCCCGCT
GAAAATATGA TAGATATACA GGAGGACTGG GATATAAAGG TAACCCCGAT ATCTACGGTA
GAAGAAGCCC TGAAGCGGGT TCTTATTGAC CAGGATCAAT TAAAGCTATC AATTATTTAA
 
Protein sequence
MSFLDGLFSK DDNKIKKADK KEKELKVLYK KANDYYGKEQ FILKAGKVNA LDLISSSKLF 
DKLTALERII YEDPTIQVGK GNFEERILKL EDKIADMLAE RSVEKDIEEQ IAERMEKRQR
EYIKEIKKEI VNDDPPDNPE TLRRLARLEK LDARSLNKSV IDLVRPKSLD EIVGQQRALK
ALVSKIASPY PQHVILYGPP GVGKTTAARL ALEEAKKRQN TPFYGDSKFV EVDGATLRWD
PREVTNPLLG SVHDPIYQGA KKVLAEGGVP EPKTGLVTEA HAGILFIDEI GELDPMLQNK
LLKVMEDKRV KFESSYYDKN DENIPLYIKK LFEEGAPADF ILIGATTRSP SKINPAFRSR
CAEVFFNPLS REDIQQIVIN AVKKLTVKIE DEIPEIISEY TTEGRTAINL LIDAYSLVLY
ENEGADEQEL IITRDKLFEA IQNRRMIPHN KIKSSEKSEI GKVFGLGVNG YLGTVIEIEA
VAFTAEEKGN GKLRFNETAG KMAKDSLFNA AAVIRKITGK KMKDYDLHVN IVGGGNVDGP
SAGIAMLLAL ISAIEEVPLK QDIAVTGEVS IRGNIKPVSG IREKIYAAEQ AGMREVLVPA
ENMIDIQEDW DIKVTPISTV EEALKRVLID QDQLKLSII
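
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be handled, the following strips that whitespace, computes a GC fraction, and translates codons up to the first stop. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon-to-residue mapping is the standard one, which translation table 11 shares; table 11 differs in the start codons it allows, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence as displayed above.
first_block = "ATGTCCTTCT TGGATGGCCT TTTTAGTAAA"
print(translate(first_block))  # MSFLDGLFSK
```

The printed residues match the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same kind of comparison against the GC content and protein sequence fields.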