Gene Information |
Locus tag | Hore_23310 |
Symbol | |
ID | 7314214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2550942 |
End bp | 2552861 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643612783 |
Product | Sporulation protease LonC |
Protein accession | YP_002510071 |
Protein GI | 220933163 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | [TIGR00764] lon-related putative ATP-dependent protease [TIGR02903] ATP-dependent protease, Lon family |
|
|
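The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Hore_23310.
length_bp = gene_length(2550942, 2552861)
length_aa = protein_length(length_bp)

print(length_bp)  # 1920
print(length_aa)  # 639
```

Both results match the Gene Length (1920 bp) and Protein Length (639 aa) rows.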
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTTCT TGGATGGCCT TTTTAGTAAA GATGATAATA AAATAAAAAA GGCTGATAAA AAAGAAAAAG AACTTAAAGT ATTGTACAAA AAAGCCAATG ATTATTATGG TAAGGAACAG TTCATACTTA AAGCAGGGAA GGTTAATGCC CTGGATTTAA TAAGTTCCAG TAAATTATTT GATAAACTGA CAGCCCTGGA GAGAATAATT TATGAAGACC CTACCATCCA GGTGGGAAAG GGAAATTTTG AGGAACGAAT TTTAAAATTA GAGGATAAGA TTGCTGACAT GCTGGCCGAG CGCTCCGTTG AAAAGGATAT TGAAGAACAG ATTGCCGAGA GGATGGAGAA GAGGCAGCGT GAATATATTA AGGAGATTAA AAAGGAAATA GTCAATGACG ATCCTCCAGA TAATCCGGAA ACATTAAGAA GACTGGCCCG GCTTGAAAAG CTGGATGCCA GGAGTTTAAA TAAATCTGTC ATTGATCTGG TTAGGCCCAA ATCACTTGAT GAGATAGTGG GTCAGCAGCG GGCTTTAAAG GCCCTTGTTT CCAAAATTGC TTCTCCCTAC CCCCAGCACG TAATCCTATA TGGACCCCCG GGGGTGGGTA AAACTACGGC TGCCCGACTG GCCCTGGAAG AGGCCAAGAA GAGGCAAAAT ACCCCTTTTT ACGGGGATTC AAAATTCGTT GAAGTTGATG GGGCGACCCT GAGGTGGGAC CCGAGGGAAG TTACCAATCC CTTGCTGGGC TCAGTTCATG ACCCCATATA CCAGGGTGCC AAAAAAGTTC TGGCCGAGGG TGGAGTCCCG GAACCCAAGA CAGGGCTGGT AACCGAAGCC CATGCTGGTA TCCTCTTTAT CGATGAAATA GGGGAACTGG ATCCAATGCT TCAGAACAAA CTGTTGAAAG TAATGGAGGA TAAAAGGGTT AAGTTTGAAT CTTCCTATTA TGATAAAAAT GATGAAAATA TCCCCTTATA TATTAAAAAG CTCTTTGAAG AAGGGGCTCC GGCTGATTTT ATTTTAATCG GAGCTACCAC CAGAAGTCCC AGTAAAATAA ATCCTGCTTT CCGCTCCCGG TGTGCAGAGG TATTCTTTAA TCCCCTGTCC CGGGAAGATA TACAGCAAAT TGTTATTAAT GCTGTCAAAA AACTCACGGT AAAAATTGAG GATGAAATAC CGGAGATAAT AAGTGAATAT ACGACTGAGG GGAGAACAGC TATAAACCTA CTGATAGATG CCTACAGCCT TGTCCTCTAT GAAAATGAAG GAGCTGACGA GCAGGAGCTT ATAATTACCA GGGATAAGCT CTTTGAAGCT ATCCAGAACC GGCGAATGAT TCCCCACAAC AAGATTAAGT CCAGTGAAAA ATCAGAAATT GGAAAGGTCT TTGGACTGGG GGTCAATGGT TACCTTGGTA CAGTTATTGA AATTGAAGCC GTTGCCTTTA CAGCCGAGGA AAAGGGTAAT GGTAAGCTGA GATTTAATGA AACCGCCGGT AAAATGGCTA AAGATTCCCT TTTTAATGCT GCAGCTGTAA TCAGAAAAAT AACCGGGAAG AAAATGAAGG ATTATGACCT CCATGTCAAT ATTGTCGGGG GAGGTAATGT AGATGGTCCT TCGGCCGGTA TTGCGATGCT GCTGGCTTTA ATAAGTGCTA TTGAGGAAGT ACCCTTAAAA CAGGATATAG CTGTGACCGG TGAGGTTTCA ATCAGGGGTA ATATTAAACC GGTCAGTGGT ATCAGGGAAA AGATATATGC CGCCGAACAG GCAGGAATGA GAGAGGTTTT GGTTCCCGCT 
GAAAATATGA TAGATATACA GGAGGACTGG GATATAAAGG TAACCCCGAT ATCTACGGTA GAAGAAGCCC TGAAGCGGGT TCTTATTGAC CAGGATCAAT TAAAGCTATC AATTATTTAA
|
Protein sequence | MSFLDGLFSK DDNKIKKADK KEKELKVLYK KANDYYGKEQ FILKAGKVNA LDLISSSKLF DKLTALERII YEDPTIQVGK GNFEERILKL EDKIADMLAE RSVEKDIEEQ IAERMEKRQR EYIKEIKKEI VNDDPPDNPE TLRRLARLEK LDARSLNKSV IDLVRPKSLD EIVGQQRALK ALVSKIASPY PQHVILYGPP GVGKTTAARL ALEEAKKRQN TPFYGDSKFV EVDGATLRWD PREVTNPLLG SVHDPIYQGA KKVLAEGGVP EPKTGLVTEA HAGILFIDEI GELDPMLQNK LLKVMEDKRV KFESSYYDKN DENIPLYIKK LFEEGAPADF ILIGATTRSP SKINPAFRSR CAEVFFNPLS REDIQQIVIN AVKKLTVKIE DEIPEIISEY TTEGRTAINL LIDAYSLVLY ENEGADEQEL IITRDKLFEA IQNRRMIPHN KIKSSEKSEI GKVFGLGVNG YLGTVIEIEA VAFTAEEKGN GKLRFNETAG KMAKDSLFNA AAVIRKITGK KMKDYDLHVN IVGGGNVDGP SAGIAMLLAL ISAIEEVPLK QDIAVTGEVS IRGNIKPVSG IREKIYAAEQ AGMREVLVPA ENMIDIQEDW DIKVTPISTV EEALKRVLID QDQLKLSII
|
| |
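The gene sequence as displayed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below translates the first 30 nucleotides of that sequence and compares the result with the start of the listed protein. It assumes translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons); the `translate` helper is illustrative, not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
prefix = "ATGTCCTTCT TGGATGGCCT TTTTAGTAAA"
print(translate(prefix))  # MSFLDGLFSK
```

The output matches the first ten residues of the protein sequence (MSFLDGLFSK). Passing the full gene sequence to the same function would allow a check of all 639 residues.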