Gene Information |
Locus tag | Hhal_0601 |
Symbol | |
ID | 4709295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 678541 |
End bp | 681003 |
Gene Length | 2463 bp |
Protein Length | 820 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639855061 |
Product | ATP-dependent protease La |
Protein accession | YP_001002189 |
Protein GI | 121997402 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0466] ATP-dependent Lon protease, bacterial type |
TIGRFAM ID | [TIGR00763] ATP-dependent protease La |
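The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Number of residues for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(678541, 681003)
print(length)                              # 2463
print(expected_protein_length_aa(length))  # 820
```

Both values match the Gene Length and Protein Length rows of the record.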
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.273158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGAGC CAACCGCTCA CAATCCCGAA TCCCCGCAGA CCGTCACCCA GGCGCCGGTT TTGCCGCTGC GCGACGTGGT GGTCTATCCG CACATGGTCA TCCCGCTGTT CGTTGGGCGG GAACGCTCCA TCCACGCCCT CGAGGCGGCC ATGGAGCAGG ACAAGCGGAT CTTCCTGATC GCCCAGCGCA GCGCCGAGGT CGATGACCCG GGTGTCGAAG AACTCTATGG CTACGGCACC GTGGCCTCGA TCCTGCAGAT GCTCAAGCTC CCCGACGGTA CGGTGAAGGT GCTGGTCGAG GGCGGGGAGC GGGCGCGCCT GGTCGAGCTG CTCGATAGCG GCGAATACCT CTCGGCCCAT CTGGTTACGG TGCCCGAGCC GCAGCCCAGC GACGAGGACC GCGAGCTGGA GGTGGTGGCG CGGTCCGCGA CCAACGTGTT TGAGCAGTAC GTCAAGCTCA ACAAGAAGAT CCCGCCGGAG ATTCTCTCTT CGCTTTCCGG GATCGAGGAG CCCGGCCGAC TGGCTGACAC CATTGCTGCG CACATGGCGC TGAAGGTCGA GGAGAAGCAG AAGGTCCTCG AGATGGAGGG CCCGCGCGAG CGCCTCGAGC ACCTGATGGG CCTGATCGAG GGGGAGATCG ACATCCTCCA GATCGAGAAG CGCATCCGCG GGCGCGTCAA GCAGCAGATG GAGAAATCCC AGCGGGAGTA CTACCTCAAT GAGCAGATGA AGGCCATCCA GAAGGAGCTG GGTGAGCTGG AGGACGTGCC CAACGAGGTC GAGGACCTCG AGAACAAGAT CGACCAGGCC GGGATGCCGC AGCAGGCCCT GGACAAGGCG AAGCAGGAGC TGAACAAGCT CAAGATGATG TCGCCGATGT CCGCCGAGGC CACCGTGGTG CGCAATTACC TCGACTGGCT GGTGAGCCTG CCCTGGAAGG ACAAGACGCG CGTGCGGCAC GATCTCAAGC ACGCCGCCAA GGTCCTCGAT CAAGACCACT ACGGCTTGGA CAAGGTCAAG GAGCGCATCC TCGAGTACCT GGCCGTGCAG CGACGGGTGC GCAAGCTCAA GGGGCCGATC CTCTGTTTGG TTGGGCCGCC GGGTGTCGGC AAGACGTCGC TGGGGCAGTC CATCGCCCGG GCCACCAACC GCAAGTTCAC CCGCATGTCC CTCGGTGGTG TGCGCGACGA GGCAGAGATC CGTGGCCACC GCCGCACCTA CATCGGCTCG CTGCCGGGCA AGATTGTGCA GAACCTGAGC AAGGCGGGTA AGCGCAACCC GCTGTTCCTG CTCGATGAGG TGGACAAGAT GGCCATGGAC TTCCGGGGCG ACCCGTCCTC GGCCCTGTTG GAGGTCCTGG ATCCGGAGCA GAACAGCAGC TTCAGCGACC ACTACCTTGA GGTCGACTTC GATCTCTCCG ACGTGATGTT CGTCTGCACG GCGAACACCA TGAACATCCC GGCGCCGCTG CTCGACCGCA TGGAGGTCAT CCGCCTGCCG GGCTACACCG AGGAGGAGAA GCTCGCCATC GCGCAGAGCT ATCTGCTGCC CAAGCAGATG AAGGCCAACG GCATCCGCAA GGGCGAGCTG GACGTCAAGG AGAGTGCCAT CCGCGACGTC ATCCGCTACT ACACCCGTGA GGCCGGCGTG CGAAATCTGG AGCGGGAGCT GGCGACGGTC TGCCGCAAGG TGGTCAAGGG GCTGGTCGAG GGCGAGTCCA AGGGCCGCAA GCGCAGCGCC GGCGTGCAGG TGACCCGCCG CAATATCGAC AAGTACCTCG GGGTGCGCCG CTACCGCTTC 
GGGGTCGCCG AGACCGAGGA TCGCATCGGC CAGTCCACCG GTCTGGCCTG GACCGAGGTC GGTGGCGAGT TGCTGACCAT CGAGGTGGCT GTGGTGCCCG GCAAGGGGCG GGCGACGCAG ACCGGACAGC TCGGCGACGT CATGAAGGAG TCCATCGACG CTGCCACCAC CGTGGTGCGC AGCCGGGCCC GCACCCTGGG ACTGGATCCG GAGTTCTACA CCAAGAACGA CTACCACATC CACGTCCCCG AGGGGGCCAT CCCCAAGGAC GGTCCGTCGG CTGGCACTGG CATGTGTGTC GCCCTGGTCT CGGCCTTGAC CGGCATCCCG GTGCGGGCGG GCGTGGGCAT GACCGGTGAG ATCACCCTGC GCGGCGAGGT GTTGCCCATC GGTGGGCTCA AGGAGAAGCT TCTGGCCGCC CTGCGGGGCG GCATCGACAC GGTGCTGATC CCGTCGGAGA ACGAAAAGGA TCTGGCGGAT GTGCCCAAGG ACGTGAAGTC GAAGCTCGAC ATCCGGCCGG TGCGCTGGAT CGACGAGGTC CTCGAGGTGG CCCTGACCCG TCAGCCTGAA CCGCTCCCGG CCCCGGAGGG TGAAGGCGAT GCCGACGCGG CCACGCGCGT GGCAGTCGGC GAGGGGGAGG GTGACCCCAA GCGGCCACAC TGA
|
Protein sequence | MAEPTAHNPE SPQTVTQAPV LPLRDVVVYP HMVIPLFVGR ERSIHALEAA MEQDKRIFLI AQRSAEVDDP GVEELYGYGT VASILQMLKL PDGTVKVLVE GGERARLVEL LDSGEYLSAH LVTVPEPQPS DEDRELEVVA RSATNVFEQY VKLNKKIPPE ILSSLSGIEE PGRLADTIAA HMALKVEEKQ KVLEMEGPRE RLEHLMGLIE GEIDILQIEK RIRGRVKQQM EKSQREYYLN EQMKAIQKEL GELEDVPNEV EDLENKIDQA GMPQQALDKA KQELNKLKMM SPMSAEATVV RNYLDWLVSL PWKDKTRVRH DLKHAAKVLD QDHYGLDKVK ERILEYLAVQ RRVRKLKGPI LCLVGPPGVG KTSLGQSIAR ATNRKFTRMS LGGVRDEAEI RGHRRTYIGS LPGKIVQNLS KAGKRNPLFL LDEVDKMAMD FRGDPSSALL EVLDPEQNSS FSDHYLEVDF DLSDVMFVCT ANTMNIPAPL LDRMEVIRLP GYTEEEKLAI AQSYLLPKQM KANGIRKGEL DVKESAIRDV IRYYTREAGV RNLERELATV CRKVVKGLVE GESKGRKRSA GVQVTRRNID KYLGVRRYRF GVAETEDRIG QSTGLAWTEV GGELLTIEVA VVPGKGRATQ TGQLGDVMKE SIDAATTVVR SRARTLGLDP EFYTKNDYHI HVPEGAIPKD GPSAGTGMCV ALVSALTGIP VRAGVGMTGE ITLRGEVLPI GGLKEKLLAA LRGGIDTVLI PSENEKDLAD VPKDVKSKLD IRPVRWIDEV LEVALTRQPE PLPAPEGEGD ADAATRVAVG EGEGDPKRPH
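The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence is shown below. It uses only the Python standard library; the helper names are hypothetical. Translation table 11 assigns the same amino acids to internal codons as the standard genetic code and differs in its permitted start codons, so this sketch does not handle alternative starts such as GTG or TTG (this gene begins with ATG).

```python
import itertools

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three display blocks of the gene sequence in this record.
prefix = clean_sequence("ATGGCTGAGC CAACCGCTCA CAATCCCGAA")
print(translate(prefix))  # MAEPTAHNPE
```

Applied to the full gene sequence, `gc_fraction` could be compared with the GC content row and `translate` with the protein sequence shown in the record.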