Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1447 |
Symbol | |
ID | 4270228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1651112 |
End bp | 1653382 |
Gene Length | 2271 bp |
Protein Length | 756 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638126203 |
Product | ATP-dependent Clp protease, ATP-binding subunit clpA |
Protein accession | YP_742286 |
Protein GI | 114320603 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0542] ATPases with chaperone activity, ATP-binding subunit |
TIGRFAM ID | [TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.721144 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.19219 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAGTA AAGAGCTGGA GTTCACGCTG AACATGGCAT TCAAGGATGC CAGGGAGAAA CGGCATGAGT TTCTCACCGT GGAGCACCTG CTGCTGGCCC TCACCGACAA TCCGGCAGCC GTGGCCGTGC TGAAAGGCTG CGGCGTCAAG CTGGACAAGC TTCGCCGCGA TCTGGAGGGC TTCCTGGCCG AGACCACGCC GCTGCTGCCC GCCAATGACA CCCGCGAGAC CCAGCCGACG CTGGGCTTCC AGCGGGTCCT GCAGCGCGCC ATCCTGCACG TGCAATCCTC CGGCAAACGC GAGGTGACCG GCGCCAACGT CCTGGTGGCC ATCTTCAGTG AGCAGGAGTC GCAGGCGGTT TACTTCCTGC ATCGGCAGAA CGTCTCCCGT CTCGACGTGG TCAACTACCT CTCCCACGGC ACCTCCAGCG TCGCCAACGA CCCGGAGGAG GGCGAGGACG CCGGCGGAAC CGGCAACGTG GAGGAGGAGG CGGAGCCGGC TCAGGGTAAC TCGCCGCTGG ATCAGTACGC CACCAACCTC AACGCCAAGG CGCGTAAGGG CCAGATCGAC CCCCTGATCG GCCGCGAGCA CGAGGTCGAG CGGACCATCC AGGTCCTCTG CCGCCGGCGC AAGAACAATC CGCTCTACGT GGGCGAGGCC GGCGTCGGCA AGACGGCGAT CGCCGAGGGC CTGGCCAAGA TGATCGAGGA CAGCCAGGTG CCGGAGGTCC TCGCCGATGC CACCATCTAT TCGCTGGACT TGGGCGCGCT GGTGGCGGGC ACCAAATACC GCGGTGATTT CGAGAAGCGG CTCAAGGCCT TGCTGCAACA ACTGCGCAAC GACCAGCATG CGGTGCTGTT CATTGATGAG ATTCACACCA TCATCGGGGC CGGCTCGGCC TCGGGCGGGG TGATGGATGC CTCCAACCTC ATCAAGCCCA TGCTCGCCAG CGGCGAGTTG AAGTGCATCG GTTCCACCAC CTACCAGGAG TACCGCGGCA TCTTCGAAAA GGATCGCGCC CTGGCGCGGC GGTTCCAGAA GATCGACGTG GGTGAACCCA GCGTCAGCGA GACGGTGCAG ATCCTCAAGG GGCTGAAGAG CCGTTTCGAG GCGCACCACG GGGTGCGCTT CACCGAGCCG GCGCTGAACG CCGCCGCGGA GCTGTCGGCC CGCTACATCA ACGACCGGCG GCTGCCCGAC AAGGCCATCG ACGTGATCGA CGAGGCCGGC GCCCGGCTGC GGCTGCGGCC AAAGTCCCGC CGGCGCAAGA CGGTGGGGGT GCAGGACATC GAGGGCATTG TGGCCAAGAT CGCCCGCATC CCGCCGAAGC GGGTCTCGGC CACCGACATG CAGGTGCTGG AGAATCTCGA AAAGGACCTC AAGGGGCTGA TTTTCGGCCA GGACGAGGCC ATCGACACCC TCGCCTCCAC CATCAAGCTC TCCCGCGCCG GGCTGGGCCA GCCGGAGAAG CCGGTGGGCA GCTTCCTCTT CTCCGGCCCC ACCGGTGTGG GCAAGACCGA GGTCTCCCGC CGGCTGGCCG AGCTGATGGG CGTGAAGCTG ATCCGCTTCG ATATGTCGGA GTACATGGAG CGGCACACGG TCTCGCGGCT CATCGGTGCG CCCCCGGGGT ACGTGGGCTA CGACCAGGGC GGCCTGCTCA CCGAGGAGGT CATCAAGCAC CCGCACTCGG TGGTCCTGCT TGACGAGCTG GAGAAGGCCC ACCCGGACGT CTTCAACCTG CTGTTACAGG TGATGGACCA CGGTACCCTC ACCGACAACA ACGGTCGCGA GGCGGACTTC CGCAATGTCA TTCTGATTAT GACCACCAAC GCGGGCGCTG AGGACATGAG CCGTCGCTCC ATCGGCTTCA TGCCGCAGGA CCACAGCAGC GACGGGCTGG AGGCCATCAA GCGCCAGTTC ACGCCGGAGT TCCGCAATCG GCTGGATGCC GTGGTCCAGT TCAACCCGCT GGACGAGGAC AACGTGCAGC GGGTGGTCGA CAAGTTCGTG CGCGAGCTCT CGGTGCAGTT GGCCGAAAAG CGGGTTACGC TCATGGTCGA TGGTGCGGCG CGGCGCTGGC TGGGCGAGAA GGGCTACGAT CCCAGCATGG GCGCCCGACC CATGGCCCGG ATCATCCAGC AGCACGTCAA GAAGCCGCTG GCCGAGAAGC TGCTGTTCGG GGAGCTTGCT GACGGCGGCG AGGTTGAGGT CAGCGTGGAG GACGGCGAGC TGAAGATCAA CGTCCGGGAG GCGGACGCGG CGGGCGCCTG A
|
Protein sequence | MLSKELEFTL NMAFKDAREK RHEFLTVEHL LLALTDNPAA VAVLKGCGVK LDKLRRDLEG FLAETTPLLP ANDTRETQPT LGFQRVLQRA ILHVQSSGKR EVTGANVLVA IFSEQESQAV YFLHRQNVSR LDVVNYLSHG TSSVANDPEE GEDAGGTGNV EEEAEPAQGN SPLDQYATNL NAKARKGQID PLIGREHEVE RTIQVLCRRR KNNPLYVGEA GVGKTAIAEG LAKMIEDSQV PEVLADATIY SLDLGALVAG TKYRGDFEKR LKALLQQLRN DQHAVLFIDE IHTIIGAGSA SGGVMDASNL IKPMLASGEL KCIGSTTYQE YRGIFEKDRA LARRFQKIDV GEPSVSETVQ ILKGLKSRFE AHHGVRFTEP ALNAAAELSA RYINDRRLPD KAIDVIDEAG ARLRLRPKSR RRKTVGVQDI EGIVAKIARI PPKRVSATDM QVLENLEKDL KGLIFGQDEA IDTLASTIKL SRAGLGQPEK PVGSFLFSGP TGVGKTEVSR RLAELMGVKL IRFDMSEYME RHTVSRLIGA PPGYVGYDQG GLLTEEVIKH PHSVVLLDEL EKAHPDVFNL LLQVMDHGTL TDNNGREADF RNVILIMTTN AGAEDMSRRS IGFMPQDHSS DGLEAIKRQF TPEFRNRLDA VVQFNPLDED NVQRVVDKFV RELSVQLAEK RVTLMVDGAA RRWLGEKGYD PSMGARPMAR IIQQHVKKPL AEKLLFGELA DGGEVEVSVE DGELKINVRE ADAAGA
|
| |