Gene Mlg_1447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1447 
Symbol 
ID4270228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1651112 
End bp1653382 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content66% 
IMG OID638126203 
ProductATP-dependent Clp protease, ATP-binding subunit clpA 
Protein accessionYP_742286 
Protein GI114320603 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.721144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.19219 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAGTA AAGAGCTGGA GTTCACGCTG AACATGGCAT TCAAGGATGC CAGGGAGAAA 
CGGCATGAGT TTCTCACCGT GGAGCACCTG CTGCTGGCCC TCACCGACAA TCCGGCAGCC
GTGGCCGTGC TGAAAGGCTG CGGCGTCAAG CTGGACAAGC TTCGCCGCGA TCTGGAGGGC
TTCCTGGCCG AGACCACGCC GCTGCTGCCC GCCAATGACA CCCGCGAGAC CCAGCCGACG
CTGGGCTTCC AGCGGGTCCT GCAGCGCGCC ATCCTGCACG TGCAATCCTC CGGCAAACGC
GAGGTGACCG GCGCCAACGT CCTGGTGGCC ATCTTCAGTG AGCAGGAGTC GCAGGCGGTT
TACTTCCTGC ATCGGCAGAA CGTCTCCCGT CTCGACGTGG TCAACTACCT CTCCCACGGC
ACCTCCAGCG TCGCCAACGA CCCGGAGGAG GGCGAGGACG CCGGCGGAAC CGGCAACGTG
GAGGAGGAGG CGGAGCCGGC TCAGGGTAAC TCGCCGCTGG ATCAGTACGC CACCAACCTC
AACGCCAAGG CGCGTAAGGG CCAGATCGAC CCCCTGATCG GCCGCGAGCA CGAGGTCGAG
CGGACCATCC AGGTCCTCTG CCGCCGGCGC AAGAACAATC CGCTCTACGT GGGCGAGGCC
GGCGTCGGCA AGACGGCGAT CGCCGAGGGC CTGGCCAAGA TGATCGAGGA CAGCCAGGTG
CCGGAGGTCC TCGCCGATGC CACCATCTAT TCGCTGGACT TGGGCGCGCT GGTGGCGGGC
ACCAAATACC GCGGTGATTT CGAGAAGCGG CTCAAGGCCT TGCTGCAACA ACTGCGCAAC
GACCAGCATG CGGTGCTGTT CATTGATGAG ATTCACACCA TCATCGGGGC CGGCTCGGCC
TCGGGCGGGG TGATGGATGC CTCCAACCTC ATCAAGCCCA TGCTCGCCAG CGGCGAGTTG
AAGTGCATCG GTTCCACCAC CTACCAGGAG TACCGCGGCA TCTTCGAAAA GGATCGCGCC
CTGGCGCGGC GGTTCCAGAA GATCGACGTG GGTGAACCCA GCGTCAGCGA GACGGTGCAG
ATCCTCAAGG GGCTGAAGAG CCGTTTCGAG GCGCACCACG GGGTGCGCTT CACCGAGCCG
GCGCTGAACG CCGCCGCGGA GCTGTCGGCC CGCTACATCA ACGACCGGCG GCTGCCCGAC
AAGGCCATCG ACGTGATCGA CGAGGCCGGC GCCCGGCTGC GGCTGCGGCC AAAGTCCCGC
CGGCGCAAGA CGGTGGGGGT GCAGGACATC GAGGGCATTG TGGCCAAGAT CGCCCGCATC
CCGCCGAAGC GGGTCTCGGC CACCGACATG CAGGTGCTGG AGAATCTCGA AAAGGACCTC
AAGGGGCTGA TTTTCGGCCA GGACGAGGCC ATCGACACCC TCGCCTCCAC CATCAAGCTC
TCCCGCGCCG GGCTGGGCCA GCCGGAGAAG CCGGTGGGCA GCTTCCTCTT CTCCGGCCCC
ACCGGTGTGG GCAAGACCGA GGTCTCCCGC CGGCTGGCCG AGCTGATGGG CGTGAAGCTG
ATCCGCTTCG ATATGTCGGA GTACATGGAG CGGCACACGG TCTCGCGGCT CATCGGTGCG
CCCCCGGGGT ACGTGGGCTA CGACCAGGGC GGCCTGCTCA CCGAGGAGGT CATCAAGCAC
CCGCACTCGG TGGTCCTGCT TGACGAGCTG GAGAAGGCCC ACCCGGACGT CTTCAACCTG
CTGTTACAGG TGATGGACCA CGGTACCCTC ACCGACAACA ACGGTCGCGA GGCGGACTTC
CGCAATGTCA TTCTGATTAT GACCACCAAC GCGGGCGCTG AGGACATGAG CCGTCGCTCC
ATCGGCTTCA TGCCGCAGGA CCACAGCAGC GACGGGCTGG AGGCCATCAA GCGCCAGTTC
ACGCCGGAGT TCCGCAATCG GCTGGATGCC GTGGTCCAGT TCAACCCGCT GGACGAGGAC
AACGTGCAGC GGGTGGTCGA CAAGTTCGTG CGCGAGCTCT CGGTGCAGTT GGCCGAAAAG
CGGGTTACGC TCATGGTCGA TGGTGCGGCG CGGCGCTGGC TGGGCGAGAA GGGCTACGAT
CCCAGCATGG GCGCCCGACC CATGGCCCGG ATCATCCAGC AGCACGTCAA GAAGCCGCTG
GCCGAGAAGC TGCTGTTCGG GGAGCTTGCT GACGGCGGCG AGGTTGAGGT CAGCGTGGAG
GACGGCGAGC TGAAGATCAA CGTCCGGGAG GCGGACGCGG CGGGCGCCTG A
 
Protein sequence
MLSKELEFTL NMAFKDAREK RHEFLTVEHL LLALTDNPAA VAVLKGCGVK LDKLRRDLEG 
FLAETTPLLP ANDTRETQPT LGFQRVLQRA ILHVQSSGKR EVTGANVLVA IFSEQESQAV
YFLHRQNVSR LDVVNYLSHG TSSVANDPEE GEDAGGTGNV EEEAEPAQGN SPLDQYATNL
NAKARKGQID PLIGREHEVE RTIQVLCRRR KNNPLYVGEA GVGKTAIAEG LAKMIEDSQV
PEVLADATIY SLDLGALVAG TKYRGDFEKR LKALLQQLRN DQHAVLFIDE IHTIIGAGSA
SGGVMDASNL IKPMLASGEL KCIGSTTYQE YRGIFEKDRA LARRFQKIDV GEPSVSETVQ
ILKGLKSRFE AHHGVRFTEP ALNAAAELSA RYINDRRLPD KAIDVIDEAG ARLRLRPKSR
RRKTVGVQDI EGIVAKIARI PPKRVSATDM QVLENLEKDL KGLIFGQDEA IDTLASTIKL
SRAGLGQPEK PVGSFLFSGP TGVGKTEVSR RLAELMGVKL IRFDMSEYME RHTVSRLIGA
PPGYVGYDQG GLLTEEVIKH PHSVVLLDEL EKAHPDVFNL LLQVMDHGTL TDNNGREADF
RNVILIMTTN AGAEDMSRRS IGFMPQDHSS DGLEAIKRQF TPEFRNRLDA VVQFNPLDED
NVQRVVDKFV RELSVQLAEK RVTLMVDGAA RRWLGEKGYD PSMGARPMAR IIQQHVKKPL
AEKLLFGELA DGGEVEVSVE DGELKINVRE ADAAGA