Gene Smed_1117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1117 
Symbol 
ID5321963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1185891 
End bp1188407 
Gene Length2517 bp 
Protein Length838 aa 
Translation table11 
GC content60% 
IMG OID640790058 
ProductATP-dependent Clp protease, ATP-binding subunit clpA 
Protein accessionYP_001326803 
Protein GI150396336 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.328847 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.667342 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCAACAT TTTCGCCCAG CCTCGAAAAG GCGCTGCATC AGGCACTGAC TTTTGCCAAC 
GAGCGCCATC ATGAATATGC GACGCTCGAG CACCTGCTGC TGGCATTGAT CGACGATGCC
GATGCGGCGG CTGTGATGGG CGCGTGCAAT GTCAATCTGG ATACGCTTCG TAAGACCGTG
ACCGATTATG TCGACAATGA ATTGTCGAAC CTGGTCACCG GTTATGATGA AGATTCCAAG
CCGACGGCCG GTTTTCAGCG CGTAATCCAG CGCGCCGTCA TCCATGTGCA GTCGTCCGGC
CGCGAGGAGG TGACGGGCGC CAACGTGCTC GTGGCGATCT TCGCCGAGCG CGAGAGTCAT
GCTGCCTATT TCCTGCAGGA GCAGGAAATG ACGCGTTACG ACGCCGTCAA TTTCATTTCG
CACGGTATCG GCAAGCGGCC GGGCAGTTCC GAGGCGCGGC CGGTGCGCGG GGCTGAAGAC
CAGGACTCCG AGCAGAAGGC ATCGCGCGAG AGCGAGGAAA CCGGTCCGAA GAAACAGCAG
GATGCGCTCA CCGCCTATTG CGTGAACCTC AACGAAAAGG CGAAGTCCGG CAAGATCGAT
CCGTTGATCG GGCGTCATGC GGAGGTCAAC CGGACGATTC AGGTCCTTTG CCGCAGGTCC
AAGAACAACC CGCTCTATGT GGGCGATCCG GGCGTGGGCA AGACGGCGAT CGCCGAAGGT
CTCGCCAAGC GCATCATAGA AAAGAAAGTG CCAGAGGCGC TGCAGGACGC CACGATTTTC
GCGCTCGACA TGGGGACGCT GCTCGCCGGC ACGCGCTACC GCGGCGACTT CGAGGAACGG
TTGAAGCAGG TAGTCAAGGA GCTCGAAGAC TATCCCGGCG CGGTGCTTTT CATCGATGAG
ATCCATACGG TTATCGGAGC CGGCGCCACA TCCGGCGGCG CCATGGATGC GTCGAACCTC
CTGAAGCCCG CGCTTTCTTC CGGCGCGATC CGCTGCATCG GCTCGACGAC CTACAAGGAG
TACCGCCAGT TCTTCGAAAA AGACCGGGCG CTTGTTCGCC GTTTCCAGAA GATCGACGTT
AACGAGCCGA CAATTGCAGA CGCAATCGAG ATCATGAAGG GTTTGAAGCC CTATTTCGAG
GACTACCACC GGCTGAAATA CACCAACGAT GCCATAAAGG CGGCGGTGGA GCTTTCCGCA
CGCTACATCA ACGACCGGAA ACTCCCGGAT AAGGCGATCG ATGTCATCGA CGAGTCCGGT
GCCGCGCAGA TGCTTCTGCC CGTCAGCAAG CGCCGCAAGC TGATCACGGA GAGGGAGATC
GAGGCGACGA TCGCGACAAT GGCGCGCATC CCGCCCAAGA CGGTCTCCAA GGACGACGAG
GCGGTGCTCG CCAATCTGGA GCAAGAACTG CGGTCCGTCG TATATGGCCA GGATCTGGCG
ATCGAGGCGC TGGCCTCGTC GATCAAGCTG GCACGGGCGG GTCTCAGAGA GCCGAACAAG
CCTATCGGCT GCTACGTCTT CTCCGGCCCG ACAGGTGTCG GCAAGACGGA AGTGGCAAAG
CAACTCGCCA CCTCGCTCGG CGTCGAGCTG CTGCGCTTCG ACATGTCGGA GTACATGGAA
CGCCACACGG TTTCCCGACT GATCGGCGCA CCTCCGGGCT ATGTCGGCTT CGATCAGGGC
GGGCTCCTGA CCGATGGCGT CGACCAGCAT CCGCACTGCG TGCTGCTTCT GGACGAGATC
GAAAAGGCGC ATCCGGACTT ATTCAACATC CTCCTGCAGG TGATGGACCA CGGTTCATTG
ACCGACCACA ACGGCAAGAA GATCGATTTC CGGAACGTCA TCCTTATCAT GACGACCAAT
GCGGGCGCGT CCGATATGGC TCGGGCGGCA ATCGGCTTCG GTTCTTCCAA GCGGACCGGC
GAGGACGTGG AGGCACTGAA TCGTCTCTTC ACGCCGGAAT TCCGCAACCG CCTGGACGCG
GTCATTCCGT TCAATTCCCT GCCGACTCCG GTCATCCACA AGGTGGTTCA GAAGTTCGTC
ATGCAGCTCG AGACGCAGCT TGCCGAGCGC AACGTCACCT TCGATCTCGC GCCCGATGCG
ATCGCCTGGC TTGCCGAGAG GGGTTATGAT GAGAAGATGG GTGCGCGGCC GCTGTCGCGC
GTCATCCAGG AAAACATCAA GAAGCCTCTC GCGGACGAAA TTCTGTTCGG CAAGCTCAAG
AAGGGCGGCG TCGTCAAAGT GACGATCGGT ACGAAGGAAG ACGGTGCGAA AGGGCTCATG
CTCGAAGCTG TGCCGGAAAC CGCTCCGATC AAGCCCAAAG CCGAGGTCTC GCGGCCGGCC
GGCAAGGGCG CGAAACCCAA GAAGGCAAGC GAAAAGGAGA GCGTGGCCGC GGCCGGAGAG
GGTGCCAAAG CCAAGTCGAA GAAGACGACG GCCAAATCCG CGAACAAGAG CGGTGGCGGT
TCCGATACCG CTCCTCTCAG GGGACGTACG GTTCCGAAGG TGCCGCGAAA GAAATAG
 
Protein sequence
MPTFSPSLEK ALHQALTFAN ERHHEYATLE HLLLALIDDA DAAAVMGACN VNLDTLRKTV 
TDYVDNELSN LVTGYDEDSK PTAGFQRVIQ RAVIHVQSSG REEVTGANVL VAIFAERESH
AAYFLQEQEM TRYDAVNFIS HGIGKRPGSS EARPVRGAED QDSEQKASRE SEETGPKKQQ
DALTAYCVNL NEKAKSGKID PLIGRHAEVN RTIQVLCRRS KNNPLYVGDP GVGKTAIAEG
LAKRIIEKKV PEALQDATIF ALDMGTLLAG TRYRGDFEER LKQVVKELED YPGAVLFIDE
IHTVIGAGAT SGGAMDASNL LKPALSSGAI RCIGSTTYKE YRQFFEKDRA LVRRFQKIDV
NEPTIADAIE IMKGLKPYFE DYHRLKYTND AIKAAVELSA RYINDRKLPD KAIDVIDESG
AAQMLLPVSK RRKLITEREI EATIATMARI PPKTVSKDDE AVLANLEQEL RSVVYGQDLA
IEALASSIKL ARAGLREPNK PIGCYVFSGP TGVGKTEVAK QLATSLGVEL LRFDMSEYME
RHTVSRLIGA PPGYVGFDQG GLLTDGVDQH PHCVLLLDEI EKAHPDLFNI LLQVMDHGSL
TDHNGKKIDF RNVILIMTTN AGASDMARAA IGFGSSKRTG EDVEALNRLF TPEFRNRLDA
VIPFNSLPTP VIHKVVQKFV MQLETQLAER NVTFDLAPDA IAWLAERGYD EKMGARPLSR
VIQENIKKPL ADEILFGKLK KGGVVKVTIG TKEDGAKGLM LEAVPETAPI KPKAEVSRPA
GKGAKPKKAS EKESVAAAGE GAKAKSKKTT AKSANKSGGG SDTAPLRGRT VPKVPRKK