Gene Rleg_1566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1566 
Symbol 
ID8012644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1550331 
End bp1552208 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content61% 
IMG OID644824152 
ProductAAA ATPase central domain protein 
Protein accessionYP_002975394 
Protein GI241204298 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.20815 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00245284 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCCTTG AACGCACTCC CACGACTATC ACTATCGCCT CGGCGACCTA CGGCGTCGCC 
TTGAAGGCTG CGATGCGGCT CGGTGGCGCG TTTTTGAAAG GAGCGACCAC CCGTAAAACT
GTAATCATCT TGAAGTTGCC TCTGCACGCC GACGAAAGGG GCTATGAAAC CGCGGCTGGC
GTGCTGATCA AATGCGCCCC TGATCTCCAC GGCTTTCTGG TTCTGAGAGC CGACGTCACT
CGCCGGGGCG CCTTGGATGT CAAAAAGCTC AACGATGCTC TCGTTCTGGG CCGTCCGGTC
GTCATTCTCT GGCCTCCCGG TCTCATGGTG CCGGCCCACA TCGTCGCGGC CGCGGACAGA
ATTGTCGACG TCCGTCCGGT GCGGCCCTCC CATCTCGTTT CCGCTGCCAA GCTAGTCGAG
GGAAGGGTGC TAGATATCCG CGAGGCGACT AGGTTCCTGG AATATCCGGT CGCCTCCGTG
TTCGCGGCGA TGCGCACCGG ACGCTCGCCA GAAATCGTCC TGAAGCGGCT CCAGGATGCG
CATCTCTCGT CCGATAGCCC GACATCAGGT CCTGGCCTCG ACGAGCTCGA AGGCTATGGC
GAGGCCCGCG AGTGGGGATT GACGTTGGCG GAGGATATCC GTGCATGGGA GAGTGCCGAA
ATTCAATGGT CCGAGGTCGA TCGGGGAATT CTCATCAGCG GGCCTCCCGG ATCGGGTAAG
ACGCTATTCG CCTCGGCGCT CGCACGTACC TGCGGCGTCG AGATCGTTGC GACGTCTGTA
AGCCGGTGGC AATCGGCAGG GCATCTTGGC GACATGCTGG GAGCCATGCG CAAGAGTTTT
CAGGAGGCTG CCGCAAAGAA GCCATGCATC CTTTTTCTGG ACGAACTGGA CAGCATTGGT
GACAGGGCCA CCTTCAAGGG CGACAACGCC CAATACTCTA GCCAGGTGGT GAACGGACTG
CTCGAACTGG TCGACGGGTT TGATCGTCTC GAAGGCGTCG TGGTGGTTGG AGCTACGAAC
TTCCCCGAGA AAATCGACCC CGCACTCCGG AGGGCCGGCC GACTCGATCG ACATATCGCT
ATTTCCCTGC CGGACACTCA AACCAGACGA TCGCTGTGCC GACGATACAT CCGTAATGAT
ATGCCTGAGG GCGAGATCGA AACGATCGTT CACGCCACTG CCGGTTTCAG CGGCGCCGAT
TTCGAACAGA TGGGACGCGA TATCCGTCGG AGGGCAAGGA GGGGCGGGGC GGAGATCACG
GCCGAGCTCG CTCTGGCGGT CTTGCCGCCT ATGTTGAAAA TCGAGGGAGA GCGGCGGCGA
ACGGTGGCCG TGCACGAAGC CGGCCACGCG ATCGTCGGCA TTCGCGTTGC GGTCGGAAAG
TTGGATTCCA TCGTGGTCGC CCGCGACGTC CCAAGGACCG GATCCGCCGC CGGTTTTGCG
CACTTCGTTC TCGACGGTGA CGTCGAGCGA GATCGGCAGA CTCTTCTGAG CCAGATCGCG
ATGCTGCTGG GAGGAAGACT GGCGGAAGAG GTCATTCTTG GATCGGCATT TGAAGGATCG
GGCGGCGAGG GATCTGATAT CCACAAGGCT ACGGACCTCG CGACCGTTAT GGAGGTTCAG
CTCGGGATGG GTGAATCGCT GGGGTATTTC CGGGCAAGCT CGTCTGCCGA CCTTGAGGAA
CTTCGCCGCC GAATTCCTGC CGTGCGCGAG CGGGTCGAAA AGGTTCTCTT GAAGCAGTGG
AAGCGCGCGC GAACCATCGT CGAGGAACAT GTCGGCGTCA TTGAATTGGT GGCGTCGCAA
CTGGCCGCCA AGGGCCGTCT CGATGGGAAG GAAGTCGAGC AGATGATGTC GGCGAAGCCG
CGGGAGAGGT CGCCATGA
 
Protein sequence
MSLERTPTTI TIASATYGVA LKAAMRLGGA FLKGATTRKT VIILKLPLHA DERGYETAAG 
VLIKCAPDLH GFLVLRADVT RRGALDVKKL NDALVLGRPV VILWPPGLMV PAHIVAAADR
IVDVRPVRPS HLVSAAKLVE GRVLDIREAT RFLEYPVASV FAAMRTGRSP EIVLKRLQDA
HLSSDSPTSG PGLDELEGYG EAREWGLTLA EDIRAWESAE IQWSEVDRGI LISGPPGSGK
TLFASALART CGVEIVATSV SRWQSAGHLG DMLGAMRKSF QEAAAKKPCI LFLDELDSIG
DRATFKGDNA QYSSQVVNGL LELVDGFDRL EGVVVVGATN FPEKIDPALR RAGRLDRHIA
ISLPDTQTRR SLCRRYIRND MPEGEIETIV HATAGFSGAD FEQMGRDIRR RARRGGAEIT
AELALAVLPP MLKIEGERRR TVAVHEAGHA IVGIRVAVGK LDSIVVARDV PRTGSAAGFA
HFVLDGDVER DRQTLLSQIA MLLGGRLAEE VILGSAFEGS GGEGSDIHKA TDLATVMEVQ
LGMGESLGYF RASSSADLEE LRRRIPAVRE RVEKVLLKQW KRARTIVEEH VGVIELVASQ
LAAKGRLDGK EVEQMMSAKP RERSP