Gene Rleg2_3196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3196 
Symbol 
ID6981948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3284154 
End bp3286085 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content63% 
IMG OID643397913 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_002282689 
Protein GI209550772 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.766379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCTA ACTTACGTAA TTTCGCCTTG TGGGCGATCA TAGCGCTTCT GCTGATCGCC 
CTTTTCAGTA TGTTCCAGAC GGCGCCGGCG CAGACGGGCT CCCGCGAAAT CCCTTATTCG
CAGTTCCTGC GTGAGGTCGA TGCGGGCCGC GTGAAGGACG TCGTCGTCAC CGGCAATCGG
CTGAGCGGAA CGTATCTGGA GAATAACAAT AATTTCCAGA CCTATTCGCC CGTTATCGAC
GACAACTTGC TCGATCGCCT GCAGGCCAAA AACGTTGCTG TCACTGCGCG TCCGGAGACC
GATGGTTCCT CCGGCTTTCT GAGCTACCTC GGAACGCTGC TGCCGATGCT GCTCATCCTC
GGCGTCTGGC TGTTCTTCAT GCGGCAGATG CAGGGCGGCT CGCGTGGCGC GATGGGCTTC
GGCAAGTCGA AGGCCAAGCT GCTGACAGAA GCACACGGCC GTGTCACCTT CGAAGACGTC
GCCGGTGTCG ACGAGGCCAA GCAGGATCTC GAAGAAATCG TCGAATTCCT GCGCGATCCG
CAGAAGTTCC AGCGTCTCGG CGGCAAGATT CCGCGCGGCG TGTTGCTCGT CGGCCCTCCG
GGTACCGGCA AGACGCTGCT CGCCCGCTCG GTTGCCGGCG AAGCCAACGT GCCCTTCTTC
ACCATTTCCG GTTCCGACTT CGTCGAAATG TTCGTCGGCG TCGGCGCTAG CCGCGTGCGC
GATATGTTCG AGCAGGCGAA GAAGAATGCG CCCTGCATCA TCTTCATCGA CGAAATCGAT
GCCGTCGGCC GCCATCGCGG CGCCGGTCTC GGCGGCGGCA ATGACGAACG CGAACAGACG
CTGAACCAGC TGCTGGTCGA GATGGATGGC TTCGAGGCGA ATGAAGGCGT GATCCTGATT
GCCGCCACCA ACCGCCCCGA CGTGCTCGAC CCGGCGCTGC TGCGTCCCGG CCGTTTCGAC
CGCCAGGTCG TGGTTCCGAA CCCGGATATC GTCGGCCGCG AGCGCATCCT CAAGGTGCAT
GCCCGCAACG TTCCGCTGGC GCCGAATGTC GATCTCAAGG TTCTCGCCCG CGGCACGCCC
GGCTTCTCCG GCGCCGATCT GATGAACCTC GTCAACGAAG CCGCCCTCAT GGCCGCCCGC
CGCAACAAGC GTGTCGTCAC CATGCAGGAA TTCGAAGACG CCAAGGACAA GATCATGATG
GGCGCCGAGC GCCGATCCTC AGCCATGACC GAGGCGGAGA AGAAGCTCAC CGCTTACCAT
GAGGCCGGTC ACGCCATCAC CGCGCTTAAT GTCGCCGTCG CCGATCCGCT GCACAAGGCG
ACGATCATTC CACGCGGTCG TGCGCTTGGC ATGGTCATGC AGCTTCCCGA GGGCGACCGC
TACTCGATGA GCTACAAGTG GATGGTGTCG CGCCTCTGCA TCATGATGGG CGGTCGTGTC
GCCGAAGAAC TGACCTTCGG CAAGGAGAAC ATCACCTCGG GTGCCTCCTC CGACATCGAG
CAGGCCACCA AGCTTGCCCG CGCCATGGTC ACGCAATGGG GCTTCTCCGA CCAGCTCGGT
CAGGTCGCCT ATGGCGAGAA CCAGCAGGAG GTCTTCCTCG GTCACTCGGT TTCGCAGTCG
AAGAATGTTT CGGAAGCGAC CGCGCAGAAG ATCGACAATG AAGTGCGCCG CCTGATCGAC
GAGGCCTATA CGCAGGCCCG CACGATCCTG ACGGATAAGC ACGACGAGTT CGTCGCTCTT
GCCGAAGGTC TGCTCGAATA CGAGACGCTG ACCGGCGAAG AGATCAAGGC GCTGATCCGC
GGCGAGAAGC CGTCGCGCGA TCTCGGCGAC GATTCACCGC CAAGCCGCGG CTCGGCGGTT
CCGAAAGCCG GCGCACGGCC TGCCGCCAAG GGTGATGAGC CCGAAGCCGG CCTTGAACCG
CAGCCGCATT GA
 
Protein sequence
MNPNLRNFAL WAIIALLLIA LFSMFQTAPA QTGSREIPYS QFLREVDAGR VKDVVVTGNR 
LSGTYLENNN NFQTYSPVID DNLLDRLQAK NVAVTARPET DGSSGFLSYL GTLLPMLLIL
GVWLFFMRQM QGGSRGAMGF GKSKAKLLTE AHGRVTFEDV AGVDEAKQDL EEIVEFLRDP
QKFQRLGGKI PRGVLLVGPP GTGKTLLARS VAGEANVPFF TISGSDFVEM FVGVGASRVR
DMFEQAKKNA PCIIFIDEID AVGRHRGAGL GGGNDEREQT LNQLLVEMDG FEANEGVILI
AATNRPDVLD PALLRPGRFD RQVVVPNPDI VGRERILKVH ARNVPLAPNV DLKVLARGTP
GFSGADLMNL VNEAALMAAR RNKRVVTMQE FEDAKDKIMM GAERRSSAMT EAEKKLTAYH
EAGHAITALN VAVADPLHKA TIIPRGRALG MVMQLPEGDR YSMSYKWMVS RLCIMMGGRV
AEELTFGKEN ITSGASSDIE QATKLARAMV TQWGFSDQLG QVAYGENQQE VFLGHSVSQS
KNVSEATAQK IDNEVRRLID EAYTQARTIL TDKHDEFVAL AEGLLEYETL TGEEIKALIR
GEKPSRDLGD DSPPSRGSAV PKAGARPAAK GDEPEAGLEP QPH