Gene Rleg_3491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3491 
Symbol 
ID8014362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3526177 
End bp3528123 
Gene Length1947 bp 
Protein Length648 aa 
Translation table11 
GC content62% 
IMG OID644826056 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_002977276 
Protein GI241206180 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.202032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0464003 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGGGA GTTCAATGAA CCCTAACTTA CGTAATTTCG CCTTGTGGGC GATCATAGCG 
CTTCTGCTGA TCGCCCTTTT CAGTATGTTT CAGACGGCGC CGGCGCAGAC GGGCTCCCGC
GAAATCCCTT ATTCGCAGTT TCTGCGTGAG GTCGATGCGG GCCGCGTGAA GGATGTCGTG
GTCACGGGCA ACCGTCTCTC CGGAAGCTAT GTTGAAAATG GCACCACCTT TCAGACCTAT
TCGCCTGTGA TCGACGACAG TCTGCTCGAT CGCCTGCAGT CGAAGAACGT CCTGGTTTCC
GCCCGTCCTG AAACGGACGG CTCTTCGGGT TTCCTCAGCT ATCTCGGCAC GCTTTTGCCG
ATGCTTCTGA TTCTCGGCGT CTGGCTGTTC TTCATGCGGC AGATGCAGGG CGGCTCACGC
GGGGCGATGG GCTTCGGCAA GTCCAAGGCC AAGCTGCTCA CCGAAGCGCA TGGCCGCGTG
ACTTTCGAAG ACGTTGCCGG TGTCGACGAG GCCAAGCAGG ATCTCGAGGA AATCGTCGAA
TTCCTGCGTG ATCCGCAGAA GTTCCAGCGT CTCGGTGGCA AGATCCCGCG CGGCGTGCTG
CTCGTCGGTC CTCCGGGCAC CGGCAAGACG CTGCTCGCCC GCTCGGTCGC CGGCGAAGCC
AACGTGCCCT TCTTCACCAT TTCGGGCTCC GACTTTGTCG AAATGTTCGT CGGCGTCGGC
GCAAGCCGGG TGCGCGATAT GTTCGAGCAG GCGAAGAAGA ACGCGCCCTG CATCATCTTC
ATCGACGAAA TCGATGCCGT CGGCCGCCAT CGCGGCGCCG GTCTCGGCGG CGGCAATGAC
GAACGCGAGC AGACGCTGAA CCAGTTGCTG GTCGAAATGG ACGGCTTCGA GGCGAATGAA
GGCGTGATCC TGATCGCCGC TACCAACCGT CCCGACGTTC TCGACCCGGC GCTGCTGCGT
CCCGGCCGTT TCGACCGTCA GGTCGTGGTG CCGAACCCCG ACATCGTCGG CCGCGAACGC
ATTCTCAAGG TGCATGCCCG CAACGTTCCG CTGGCGCCGA ATGTCGATCT CAAGATCCTC
GCCCGCGGTA CGCCCGGTTT CTCCGGCGCC GACCTGATGA ACCTCGTCAA CGAAGCCGCC
CTCATGGCGG CCCGCCGCAA CAAGCGCGTC GTCACCATGC AGGAATTCGA GGACGCCAAG
GACAAGATCA TGATGGGCGC CGAGCGCCGT TCCTCGGCGA TGACCGAGGC GGAAAAGAAG
CTCACCGCTT ACCATGAGGC CGGACACGCC ATGACCGCGC TCAATGTGGC CGTCGCCGAT
CCGCTGCACA AGGCGACGAT CATTCCGCGC GGCCGTGCGC TCGGCATGGT CATGCAGCTT
CCCGAGGGCG ACCGCTACTC GATGAGCTAC AAGTGGATGG TCTCACGCCT CTGCATCATG
ATGGGCGGCC GCGTTGCCGA AGAGCTCACC TTCGGCAAGG AAAACATCAC TTCGGGTGCC
TCTTCCGACA TCGAACAGGC CACGAAGCTT GCCCGCGCCA TGGTCACGCA GTGGGGCTTT
TCCGATCAGC TCGGTCAGGT CGCCTATGGC GAGAACCAGC AGGAAGTCTT CCTCGGCCAC
TCGGTTTCGC AGTCGAAGAA TGTTTCGGAA GCAACCGCGC AGAAGATCGA CAATGAAGTG
CGCCGCCTGA TCGACGAAGC CTATACGCAG GCCCGCACGA TCCTGACGGA AAAGCACGAC
GAATTCGTCG CGCTTGCCGA AGGTCTGCTC GAATACGAGA CACTGACCGG CGAAGAGATC
AAGGCGCTGA TCCGCGGCGA AAAGCCTTCC CGCGATCTCG GTGATGATTC GCCGCCAAGC
CGCGGCTCAG CCGTTCCGAA GGCCGGCGCA CGGCCTGCCA CCAAGGGTGA CGAGCCCGAA
GGCGGCCTCG AACCGCAGCC GCATTGA
 
Protein sequence
MLGSSMNPNL RNFALWAIIA LLLIALFSMF QTAPAQTGSR EIPYSQFLRE VDAGRVKDVV 
VTGNRLSGSY VENGTTFQTY SPVIDDSLLD RLQSKNVLVS ARPETDGSSG FLSYLGTLLP
MLLILGVWLF FMRQMQGGSR GAMGFGKSKA KLLTEAHGRV TFEDVAGVDE AKQDLEEIVE
FLRDPQKFQR LGGKIPRGVL LVGPPGTGKT LLARSVAGEA NVPFFTISGS DFVEMFVGVG
ASRVRDMFEQ AKKNAPCIIF IDEIDAVGRH RGAGLGGGND EREQTLNQLL VEMDGFEANE
GVILIAATNR PDVLDPALLR PGRFDRQVVV PNPDIVGRER ILKVHARNVP LAPNVDLKIL
ARGTPGFSGA DLMNLVNEAA LMAARRNKRV VTMQEFEDAK DKIMMGAERR SSAMTEAEKK
LTAYHEAGHA MTALNVAVAD PLHKATIIPR GRALGMVMQL PEGDRYSMSY KWMVSRLCIM
MGGRVAEELT FGKENITSGA SSDIEQATKL ARAMVTQWGF SDQLGQVAYG ENQQEVFLGH
SVSQSKNVSE ATAQKIDNEV RRLIDEAYTQ ARTILTEKHD EFVALAEGLL EYETLTGEEI
KALIRGEKPS RDLGDDSPPS RGSAVPKAGA RPATKGDEPE GGLEPQPH