Gene Rleg_4019 details


Gene Information

Locus tag: Rleg_4019
Symbol:
ID: 8014825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4095813
End bp: 4096934
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 66%
IMG OID: 644826588
Product: putative DNA-binding/iron metalloprotein/AP endonuclease
Protein accession: YP_002977799
Protein GI: 241206703
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
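
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields above.
START_BP = 4095813
END_BP = 4096934
GENE_LENGTH_BP = 1122
PROTEIN_LENGTH_AA = 373


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold: 4096934 - 4095813 + 1 = 1122 bp, and 1122 / 3 - 1 = 373 aa.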


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.513173
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTCCCT TTCTGCGCAT CCTTGGCATC GAAACGAGCT GCGACGAGAC CGCCGCGGCG 
GTCGTCGAGC GCGATGCTGA GGGGCATTCC AACGTGCTGT CGGACGTGGT GCTTTCCCAA
CTCGACGAAC ATAGCGCCTA TGGCGGCGTG GTGCCCGAGA TCGCCGCACG CGCCCATGTC
GAAGCGCTGG ACGAGCTGAT CGAGGAGGCG CTGAACCGCG CCAATGTGTC GCTCGATGAT
GTCGACGCCA TAGCCGCCAC GTCGGGGCCG GGGCTGATCG GCGGGCTGCT GGTGGGGTTG
ATGACCGGCA AGGCGATCGC CAGAGCTGCC GGCAAGCCGC TCTATGCGAT CAACCATCTC
GAAGGCCATG CGCTGACGGC GCGGCTGACG GACGGGCTTT CCTTTCCCTA TCTGATGCTG
CTCGTCTCCG GCGGCCATAC CCAGCTCATC CTGGTGCGCG GTGTCGGGCA ATACGAACGC
TGGGGCACGA CGATCGACGA TGCGCTGGGC GAAGCCTTCG ACAAGACGGC AAAGCTGCTC
GGCCTGCCCT ATCCCGGCGG CCCGGCGGTG GAGAGGATGG CGCGGGACGG CAATCCCGAT
CGCTTCGATT TTCCGCGGCC GCTGGTCGGC GAGGCAAGGC TCGACTTCTC CTTCTCCGGC
CTGAAGACGG CAGTGCGGCA GGCGGCGCAG GATATCGCGC CGCTCAGCGA TCAGGACGTG
GCGGATATCT GCGCCTCGTT CCAGAAGGCG GTTTCGCGGA CGCTGAAGGA CCGTATCGGC
CGTGGCCTGC AGCGGTTCAA GACGGAAATT CCCGCGACAT TCCCTGCCAC TGGCCCAGCG
ACTGGCGAAA AGCCGGCGCT CGTCGTTGCC GGCGGCGTCG CCGCCAATCT CGAACTGCGC
GGCACACTGC AGGCACTGTG CGACAAGAAC GGCTTCCGCT TCGTCGCGCC ACCGCTGCAC
CTCTGCACCG ACAACGCCGT GATGATCGCC TGGGCAGGAC TGGAACGGAT GGCGACCGGT
GCCGCACCGG ATACGCTCGA CGTGCAGCCG CGTTCGCGAT GGCCGCTCGA TTCCAATGCG
GAAACGCTGA TCGGCTTTGG AAAACGAGGG GCCAAGGCAT GA
 
Protein sequence
MVPFLRILGI ETSCDETAAA VVERDAEGHS NVLSDVVLSQ LDEHSAYGGV VPEIAARAHV 
EALDELIEEA LNRANVSLDD VDAIAATSGP GLIGGLLVGL MTGKAIARAA GKPLYAINHL
EGHALTARLT DGLSFPYLML LVSGGHTQLI LVRGVGQYER WGTTIDDALG EAFDKTAKLL
GLPYPGGPAV ERMARDGNPD RFDFPRPLVG EARLDFSFSG LKTAVRQAAQ DIAPLSDQDV
ADICASFQKA VSRTLKDRIG RGLQRFKTEI PATFPATGPA TGEKPALVVA GGVAANLELR
GTLQALCDKN GFRFVAPPLH LCTDNAVMIA WAGLERMATG AAPDTLDVQP RSRWPLDSNA
ETLIGFGKRG AKA
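
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method used to produce the record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical. Spaces and line breaks in the pasted sequence blocks are stripped before translation.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGGTTCCCT TTCTGCGCAT CCTTGGCATC"))
```

Applied to the first 30 bases of the gene sequence, this yields `MVPFLRILGI`, which matches the first ten residues of the protein sequence. Translating the full gene sequence would additionally produce a trailing `*` for the terminal TGA stop codon.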