Gene Rleg2_3698 details


Gene Information

Locus tag: Rleg2_3698
Symbol:
ID: 6982460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3825767
End bp: 3826864
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 66%
IMG OID: 643398420
Product: putative DNA-binding/iron metalloprotein/AP endonuclease
Protein accession: YP_002283187
Protein GI: 209551270
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
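
The reported gene and protein lengths can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming the last codon is an untranslated stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(3825767, 3826864)
print(length)                     # 1098
print(protein_length_aa(length))  # 365
```

Both results agree with the Gene Length and Protein Length fields listed in the record.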


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTCCCT TTCTGCGTAT CCTTGGCATC GAAACGAGCT GCGACGAGAC CGCGGCCGCC 
GTCGTCGAGC GCGATGCGGA GGGAAATGCC AGAGTGCTCT CCGATGTGGT GCTGTCCCAG
CTCGACGAGC ATAGCGCCTA TGGCGGCGTG GTGCCGGAGA TCGCGGCACG CGCCCATGTC
GAGGCGCTGG ACGAGCTGAT CGAGGAGGCG CTGAACCGCG CCAATGTGTC GCTTGATGAG
GTCGACGCCA TCGCCGCGAC GTCCGGGCCG GGGTTGATCG GCGGCCTGCT GGTGGGGTTG
ATGACCGGCA AGGCGATCGC GAGGGCCGCC GGCAAACCGC TCTATGCGGT CAACCATCTC
GAAGGCCATG CGCTGACGGC GCGGCTGACC GACGGGCTTG CCTTTCCCTA TCTGATGCTG
CTCGTTTCCG GCGGCCATAC CCAGCTGATC CTGGTGCGCG GCGTCGGCGA GTATCAGCGC
TGGGGCACGA CGATCGACGA TGCGCTCGGC GAGGCTTTCG ACAAGACGGC AAAGCTGCTC
GGTCTGCCCT ATCCCGGCGG CCCGGCGGTG GAACGGATGG CGCGGGACGG CAATCCCGAC
CGCTTCGCGT TTCCGCGGCC GCTGGTCGGC GAGGCGCGGC TCGATTTCTC CTTCTCCGGG
CTGAAGACGG CGGTGCGGCA GGCGGCACAG GATATCGCGC CGATCAGCGA TCAGGACGTG
GCCGATATCT GCGCCTCGTT CCAGAAAGCG ATTTCGCGAA CGCTGAAGGA TCGCATCGGC
CGCGGCCTGC AGCGGTTCAA AACGGAATTT GCCGCGACCG ATGAGAAGCC GGCGCTCGTC
GTTGCCGGCG GTGTCGCCGC CAATCTCGAA CTGCGCGGCA CGCTGCAGGC GCTCTGCGAC
AAAAACGGCT TTCGCTTCAT TGCGCCGCCG CTGCACCTCT GCACCGACAA TGCCGTGATG
ATCGCCTGGG CGGGACTGGA GCGCATGGCG ACGGGCGCTG CACCGGATCC GCTCGACGTC
CAGCCGCGTT CGCGCTGGCC GCTCGATTCC AATGCGGAAA CGCTGATCGG TTTCGGCAAG
AGAGGAGCCA AGGCATGA
 
Protein sequence
MVPFLRILGI ETSCDETAAA VVERDAEGNA RVLSDVVLSQ LDEHSAYGGV VPEIAARAHV 
EALDELIEEA LNRANVSLDE VDAIAATSGP GLIGGLLVGL MTGKAIARAA GKPLYAVNHL
EGHALTARLT DGLAFPYLML LVSGGHTQLI LVRGVGEYQR WGTTIDDALG EAFDKTAKLL
GLPYPGGPAV ERMARDGNPD RFAFPRPLVG EARLDFSFSG LKTAVRQAAQ DIAPISDQDV
ADICASFQKA ISRTLKDRIG RGLQRFKTEF AATDEKPALV VAGGVAANLE LRGTLQALCD
KNGFRFIAPP LHLCTDNAVM IAWAGLERMA TGAAPDPLDV QPRSRWPLDS NAETLIGFGK
RGAKA
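
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ only in which codons may act as start codons), so the sketch does not handle alternative start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, formatted as in the record.
print(translate("ATGGTTCCCT TTCTGCGTAT CCTTGGCATC"))  # MVPFLRILGI
# Last 18 bases, ending in the TGA stop codon.
print(translate("AGAGGAGCCA AGGCATGA"))               # RGAKA
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_fraction` applied to the same input can be compared with the GC content field in the record.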