Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3698 |
Symbol | |
ID | 6982460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3825767 |
End bp | 3826864 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643398420 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_002283187 |
Protein GI | 209551270 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTCCCT TTCTGCGTAT CCTTGGCATC GAAACGAGCT GCGACGAGAC CGCGGCCGCC GTCGTCGAGC GCGATGCGGA GGGAAATGCC AGAGTGCTCT CCGATGTGGT GCTGTCCCAG CTCGACGAGC ATAGCGCCTA TGGCGGCGTG GTGCCGGAGA TCGCGGCACG CGCCCATGTC GAGGCGCTGG ACGAGCTGAT CGAGGAGGCG CTGAACCGCG CCAATGTGTC GCTTGATGAG GTCGACGCCA TCGCCGCGAC GTCCGGGCCG GGGTTGATCG GCGGCCTGCT GGTGGGGTTG ATGACCGGCA AGGCGATCGC GAGGGCCGCC GGCAAACCGC TCTATGCGGT CAACCATCTC GAAGGCCATG CGCTGACGGC GCGGCTGACC GACGGGCTTG CCTTTCCCTA TCTGATGCTG CTCGTTTCCG GCGGCCATAC CCAGCTGATC CTGGTGCGCG GCGTCGGCGA GTATCAGCGC TGGGGCACGA CGATCGACGA TGCGCTCGGC GAGGCTTTCG ACAAGACGGC AAAGCTGCTC GGTCTGCCCT ATCCCGGCGG CCCGGCGGTG GAACGGATGG CGCGGGACGG CAATCCCGAC CGCTTCGCGT TTCCGCGGCC GCTGGTCGGC GAGGCGCGGC TCGATTTCTC CTTCTCCGGG CTGAAGACGG CGGTGCGGCA GGCGGCACAG GATATCGCGC CGATCAGCGA TCAGGACGTG GCCGATATCT GCGCCTCGTT CCAGAAAGCG ATTTCGCGAA CGCTGAAGGA TCGCATCGGC CGCGGCCTGC AGCGGTTCAA AACGGAATTT GCCGCGACCG ATGAGAAGCC GGCGCTCGTC GTTGCCGGCG GTGTCGCCGC CAATCTCGAA CTGCGCGGCA CGCTGCAGGC GCTCTGCGAC AAAAACGGCT TTCGCTTCAT TGCGCCGCCG CTGCACCTCT GCACCGACAA TGCCGTGATG ATCGCCTGGG CGGGACTGGA GCGCATGGCG ACGGGCGCTG CACCGGATCC GCTCGACGTC CAGCCGCGTT CGCGCTGGCC GCTCGATTCC AATGCGGAAA CGCTGATCGG TTTCGGCAAG AGAGGAGCCA AGGCATGA
|
Protein sequence | MVPFLRILGI ETSCDETAAA VVERDAEGNA RVLSDVVLSQ LDEHSAYGGV VPEIAARAHV EALDELIEEA LNRANVSLDE VDAIAATSGP GLIGGLLVGL MTGKAIARAA GKPLYAVNHL EGHALTARLT DGLAFPYLML LVSGGHTQLI LVRGVGEYQR WGTTIDDALG EAFDKTAKLL GLPYPGGPAV ERMARDGNPD RFAFPRPLVG EARLDFSFSG LKTAVRQAAQ DIAPISDQDV ADICASFQKA ISRTLKDRIG RGLQRFKTEF AATDEKPALV VAGGVAANLE LRGTLQALCD KNGFRFIAPP LHLCTDNAVM IAWAGLERMA TGAAPDPLDV QPRSRWPLDS NAETLIGFGK RGAKA
|
| |