Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4019 |
Symbol | |
ID | 8014825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4095813 |
End bp | 4096934 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644826588 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_002977799 |
Protein GI | 241206703 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.513173 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTCCCT TTCTGCGCAT CCTTGGCATC GAAACGAGCT GCGACGAGAC CGCCGCGGCG GTCGTCGAGC GCGATGCTGA GGGGCATTCC AACGTGCTGT CGGACGTGGT GCTTTCCCAA CTCGACGAAC ATAGCGCCTA TGGCGGCGTG GTGCCCGAGA TCGCCGCACG CGCCCATGTC GAAGCGCTGG ACGAGCTGAT CGAGGAGGCG CTGAACCGCG CCAATGTGTC GCTCGATGAT GTCGACGCCA TAGCCGCCAC GTCGGGGCCG GGGCTGATCG GCGGGCTGCT GGTGGGGTTG ATGACCGGCA AGGCGATCGC CAGAGCTGCC GGCAAGCCGC TCTATGCGAT CAACCATCTC GAAGGCCATG CGCTGACGGC GCGGCTGACG GACGGGCTTT CCTTTCCCTA TCTGATGCTG CTCGTCTCCG GCGGCCATAC CCAGCTCATC CTGGTGCGCG GTGTCGGGCA ATACGAACGC TGGGGCACGA CGATCGACGA TGCGCTGGGC GAAGCCTTCG ACAAGACGGC AAAGCTGCTC GGCCTGCCCT ATCCCGGCGG CCCGGCGGTG GAGAGGATGG CGCGGGACGG CAATCCCGAT CGCTTCGATT TTCCGCGGCC GCTGGTCGGC GAGGCAAGGC TCGACTTCTC CTTCTCCGGC CTGAAGACGG CAGTGCGGCA GGCGGCGCAG GATATCGCGC CGCTCAGCGA TCAGGACGTG GCGGATATCT GCGCCTCGTT CCAGAAGGCG GTTTCGCGGA CGCTGAAGGA CCGTATCGGC CGTGGCCTGC AGCGGTTCAA GACGGAAATT CCCGCGACAT TCCCTGCCAC TGGCCCAGCG ACTGGCGAAA AGCCGGCGCT CGTCGTTGCC GGCGGCGTCG CCGCCAATCT CGAACTGCGC GGCACACTGC AGGCACTGTG CGACAAGAAC GGCTTCCGCT TCGTCGCGCC ACCGCTGCAC CTCTGCACCG ACAACGCCGT GATGATCGCC TGGGCAGGAC TGGAACGGAT GGCGACCGGT GCCGCACCGG ATACGCTCGA CGTGCAGCCG CGTTCGCGAT GGCCGCTCGA TTCCAATGCG GAAACGCTGA TCGGCTTTGG AAAACGAGGG GCCAAGGCAT GA
|
Protein sequence | MVPFLRILGI ETSCDETAAA VVERDAEGHS NVLSDVVLSQ LDEHSAYGGV VPEIAARAHV EALDELIEEA LNRANVSLDD VDAIAATSGP GLIGGLLVGL MTGKAIARAA GKPLYAINHL EGHALTARLT DGLSFPYLML LVSGGHTQLI LVRGVGQYER WGTTIDDALG EAFDKTAKLL GLPYPGGPAV ERMARDGNPD RFDFPRPLVG EARLDFSFSG LKTAVRQAAQ DIAPLSDQDV ADICASFQKA VSRTLKDRIG RGLQRFKTEI PATFPATGPA TGEKPALVVA GGVAANLELR GTLQALCDKN GFRFVAPPLH LCTDNAVMIA WAGLERMATG AAPDTLDVQP RSRWPLDSNA ETLIGFGKRG AKA
|
| |