Gene Information |
Locus tag | Rleg2_3196 |
Symbol | |
ID | 6981948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3284154 |
End bp | 3286085 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643397913 |
Product | ATP-dependent metalloprotease FtsH |
Protein accession | YP_002282689 |
Protein GI | 209550772 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.766379 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCTA ACTTACGTAA TTTCGCCTTG TGGGCGATCA TAGCGCTTCT GCTGATCGCC CTTTTCAGTA TGTTCCAGAC GGCGCCGGCG CAGACGGGCT CCCGCGAAAT CCCTTATTCG CAGTTCCTGC GTGAGGTCGA TGCGGGCCGC GTGAAGGACG TCGTCGTCAC CGGCAATCGG CTGAGCGGAA CGTATCTGGA GAATAACAAT AATTTCCAGA CCTATTCGCC CGTTATCGAC GACAACTTGC TCGATCGCCT GCAGGCCAAA AACGTTGCTG TCACTGCGCG TCCGGAGACC GATGGTTCCT CCGGCTTTCT GAGCTACCTC GGAACGCTGC TGCCGATGCT GCTCATCCTC GGCGTCTGGC TGTTCTTCAT GCGGCAGATG CAGGGCGGCT CGCGTGGCGC GATGGGCTTC GGCAAGTCGA AGGCCAAGCT GCTGACAGAA GCACACGGCC GTGTCACCTT CGAAGACGTC GCCGGTGTCG ACGAGGCCAA GCAGGATCTC GAAGAAATCG TCGAATTCCT GCGCGATCCG CAGAAGTTCC AGCGTCTCGG CGGCAAGATT CCGCGCGGCG TGTTGCTCGT CGGCCCTCCG GGTACCGGCA AGACGCTGCT CGCCCGCTCG GTTGCCGGCG AAGCCAACGT GCCCTTCTTC ACCATTTCCG GTTCCGACTT CGTCGAAATG TTCGTCGGCG TCGGCGCTAG CCGCGTGCGC GATATGTTCG AGCAGGCGAA GAAGAATGCG CCCTGCATCA TCTTCATCGA CGAAATCGAT GCCGTCGGCC GCCATCGCGG CGCCGGTCTC GGCGGCGGCA ATGACGAACG CGAACAGACG CTGAACCAGC TGCTGGTCGA GATGGATGGC TTCGAGGCGA ATGAAGGCGT GATCCTGATT GCCGCCACCA ACCGCCCCGA CGTGCTCGAC CCGGCGCTGC TGCGTCCCGG CCGTTTCGAC CGCCAGGTCG TGGTTCCGAA CCCGGATATC GTCGGCCGCG AGCGCATCCT CAAGGTGCAT GCCCGCAACG TTCCGCTGGC GCCGAATGTC GATCTCAAGG TTCTCGCCCG CGGCACGCCC GGCTTCTCCG GCGCCGATCT GATGAACCTC GTCAACGAAG CCGCCCTCAT GGCCGCCCGC CGCAACAAGC GTGTCGTCAC CATGCAGGAA TTCGAAGACG CCAAGGACAA GATCATGATG GGCGCCGAGC GCCGATCCTC AGCCATGACC GAGGCGGAGA AGAAGCTCAC CGCTTACCAT GAGGCCGGTC ACGCCATCAC CGCGCTTAAT GTCGCCGTCG CCGATCCGCT GCACAAGGCG ACGATCATTC CACGCGGTCG TGCGCTTGGC ATGGTCATGC AGCTTCCCGA GGGCGACCGC TACTCGATGA GCTACAAGTG GATGGTGTCG CGCCTCTGCA TCATGATGGG CGGTCGTGTC GCCGAAGAAC TGACCTTCGG CAAGGAGAAC ATCACCTCGG GTGCCTCCTC CGACATCGAG CAGGCCACCA AGCTTGCCCG CGCCATGGTC ACGCAATGGG GCTTCTCCGA CCAGCTCGGT CAGGTCGCCT ATGGCGAGAA CCAGCAGGAG GTCTTCCTCG GTCACTCGGT TTCGCAGTCG AAGAATGTTT CGGAAGCGAC CGCGCAGAAG ATCGACAATG AAGTGCGCCG CCTGATCGAC GAGGCCTATA CGCAGGCCCG CACGATCCTG ACGGATAAGC ACGACGAGTT CGTCGCTCTT GCCGAAGGTC TGCTCGAATA CGAGACGCTG ACCGGCGAAG AGATCAAGGC GCTGATCCGC 
GGCGAGAAGC CGTCGCGCGA TCTCGGCGAC GATTCACCGC CAAGCCGCGG CTCGGCGGTT CCGAAAGCCG GCGCACGGCC TGCCGCCAAG GGTGATGAGC CCGAAGCCGG CCTTGAACCG CAGCCGCATT GA
|
Protein sequence | MNPNLRNFAL WAIIALLLIA LFSMFQTAPA QTGSREIPYS QFLREVDAGR VKDVVVTGNR LSGTYLENNN NFQTYSPVID DNLLDRLQAK NVAVTARPET DGSSGFLSYL GTLLPMLLIL GVWLFFMRQM QGGSRGAMGF GKSKAKLLTE AHGRVTFEDV AGVDEAKQDL EEIVEFLRDP QKFQRLGGKI PRGVLLVGPP GTGKTLLARS VAGEANVPFF TISGSDFVEM FVGVGASRVR DMFEQAKKNA PCIIFIDEID AVGRHRGAGL GGGNDEREQT LNQLLVEMDG FEANEGVILI AATNRPDVLD PALLRPGRFD RQVVVPNPDI VGRERILKVH ARNVPLAPNV DLKVLARGTP GFSGADLMNL VNEAALMAAR RNKRVVTMQE FEDAKDKIMM GAERRSSAMT EAEKKLTAYH EAGHAITALN VAVADPLHKA TIIPRGRALG MVMQLPEGDR YSMSYKWMVS RLCIMMGGRV AEELTFGKEN ITSGASSDIE QATKLARAMV TQWGFSDQLG QVAYGENQQE VFLGHSVSQS KNVSEATAQK IDNEVRRLID EAYTQARTIL TDKHDEFVAL AEGLLEYETL TGEEIKALIR GEKPSRDLGD DSPPSRGSAV PKAGARPAAK GDEPEAGLEP QPH
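The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The helper names `cds_lengths` and `join_blocks` are hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the last codon
    is a stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


def join_blocks(seq: str) -> str:
    """Remove the whitespace between the 10-character display blocks."""
    return "".join(seq.split()).upper()


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(3284154, 3286085)
print(gene_len, protein_len)  # 1932 643

# First two display blocks of the gene sequence field.
print(join_blocks("ATGAACCCTA ACTTACGTAA"))  # ATGAACCCTAACTTACGTAA
```

Under these assumptions the result agrees with the Gene Length (1932 bp) and Protein Length (643 aa) fields above.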