Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0194 |
Symbol | |
ID | 8011424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 194836 |
End bp | 196695 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644822787 |
Product | AAA ATPase central domain protein |
Protein accession | YP_002974044 |
Protein GI | 241202948 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.223838 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATCATC ACCTCGATAG GCGGGTCTTT GCGGAAACCA GCAGCAACAC GGCAAAGTTC TTGATGCTTT GCGCTCTTCG CCAGGCCCTT CGCGGAACGG ATCAGTTCAA GGTCGATGCG AAAGGCATCA TCGTCTGCAT CGTCGACAAG GCTTGGCTCT GGTATGCGAA GTCCGCGGCG AATGTCCTTA TGACGGGCGG CAAAGTGCAT TCGTACTACG ACGACTTCCG TCGCCCAGTG CATATCATCG GTGAGTCGCC CCGGCGTACG ACGTCGAAGG GCGTCGAAGA TTCCACAGTC TTTCGTCCGG AGCACCAGGT CATCTATCTT GCGTCGTCTT TAGACGCCGT GAGGCTTTCG CTGCAGTTAG CTGCTGACGT GATTGTCAAC ATCTTGCCGC CCACGGCGAG ACACGTGATC GCGGCGCGTA AGGTGCTCGG CGTCGAAGAC GTTGACATGC CGCTTGCAGA AGCGATCGCG CAGCAGTCGG CGGAGATCGT CATCGGGCTT ACCGCAAGGA ACTCGCTCAA GGGCCTCGAC ATAGCGACGT TGACGAAGCC GATCGCGGTA CCTGAGCGTT CGCACAAGCT GTCGGAACTG CCTGGATACG GTCCAGCACG TCCTTGGGTC GATGCCATCA AGCAGGACGT CGCCGATTGG CGGGAGGGTA AACTTCCCTG GACGGATGTG GACAGAGGAA TCCTTCTCCT GGGCGCGCCC GGAACCGGCA AGACGCTTTT CGCCACGGCA CTCGCCAACG AGCTCGGATT CGATCTGGTC TTGACGTCGG TGGGCGCATG GCAGGGATCC AATAACGGCT ACCTTGGCGA CATGCTTGCC GCGATGTCCA AGTCCTTCGC CGACGCCACG GCGCGGCGCG GAGCTGTGTT GCTCGTGGAC GAACTCGATG CCATCGGAGA TCGCGCCACG ATGCGGGGAG ATCATGCCTT CTACGAAGGC AACGTCATCG GCAGATTTCT TGAACTGACG ACGCATGCGC TCGAACAGCC CGGCACAATC ATCGTCGGAG CTACAAACTA CGGACACCTC ATCGACAACG CTGTCCTGCG ATCCGGACGC CTCGAGAAAC ACGTGTACCT TGAGTTGCCC GAGGACGAGG AGCGCGCGGA AATCCTCGCC TATCATTTTA ATCAGGCCCT ACCTGCGAAG GACCTGCGTG AAATCACGGA CAAGCTAAGG CTCGTCACAC CCGCCGACCT GGAAAAGCTT GCGCGAGCGG CAAAACGAGC GGCGAGGATT CGAAAGGGGC TTCTTAGTAT CCAGGACGTC AAAGCCATTC TTCCAGCGCA GGTTCCGCTT CCCGAAGCCG TCGTTCACCG CATCTGTGTG CATGAGATCG GTCACGCACT TATGGCGATG GCATCGGGGT CAGCAGATGT GATCAGCATC AGAGTCGAAT CCCATATGGT GGAGGGCCAG TTCGTGCAGG ACGGCGGTCG GCTGCATTAC AAAATACACA ATGAAGCGCT TCCGTCGGAC AAGGATTTGC TCGCCAAGAT CAGGATCATG CTGGGCGGGA CCGCGGCTGA GGAAGTTGTG TTCGGCAACA GATCCATAGG CGCTGGCGGC GTCGAAGGGA GCGATCTGGA CCAGGCTACC CGGCTCGCTT ACCGGCTGGT TGGCAGCTAT GGCCTGGGGA AATGGCTTCG TTACCAGATG GGTGCAAATC GCGTGGACGA AACCTTTGTA CCGGCGCCAG AGCTTCGAGC CGAAGTCGAT GGGATCCTTG CGCGGGAATA TCGGGCGACG AAGGAGTTGC TCAGCAAGGA AAAGGCTCAT CTCATGCGGC TCGCCGCCGA ACTCGTTGTC GATCGAAAAT TGCTGATCGA CAAAAAATGA
|
Protein sequence | MYHHLDRRVF AETSSNTAKF LMLCALRQAL RGTDQFKVDA KGIIVCIVDK AWLWYAKSAA NVLMTGGKVH SYYDDFRRPV HIIGESPRRT TSKGVEDSTV FRPEHQVIYL ASSLDAVRLS LQLAADVIVN ILPPTARHVI AARKVLGVED VDMPLAEAIA QQSAEIVIGL TARNSLKGLD IATLTKPIAV PERSHKLSEL PGYGPARPWV DAIKQDVADW REGKLPWTDV DRGILLLGAP GTGKTLFATA LANELGFDLV LTSVGAWQGS NNGYLGDMLA AMSKSFADAT ARRGAVLLVD ELDAIGDRAT MRGDHAFYEG NVIGRFLELT THALEQPGTI IVGATNYGHL IDNAVLRSGR LEKHVYLELP EDEERAEILA YHFNQALPAK DLREITDKLR LVTPADLEKL ARAAKRAARI RKGLLSIQDV KAILPAQVPL PEAVVHRICV HEIGHALMAM ASGSADVISI RVESHMVEGQ FVQDGGRLHY KIHNEALPSD KDLLAKIRIM LGGTAAEEVV FGNRSIGAGG VEGSDLDQAT RLAYRLVGSY GLGKWLRYQM GANRVDETFV PAPELRAEVD GILAREYRAT KELLSKEKAH LMRLAAELVV DRKLLIDKK
|
| |