Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0566 |
Symbol | |
ID | 8011754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 587844 |
End bp | 589676 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644823156 |
Product | TPR repeat-containing protein |
Protein accession | YP_002974409 |
Protein GI | 241203313 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCAGA GACTTGCCAT CCGTCTTCTT ACGAGCGCAG CGCTTGCCGC TGTCCTTTCG CTCAGCGGTG TCGGCGGCGC GAACGCCGAG GATGCGGCCA AGACAGACGA TGCCGGCACG GCCGTTCACT TTGATGCGGA CAGCGTCACC ACTTTCTCCG GCGCCTTCCT CGCCGCGCGC ACGGCCGATG TCGATCATGA CTACGAAACC GCGATCGAAC TCTACAAGAA GGCGCTGCAG ATCGAGCCCG GCAATCCCGA GATTCGCCAG CGGCTGATGA TCTCGCTGCT GCTAAATGGC GATATCAAGG ACGGCGTCAA ATATGCCAAC GACCTGAAGG GCGATCCTTC CGTCGAGCGC ATCACCACGA TCGTGCGCGG CATGGATGCC GTGCGCCGCG ACGACTACAA GTCCGCCGAG GCCATCCTGA AATATAAGGG GCCAAACGAT CTCGACCGGA TGATGAACGA CCTGCTGCTC GCCTGGGCCC GCGTCGGCGC CGGCCGCGGT AAGGAAGCGC TCACCATGGT CGAGAAGATG AAGGGCCCGG ACTGGTTCCG CATCTTCCAG AATTACAATG CCGGCGCGAT CGCCATCGTC ACCGGCGACG TGAAATCCGC CCGCCAGCAT CTGAACGATG CCGTGCTCGA CAAGGAAGGC GGCGCGACGG CGCCCGACAC CTTCATGCGC GCCGTGATGG CGCTTGCCCG CCTCGAAGCA ACACAAGGCA ATAAGCAGAA GGCGCTCGAC GCCGTTTCCG TCGGCGACAA CCTGCTGCCG AATTATGCGC CGCTGAACGC GTTGCGCGAC AGTATCGAGA AGAACGAGAA GCAGGACCAG CAGGTCAAGA CGGCCGAAGA GGGTGCTGCC GGCGTGCTCT TTTCGGTCGG CGGCGCGCTG AACCGCGACG GCGCCGAAGA CATCGTCTCG CTGTATCTGC AGACGGCCAA TGCGCTCGAT CCGAACAGCG CCGATACGCT GGTGCTGCTT GGCGGCATTG CCGAGAAGCA GAACCAGATG GACCGCGCCA TCGCGCTCTA CAAGAAGGTG CCGGAGAATT CGCCGATGCG GCGCATCTCC GAGCTGCAGC TCGGCCTTGC GCTTGCCCAG GGCGGCAAGG TGGATGAGGC GCGCAAGCAC CTGCAGGCGC TTATCACCTC CGACCCGAAG GACATCCGCA GTTACCTCGC TTATGGCAGC GTGCTCTCCG ACGCCAAGGA CTACCAGGCG ATGGCCACGA ATTACGACAA GGCCGTCGAA ACGATCGGCC CCATTCCCGG CCGCGCCAAC TGGAGCGTCT TCTTCCAGCG CGGCATCGCC TATGAGCGGC TGAAGAAGTG GGACCAGGCG GAACCGAACT TCCGCAAGGC GCTGGAGCTC AATCCCGACC AGCCGCAGGT GCTGAATTAT CTCGGCTATT CCTGGATCGA CATGAACCGG AACCTCGATG AAGGTCTCGG CATGATCAAG AAGGCCGTCG ACCTTCGCCC TGACGACGGC TACATCATCG ATTCGCTCGG CTGGGCCTAT TTCCGCCTCA ACCGCTTCGA CGATGCCGTC GACGAGCTGG AGCGGGCCGC ACAGATCAAG GCCGGCGACG CGACGATCAA CGACCATCTC GGCGATGCCT ATTGGCGCGT CGGGCGCAAG CTCGAGGCCG TCTACCAGTG GAACCGGGCG CTCGCCTCCG AGCCCGAAGC TGCCGAGATC CCGAAGATCA AGGACAAGGT CGCCAACGGC CTGCCTCCCG CCAGTGACGA TGCCAAGGCG GCCGACAAGA AGCAGCCGGA TCCGGCGCCC GTCACCCCGC CGCCGGTCGA CAAGAAATCC TGA
|
Protein sequence | MRQRLAIRLL TSAALAAVLS LSGVGGANAE DAAKTDDAGT AVHFDADSVT TFSGAFLAAR TADVDHDYET AIELYKKALQ IEPGNPEIRQ RLMISLLLNG DIKDGVKYAN DLKGDPSVER ITTIVRGMDA VRRDDYKSAE AILKYKGPND LDRMMNDLLL AWARVGAGRG KEALTMVEKM KGPDWFRIFQ NYNAGAIAIV TGDVKSARQH LNDAVLDKEG GATAPDTFMR AVMALARLEA TQGNKQKALD AVSVGDNLLP NYAPLNALRD SIEKNEKQDQ QVKTAEEGAA GVLFSVGGAL NRDGAEDIVS LYLQTANALD PNSADTLVLL GGIAEKQNQM DRAIALYKKV PENSPMRRIS ELQLGLALAQ GGKVDEARKH LQALITSDPK DIRSYLAYGS VLSDAKDYQA MATNYDKAVE TIGPIPGRAN WSVFFQRGIA YERLKKWDQA EPNFRKALEL NPDQPQVLNY LGYSWIDMNR NLDEGLGMIK KAVDLRPDDG YIIDSLGWAY FRLNRFDDAV DELERAAQIK AGDATINDHL GDAYWRVGRK LEAVYQWNRA LASEPEAAEI PKIKDKVANG LPPASDDAKA ADKKQPDPAP VTPPPVDKKS
|
| |