Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0522 |
Symbol | |
ID | 6979238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 535715 |
End bp | 537547 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643395234 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_002280045 |
Protein GI | 209548128 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.133163 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.433558 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCAGA GAATTGCCAT CCGTCTTCTT ACAAGCGCAG CGCTTGCCGC TGTCCTTTCG CTGGGCGGTG TCGGCGGCGC GAATGCCGAG GATGCGGCCA AGCCGAGCGA TGTGGCGAAG ACCGATAGCT TCGATGCCGA TAGCGTTACC ACCTTCTCCG GCGCCTTCCT TGCGGCGCGC ACGGCCGATG TCGATCATGA CTACGAGACG GCGATCGAAC TCTACAAGAA GGCGCTGCAG ATCGAGCCCG GCAATCCCGA GATCCGCCAG CGGCTGATGA TCTCGCTGCT GCTCAATGGC GACATCAAGG ACGGCGTCAA ATATGCCAAC GACCTGAAGG GCGATCCCTC TGTCGAGCGC ATTACCACGA TCGTGCGCGG CATGGATGCC GTGCGCCGCG ACGATTACAA GACCGCCGAG AGCATTCTCA AATATAACGG GCCGAACGAT CTCGACCGGA TGATGAACGA CCTGCTGCTC GCCTGGGCCC GCGTCGGCGC CGGCCGCGGC AAGGAAGCGC TCGCCATGGT CGAGAAGATG AAGGGGCCGG ACTGGGTCCG CATCTTCCAG AATTATAATG CTGGCGCGAT CGCCATCGCC ACCGGTGACG TAAAATCCGC CCGGAAGCAT CTGAACGACG CCGTGCTCGA CAAGGAGGGA GGTGCGACCG CACCCGACAC CTTCATGCGC GCGGTGATGG CGCTTGCCCG TCTAGAAGCG ACACAAGGCA ATAAGCAGAA GGCGCTCGAC GCCGTTTCCG TCGGCGACAA CCTGCTGCCG AACTACGCGC CGCTGAACGC GTTGCGCGAC AGTATCGAAA AAGACGAGAA GCAAGAGCAG CAGGTCAAGA CGGCCGAAGA AGGCGCTGCC GGCGTGCTGT TTTCGGTCGG CGGCGCGCTG AACCGCGACG GCGCCGAGGA CATCGTCTCG CTTTACCTGC AGACCGCCAA TGCGCTCGAC CCGAACAGCG CCGATACGCT GGTGCTGCTC GGCGGCATCG CCGAGAAGCA GAACCAGATG GACCGCGCCA TTGCGCTCTA CAAGAAGGTG CCGGAGAATT CGCCGATGCG GCGCATCTCC GAGCTGCAGC TCGGCCTTGC CCTTGCCCAG GGCGGCAAGG TGGACGAGGC GCGCAAGCAC CTGCAGGCGC TGATCGCTTC CGACCCGAAG GACATCCGCA GCTATCTCGC CTATGGCAGC GTGCTCTCCG ACGCCAAGGA CTACGAGGCG ATGGCGGCCA ATTACGACAA GGCCGTCGAC GCGATTGGCC CGATTCCCGG CCGCGCCAAC TGGAGCGTCT TCTTCCAGCG CGGCATCGCT TATGAGCGGC TGAAGAAGTG GGACCAGGCG GAGCCGAATT TCCGCAAGGC CCTCGAACTC AATCCCGACC AGCCGCAGGT GCTGAACTAT CTCGGCTATT CCTGGATCGA CATGAACCGT AACCTCGATG AAGGTCTCGG CATGATCAAG AAGGCCGTCG ACCTTCGCCC CGACGACGGC TACATCATCG ATTCGCTCGG CTGGGCCTAT TTCCGCCTCA ACCGTTTCGA CGATGCCGTC GACGAATTGG AGCGGGCAGC CCAGATCAAG GCCGGCGACG CGACGATCAA CGACCATCTG GGTGACGCCT ACTGGCGCGT CGGCCGCAAG CTCGAGGCCG TCTATCAGTG GAACCGGGCG CTCGCCTCCG AGCCGGAAGC CGCCGAGATC CCGAAGATCA AGGACAAGGT CGCCAATGGC CTGCCCGCCG TCAGCGACGA TGCCAAGGCG GCCGACAAGA AGCAGCCGGA TCCGGCCCCG GTCACGCCGC CGCCGGTCGA CAAGAAATCC TGA
|
Protein sequence | MRQRIAIRLL TSAALAAVLS LGGVGGANAE DAAKPSDVAK TDSFDADSVT TFSGAFLAAR TADVDHDYET AIELYKKALQ IEPGNPEIRQ RLMISLLLNG DIKDGVKYAN DLKGDPSVER ITTIVRGMDA VRRDDYKTAE SILKYNGPND LDRMMNDLLL AWARVGAGRG KEALAMVEKM KGPDWVRIFQ NYNAGAIAIA TGDVKSARKH LNDAVLDKEG GATAPDTFMR AVMALARLEA TQGNKQKALD AVSVGDNLLP NYAPLNALRD SIEKDEKQEQ QVKTAEEGAA GVLFSVGGAL NRDGAEDIVS LYLQTANALD PNSADTLVLL GGIAEKQNQM DRAIALYKKV PENSPMRRIS ELQLGLALAQ GGKVDEARKH LQALIASDPK DIRSYLAYGS VLSDAKDYEA MAANYDKAVD AIGPIPGRAN WSVFFQRGIA YERLKKWDQA EPNFRKALEL NPDQPQVLNY LGYSWIDMNR NLDEGLGMIK KAVDLRPDDG YIIDSLGWAY FRLNRFDDAV DELERAAQIK AGDATINDHL GDAYWRVGRK LEAVYQWNRA LASEPEAAEI PKIKDKVANG LPAVSDDAKA ADKKQPDPAP VTPPPVDKKS
|
| |