Gene Rleg2_0522 details


Gene Information

Locus tag: Rleg2_0522
Symbol:
ID: 6979238
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 535715
End bp: 537547
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 63%
IMG OID: 643395234
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_002280045
Protein GI: 209548128
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
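
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes inclusive, 1-based coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 535715
end_bp = 537547

# Assumption: coordinates are inclusive, so the span includes both endpoints.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1833 bp
print(protein_length, "aa")   # 610 aa
```

Both results agree with the listed Gene Length (1833 bp) and Protein Length (610 aa).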


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.133163
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.433558
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCAGA GAATTGCCAT CCGTCTTCTT ACAAGCGCAG CGCTTGCCGC TGTCCTTTCG 
CTGGGCGGTG TCGGCGGCGC GAATGCCGAG GATGCGGCCA AGCCGAGCGA TGTGGCGAAG
ACCGATAGCT TCGATGCCGA TAGCGTTACC ACCTTCTCCG GCGCCTTCCT TGCGGCGCGC
ACGGCCGATG TCGATCATGA CTACGAGACG GCGATCGAAC TCTACAAGAA GGCGCTGCAG
ATCGAGCCCG GCAATCCCGA GATCCGCCAG CGGCTGATGA TCTCGCTGCT GCTCAATGGC
GACATCAAGG ACGGCGTCAA ATATGCCAAC GACCTGAAGG GCGATCCCTC TGTCGAGCGC
ATTACCACGA TCGTGCGCGG CATGGATGCC GTGCGCCGCG ACGATTACAA GACCGCCGAG
AGCATTCTCA AATATAACGG GCCGAACGAT CTCGACCGGA TGATGAACGA CCTGCTGCTC
GCCTGGGCCC GCGTCGGCGC CGGCCGCGGC AAGGAAGCGC TCGCCATGGT CGAGAAGATG
AAGGGGCCGG ACTGGGTCCG CATCTTCCAG AATTATAATG CTGGCGCGAT CGCCATCGCC
ACCGGTGACG TAAAATCCGC CCGGAAGCAT CTGAACGACG CCGTGCTCGA CAAGGAGGGA
GGTGCGACCG CACCCGACAC CTTCATGCGC GCGGTGATGG CGCTTGCCCG TCTAGAAGCG
ACACAAGGCA ATAAGCAGAA GGCGCTCGAC GCCGTTTCCG TCGGCGACAA CCTGCTGCCG
AACTACGCGC CGCTGAACGC GTTGCGCGAC AGTATCGAAA AAGACGAGAA GCAAGAGCAG
CAGGTCAAGA CGGCCGAAGA AGGCGCTGCC GGCGTGCTGT TTTCGGTCGG CGGCGCGCTG
AACCGCGACG GCGCCGAGGA CATCGTCTCG CTTTACCTGC AGACCGCCAA TGCGCTCGAC
CCGAACAGCG CCGATACGCT GGTGCTGCTC GGCGGCATCG CCGAGAAGCA GAACCAGATG
GACCGCGCCA TTGCGCTCTA CAAGAAGGTG CCGGAGAATT CGCCGATGCG GCGCATCTCC
GAGCTGCAGC TCGGCCTTGC CCTTGCCCAG GGCGGCAAGG TGGACGAGGC GCGCAAGCAC
CTGCAGGCGC TGATCGCTTC CGACCCGAAG GACATCCGCA GCTATCTCGC CTATGGCAGC
GTGCTCTCCG ACGCCAAGGA CTACGAGGCG ATGGCGGCCA ATTACGACAA GGCCGTCGAC
GCGATTGGCC CGATTCCCGG CCGCGCCAAC TGGAGCGTCT TCTTCCAGCG CGGCATCGCT
TATGAGCGGC TGAAGAAGTG GGACCAGGCG GAGCCGAATT TCCGCAAGGC CCTCGAACTC
AATCCCGACC AGCCGCAGGT GCTGAACTAT CTCGGCTATT CCTGGATCGA CATGAACCGT
AACCTCGATG AAGGTCTCGG CATGATCAAG AAGGCCGTCG ACCTTCGCCC CGACGACGGC
TACATCATCG ATTCGCTCGG CTGGGCCTAT TTCCGCCTCA ACCGTTTCGA CGATGCCGTC
GACGAATTGG AGCGGGCAGC CCAGATCAAG GCCGGCGACG CGACGATCAA CGACCATCTG
GGTGACGCCT ACTGGCGCGT CGGCCGCAAG CTCGAGGCCG TCTATCAGTG GAACCGGGCG
CTCGCCTCCG AGCCGGAAGC CGCCGAGATC CCGAAGATCA AGGACAAGGT CGCCAATGGC
CTGCCCGCCG TCAGCGACGA TGCCAAGGCG GCCGACAAGA AGCAGCCGGA TCCGGCCCCG
GTCACGCCGC CGCCGGTCGA CAAGAAATCC TGA
 
Protein sequence
MRQRIAIRLL TSAALAAVLS LGGVGGANAE DAAKPSDVAK TDSFDADSVT TFSGAFLAAR 
TADVDHDYET AIELYKKALQ IEPGNPEIRQ RLMISLLLNG DIKDGVKYAN DLKGDPSVER
ITTIVRGMDA VRRDDYKTAE SILKYNGPND LDRMMNDLLL AWARVGAGRG KEALAMVEKM
KGPDWVRIFQ NYNAGAIAIA TGDVKSARKH LNDAVLDKEG GATAPDTFMR AVMALARLEA
TQGNKQKALD AVSVGDNLLP NYAPLNALRD SIEKDEKQEQ QVKTAEEGAA GVLFSVGGAL
NRDGAEDIVS LYLQTANALD PNSADTLVLL GGIAEKQNQM DRAIALYKKV PENSPMRRIS
ELQLGLALAQ GGKVDEARKH LQALIASDPK DIRSYLAYGS VLSDAKDYEA MAANYDKAVD
AIGPIPGRAN WSVFFQRGIA YERLKKWDQA EPNFRKALEL NPDQPQVLNY LGYSWIDMNR
NLDEGLGMIK KAVDLRPDDG YIIDSLGWAY FRLNRFDDAV DELERAAQIK AGDATINDHL
GDAYWRVGRK LEAVYQWNRA LASEPEAAEI PKIKDKVANG LPAVSDDAKA ADKKQPDPAP
VTPPPVDKKS
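
As a rough illustration of how the gene and protein sequences relate, the following sketch strips the 10-base block spacing, computes GC content, and translates codons into amino acids. It uses the standard amino-acid assignments, which translation table 11 shares with table 1 (the tables differ in permitted start codons, which this sketch does not handle). The helper names `clean_sequence`, `gc_percent`, and `translate` are hypothetical and not part of any database tool.

```python
# Codon table built from the NCBI-style layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a cleaned sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGCGGCAGA GAATTGCCAT CCGTCTTCTT")
print(translate(first_blocks))  # MRQRIAIRLL
```

The output matches the first ten residues of the protein sequence. Applying the same helpers to the full gene sequence would allow the listed GC content and the complete protein sequence to be checked.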