Gene Rleg2_5359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5359 
Symbol 
ID6978453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp987805 
End bp989169 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content65% 
IMG OID643394461 
Productputative polygalacturonase protein 
Protein accessionYP_002279279 
Protein GI209547361 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.071082 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.281824 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCCG CCTCCCTCGT TTCGATCGAG GCGCTCGACG GCGACAATAC CGACCGCCTG 
CAGGCGGCGA TCGACGATCT CTCGGCTTCC GGCGGTGGAC GCCTGGAGCT CCTGGCGGGC
ATCCACATCT GCCGGGGGCT CCGGTTGCGC TCGGGCGTCG ATCTGCATCT GACCGCCGGG
GCGATCCTGC GGCCGGTTCC GGACTACGCA GCCTATGCAC ATACGTCTGT TTCGGTGATC
GCCGAGAAGT CGGACCGCGG CATGATCGTC GCCAAGGGCG CGCGGCGGAT CGGCCTGACG
GGTCCGGGGC GGATTGAAGC CGGCTGCGAG AGCTTCATCA TCGGGGATGA CGAGACGGTG
GGAACCTTTA TCCCGGCGGA ATTCCGTCCC CGCGTCGTCG TCTTCGAGGG CTGCGACGAA
GTCGAGATCA GCGCGTTGCA TATCAGCCGC TCGCCAATGT GGACGCTGCA TTTCGTCGAC
TGCACCGATG TCGCGGTCCG CAACGTCACC ATCGACAACG ACCGTCGCCT TCCCAATACG
GATGGCATCG TGCTCGATGC CTGCCGCGGC GCCGTGATCG AGGATTGCAC CATATCGACG
GCCGATGACG GCATATGCCT GAAGACCAGC ATCGGCCCGC AGGGTGTCGC CATCGGGCGA
TGCGAGAATA TTGTTATCCG CCGCTGCGCC GTTCAGAGCC TCAGCTGCGC GCTGAAGATC
GGCACGGAAA CGCACGGGGA CGTCACCAAT GTCGTCTTCG AGGATTGCAG CGTTTCATCT
TCCAACCGGG CGCTCGGTAT CTTCTCACGC GACGGCGGCC GGATCTCGAA CGTCAGGTTT
TCGCGGATTG CTGTGGAGTG CCGCGAAACG CTCGACGGCT TCTGGGGCTC GGGAGAGGCG
CTGACCGTCA ACGTCGTCGA CCGCGTCGCT GAGCGCCCGG CAGGCGCCAT CGAAAATCTC
ATTGTCGAGG ACATTGCCGG GCGTATGGAA GGGGCGATCA CCGTCATTTC GGCTTCGCCC
GCCAGCATCC GCAATGTATC GCTGGCGCGC ATCGGCCTGG ATCAACGGCC CGGCGAACTC
GGCACCGCGC AGTCCTACGA CCTGCGTCCG ACAAACGCGG ACCTTGCGCC GAAGGCAGAC
GGTGGCGGCC GCGCCAATGC CTGGACGCGC GGGGCGGACG GGCGGGTGAT CGGCCTGCAG
GACTATCCGG GCGGAATGCC CGCCGTCTAC GTGGCTGATG TCACCGGGAT ATTGATGAAC
GAGGTGCGGA TTAAGAGACC GACACCGCTG CCGCAAGGCT GGAACGCAAT CGACGTCGTC
TTCGAGACGG CGGCACCCGA TGGGAGTGGG GCATGGCAGA ACTGA
 
Protein sequence
MSAASLVSIE ALDGDNTDRL QAAIDDLSAS GGGRLELLAG IHICRGLRLR SGVDLHLTAG 
AILRPVPDYA AYAHTSVSVI AEKSDRGMIV AKGARRIGLT GPGRIEAGCE SFIIGDDETV
GTFIPAEFRP RVVVFEGCDE VEISALHISR SPMWTLHFVD CTDVAVRNVT IDNDRRLPNT
DGIVLDACRG AVIEDCTIST ADDGICLKTS IGPQGVAIGR CENIVIRRCA VQSLSCALKI
GTETHGDVTN VVFEDCSVSS SNRALGIFSR DGGRISNVRF SRIAVECRET LDGFWGSGEA
LTVNVVDRVA ERPAGAIENL IVEDIAGRME GAITVISASP ASIRNVSLAR IGLDQRPGEL
GTAQSYDLRP TNADLAPKAD GGGRANAWTR GADGRVIGLQ DYPGGMPAVY VADVTGILMN
EVRIKRPTPL PQGWNAIDVV FETAAPDGSG AWQN