Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5359 |
Symbol | |
ID | 6978453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 987805 |
End bp | 989169 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643394461 |
Product | putative polygalacturonase protein |
Protein accession | YP_002279279 |
Protein GI | 209547361 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.071082 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.281824 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGCCG CCTCCCTCGT TTCGATCGAG GCGCTCGACG GCGACAATAC CGACCGCCTG CAGGCGGCGA TCGACGATCT CTCGGCTTCC GGCGGTGGAC GCCTGGAGCT CCTGGCGGGC ATCCACATCT GCCGGGGGCT CCGGTTGCGC TCGGGCGTCG ATCTGCATCT GACCGCCGGG GCGATCCTGC GGCCGGTTCC GGACTACGCA GCCTATGCAC ATACGTCTGT TTCGGTGATC GCCGAGAAGT CGGACCGCGG CATGATCGTC GCCAAGGGCG CGCGGCGGAT CGGCCTGACG GGTCCGGGGC GGATTGAAGC CGGCTGCGAG AGCTTCATCA TCGGGGATGA CGAGACGGTG GGAACCTTTA TCCCGGCGGA ATTCCGTCCC CGCGTCGTCG TCTTCGAGGG CTGCGACGAA GTCGAGATCA GCGCGTTGCA TATCAGCCGC TCGCCAATGT GGACGCTGCA TTTCGTCGAC TGCACCGATG TCGCGGTCCG CAACGTCACC ATCGACAACG ACCGTCGCCT TCCCAATACG GATGGCATCG TGCTCGATGC CTGCCGCGGC GCCGTGATCG AGGATTGCAC CATATCGACG GCCGATGACG GCATATGCCT GAAGACCAGC ATCGGCCCGC AGGGTGTCGC CATCGGGCGA TGCGAGAATA TTGTTATCCG CCGCTGCGCC GTTCAGAGCC TCAGCTGCGC GCTGAAGATC GGCACGGAAA CGCACGGGGA CGTCACCAAT GTCGTCTTCG AGGATTGCAG CGTTTCATCT TCCAACCGGG CGCTCGGTAT CTTCTCACGC GACGGCGGCC GGATCTCGAA CGTCAGGTTT TCGCGGATTG CTGTGGAGTG CCGCGAAACG CTCGACGGCT TCTGGGGCTC GGGAGAGGCG CTGACCGTCA ACGTCGTCGA CCGCGTCGCT GAGCGCCCGG CAGGCGCCAT CGAAAATCTC ATTGTCGAGG ACATTGCCGG GCGTATGGAA GGGGCGATCA CCGTCATTTC GGCTTCGCCC GCCAGCATCC GCAATGTATC GCTGGCGCGC ATCGGCCTGG ATCAACGGCC CGGCGAACTC GGCACCGCGC AGTCCTACGA CCTGCGTCCG ACAAACGCGG ACCTTGCGCC GAAGGCAGAC GGTGGCGGCC GCGCCAATGC CTGGACGCGC GGGGCGGACG GGCGGGTGAT CGGCCTGCAG GACTATCCGG GCGGAATGCC CGCCGTCTAC GTGGCTGATG TCACCGGGAT ATTGATGAAC GAGGTGCGGA TTAAGAGACC GACACCGCTG CCGCAAGGCT GGAACGCAAT CGACGTCGTC TTCGAGACGG CGGCACCCGA TGGGAGTGGG GCATGGCAGA ACTGA
|
Protein sequence | MSAASLVSIE ALDGDNTDRL QAAIDDLSAS GGGRLELLAG IHICRGLRLR SGVDLHLTAG AILRPVPDYA AYAHTSVSVI AEKSDRGMIV AKGARRIGLT GPGRIEAGCE SFIIGDDETV GTFIPAEFRP RVVVFEGCDE VEISALHISR SPMWTLHFVD CTDVAVRNVT IDNDRRLPNT DGIVLDACRG AVIEDCTIST ADDGICLKTS IGPQGVAIGR CENIVIRRCA VQSLSCALKI GTETHGDVTN VVFEDCSVSS SNRALGIFSR DGGRISNVRF SRIAVECRET LDGFWGSGEA LTVNVVDRVA ERPAGAIENL IVEDIAGRME GAITVISASP ASIRNVSLAR IGLDQRPGEL GTAQSYDLRP TNADLAPKAD GGGRANAWTR GADGRVIGLQ DYPGGMPAVY VADVTGILMN EVRIKRPTPL PQGWNAIDVV FETAAPDGSG AWQN
|
| |