Gene Information |
Locus tag | Rleg_5079 |
Symbol | |
ID | 8007672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 465535 |
End bp | 466899 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644821994 |
Product | Parallel beta-helix repeat protein |
Protein accession | YP_002973254 |
Protein GI | 241113419 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
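The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (both assumptions agree with the values listed, but the record does not state them). The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(465535, 466899)
print(length, protein_length_aa(length))  # prints: 1365 454
```

Both results match the Gene Length and Protein Length fields of this record.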
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.416725 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.142328 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCCCG TGTCTCTTGT CGCGATCGAG CCGGCGCAGA GCGACGATAC GGTCCGTCTG CAGGCGGCCA TCGACGGCCT GTCGGCTTCA GGTGGCGGAC GCGTTGAGCT CATGGCCGGC ATCCATGTCT GCCGCGGGCT CCAGCTCAGA TCCGGCGTCG ACCTTCATCT GGCCGCCGGT GCGATCCTGC GTCCCGTTCC GGATTACGCG GCCTATGCGC AGACGACCGT TTCGGTGATC GCCGAAAAAT CCGACCGCGG CATGATCGTC GCCAAGGATG CGCGGCGAAT CAGCCTGACG GGGGCGGGGC GCATCGAAGC CGGTTGTGAC AGCTTCATCG TGGGAGACGA CGAGACGGTG GGAACCTTCA TCCCGGCCGA ATTTCGTCCC CGCGTCGTCG TCTTCGAGGG CTGCGACGAG GTCGAGATCA GCTCCGTCCA TATTTGCCGC TCGCCGATGT GGACGCTGCA CTTCGTCAAC TGCACCGATG TTGCGGTCAG GAACGTGATC ATCGACAACG ACCGCCGCCT TCCCAACACG GACGGGATCG TGCTGGATGC CTGCCGCGGC GCCGTTATCG AGGATTGCCG GATATCGACG GCCGACGACG GCATATGCCT GAAGACGAGC ATCGGTCCTG ACCGTGTCGC CATCGGGCGT TGCGAAAACA TTCTCGTCCG CAGATGCTCG GTTCAGAGCC TTAGCTGCGC GCTGAAGATC GGCACGGAAA CGCATGGCGA TGTCACTAAT GTCGTGTTCG AGGATTGCAG CGTCTCATCT TCCAACAGGG CCCTCGGCGT ATTCTCCCGC GACGGCGGCC GGATATCGAA CGTGAGGTTC TCACGAATTG CGGTGGAGTG CCGCGAGACG CCCGATGGTT TCTGGGGTTC GGGGGAGGCG CTGACCGTCA ATGTCGTCGA CCGTGTTACG GAACGCGCGG CAGGCGCCAT CGAAAATCTC ATCGTCGAGG ACATCACCGG CCGGATGGAG GGGGCGATCA CCGTCATTTC GACCTCGCCG GCCGGCATCC GCAACGCATC GCTGGCACGC ATCGCCATCG ATCAGCAGCC GGGGCAGCTC GGCACGGCAC GGTCCTACGA CCTGCGTCCG ACAAACGCCG ACCTCTCCCC GAAAGCCGAT GGCGGCGGCC GCGCCAATGC CTGGACCCGC GGGTCGGACG GGCGAGTGAT CGGTCTTGAG CACTATCCGG GAGGAATGCC GGCCGTCTAC GTGGCTGATG TCACCGGGAT CTTGATGAAC GAGGTGCGGA TCACAAGGCC GACACCGCTG CCGCAAGGCT GGAACAAAAA CGACGCCGTC TTCGAAACGG CGGCACCTGA TGGGAGTGGG GCATGGCAGA ACTGA
|
Protein sequence | MSPVSLVAIE PAQSDDTVRL QAAIDGLSAS GGGRVELMAG IHVCRGLQLR SGVDLHLAAG AILRPVPDYA AYAQTTVSVI AEKSDRGMIV AKDARRISLT GAGRIEAGCD SFIVGDDETV GTFIPAEFRP RVVVFEGCDE VEISSVHICR SPMWTLHFVN CTDVAVRNVI IDNDRRLPNT DGIVLDACRG AVIEDCRIST ADDGICLKTS IGPDRVAIGR CENILVRRCS VQSLSCALKI GTETHGDVTN VVFEDCSVSS SNRALGVFSR DGGRISNVRF SRIAVECRET PDGFWGSGEA LTVNVVDRVT ERAAGAIENL IVEDITGRME GAITVISTSP AGIRNASLAR IAIDQQPGQL GTARSYDLRP TNADLSPKAD GGGRANAWTR GSDGRVIGLE HYPGGMPAVY VADVTGILMN EVRITRPTPL PQGWNKNDAV FETAAPDGSG AWQN
|
| |
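The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. It assumes the amino acid assignments of translation table 11 are the same as those of the standard code (the tables differ in permitted start codons, which this sketch does not model). The helper names are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# assumed here to apply to translation table 11 as well).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
prefix = clean_sequence("ATGAGCCCCG TGTCTCTTGT CGCGATCGAG")
print(translate(prefix))  # prints: MSPVSLVAIE
```

The translated prefix matches the first block of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.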