Gene Information |
Locus tag | Rleg_1051 |
Symbol | |
ID | 8012180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1026565 |
End bp | 1027470 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644823634 |
Product | PDZ/DHR/GLGF domain protein |
Protein accession | YP_002974885 |
Protein GI | 241203789 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
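The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are inclusive, 1-based coordinates and that the gene length includes the terminating stop codon. The record does not state either convention, although its values are consistent with both. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1026565
END_BP = 1027470
GENE_LENGTH_BP = 906
PROTEIN_LENGTH_AA = 301


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 906
print(protein_length(GENE_LENGTH_BP))  # 301
```

Under these assumptions both results match the Gene Length and Protein Length fields of the record.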
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00175075 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATCG ATCCGATTTT GCGGTCAGTC GTGGCCGTCC GTTCCTCCAT TCCGGAAGAT GCCTTCACAG CGGAGACGCT GGGCACTGTC CGGGAGGGCA GCGGCGTGGT CATTCGCGAC AACGGGCTGG TGCTGACCAT CGGTTATCTG ATCACCGAGG CCGAAGAGGT CTGGCTGACG ACCCATGACG GGCGTGTGGT TCCCGCGCAT GCGCTTGCCT ATGATCAGGA AAGCGGCTTC GGTCTGGTGC AGGCGCTTGG CCCTCTGAAT GCGCCGGCGG TGGATCTCGG CGACGCGGCC ACGGCCAAGG CCGGCGATCC CGTCGTGCTG GCCGATGGCA TCGGAGAATT CGTCGAGGCT AATATCGTCG CCCGGCAGGA ATTCGCCGGC TATTGGGAAT ATCTGCTGGA CGAGGCGATT TTCACGTCAC CAGCCCATCC CTCATGGGGC GGTGCGGCGC TGATCGGTTC GGACGGCAAG CTTCTCGGCA TCGGTTCGCT TCGCCTGCAG ATGAGCGATG GCGACGAGGT CGCCGATATC AACATGGTCG TGCCGATCGA CCTTTTGCCG CCGATCCTCG ACGATCTCTT GAACCGGGGA CAGGTCAACA GACCGCCGCG GCCCTGGCTC GGCGCCTTCT CCGCCGAGAG CAATGGCGGC GTGGTGGTCA TGAGCGTGGC CGAAGGCGGG CCGGCCGCTC AGGCGGGCCT GCGTCAGGGC GATATCATAT CGGAAATCCG CGACGAAGAG GTCGACGGCC TGGCCGATTT CTACCGCAAG GTCTGGAGCA GCGGCCCGGC CGGCGCCGAG ATCCCCATGC GCATCCTCAG GAACGGCCGG GAAGCCTGGC TGCGCATCAA GTCCGCCGAC CGCAACAGCT TTCTTAAGAA GCCGCAGCTG CAGTAA
|
Protein sequence | MNIDPILRSV VAVRSSIPED AFTAETLGTV REGSGVVIRD NGLVLTIGYL ITEAEEVWLT THDGRVVPAH ALAYDQESGF GLVQALGPLN APAVDLGDAA TAKAGDPVVL ADGIGEFVEA NIVARQEFAG YWEYLLDEAI FTSPAHPSWG GAALIGSDGK LLGIGSLRLQ MSDGDEVADI NMVVPIDLLP PILDDLLNRG QVNRPPRPWL GAFSAESNGG VVVMSVAEGG PAAQAGLRQG DIISEIRDEE VDGLADFYRK VWSSGPAGAE IPMRILRNGR EAWLRIKSAD RNSFLKKPQL Q
|
| |
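The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch, not part of the original record, that strips the spacing, computes the GC fraction, and translates a coding sequence. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares with the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Compact standard codon table: codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
first_codons = "ATGAATATCG ATCCGATTTT GCGGTCAGTC"
print(translate(first_codons))  # MNIDPILRSV
```

Pasting the full Gene sequence field into `translate` and `gc_fraction` should allow the Protein sequence and GC content fields to be compared against the record.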