Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0906 |
Symbol | |
ID | 6979624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 921773 |
End bp | 922678 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643395617 |
Product | PDZ/DHR/GLGF domain protein |
Protein accession | YP_002280426 |
Protein GI | 209548509 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0997368 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATCG ATCCAATTTT GCGGTCAATC GTGGCCGTTC GTTCTTCCAT CCCGGAAGAT GCCTTCACAG CGGAGACGTT GGGCACTGTC CGGGAGGGCA GCGGCGTGGT CATTCGCGAC AATGGGCTGG TGCTGACCAT CGGTTACCTC ATCACCGAGG CCGAAGAGGT CTGGCTGACC ACCCATGACG GGCGCGTCGT TCCCGCCCAT GCACTTGCCT ATGATCAGGA ATCCGGCTTC GGCCTGGTGC AGGCGCTGGG GGCTCTCAAT GCGCCGGCTG TGGATCTCGG CGATGCGGCA AGCGCCAAGG CCGGCGATGC TGTGGTGCTT GCCGATGGCA TCGGAGAATT CGTCGAGGCC AATATCGTCG CCCGGCAGGA ATTCGCCGGC TACTGGGAAT ATCTGCTGGA TGAGGCGATT TTCACGGCGC CGGCCCATCC CTCATGGGGT GGTGCGGCGC TGATCGGTTC GGACGGCAAG CTTCTGGGCA TCGGTTCGCT TCGCCTGCAA ATGAGCCAGG GCGACGAGGT CGCCGATATC AACATGGTCG TGCCGATCGA CCTTTTGACT CCGATCCTCG ACGATCTGTT GAACCGCGGA AGCGTCAACA AACCGCCGCG GCCCTGGCTC GGCGCATTCT CCGCCGAGAG CAATGGCGGC GTGGTGGTGA TGAGCGTCGC CGAAGGCGGC CCGGCCGCCC AGGCGGGTCT GCGGCAAGGC GATATCATCT CGGAGATCCG CGATGAAGAG GTCGATGGCC TGGCCGATTT CTACCGCAAG GTCTGGAGCA GCGGCCCGGC CGGCGCCGAA ATTCCGATGC GGATTCTGAG GAACGGCCGG GAAGCCTGGC TGCGCATCAA GTCCGCCGAC CGCAACAATT TTCTCAAGAA GCCGCAGCTG CAGTAA
|
Protein sequence | MNIDPILRSI VAVRSSIPED AFTAETLGTV REGSGVVIRD NGLVLTIGYL ITEAEEVWLT THDGRVVPAH ALAYDQESGF GLVQALGALN APAVDLGDAA SAKAGDAVVL ADGIGEFVEA NIVARQEFAG YWEYLLDEAI FTAPAHPSWG GAALIGSDGK LLGIGSLRLQ MSQGDEVADI NMVVPIDLLT PILDDLLNRG SVNKPPRPWL GAFSAESNGG VVVMSVAEGG PAAQAGLRQG DIISEIRDEE VDGLADFYRK VWSSGPAGAE IPMRILRNGR EAWLRIKSAD RNNFLKKPQL Q
|
| |