Gene Information |
Locus tag | Rleg_2798 |
Symbol | |
ID | 8013742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2778707 |
End bp | 2780473 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644825369 |
Product | protease Do |
Protein accession | YP_002976598 |
Protein GI | 241205502 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
| |
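The coordinate and length fields above are consistent with one another, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon; both assumptions match the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Rleg_2798.
length_bp = gene_length(2778707, 2780473)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths agree with the listed Gene Length (1767 bp) and Protein Length (588 aa).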
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.83064 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.454962 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCCCA CGATTCGCTC GCCCTTCAGA CGAACGCTCG CGCTTATGGC CAGCGCTGCA ATTCTTGCGC ATGCCGGCAT GAACGGGGTC GCGTACGCCC AAACCTCAGC TGAGGCGACA CCGCCCGGGG TTGCCGCGCC CGCTCCCGCT ACTCCAGAAA CGGCTGCCCC TGCACCCACA CCGCCGGCTC CGGCTGCCCC GGAAACGGCT GCTCCTACGC CTACCGCACC CGCCGCACCG CAGCAGACCG CTCCAATCCA GGCCGCCGTG CCCAACAACG GCCCGGCTTC CGTCGCCGAT CTCGCCGAGG GGCTGCTCGA CGCCGTGGTC AACATCTCGA CCTCGCAGAA TGTGAAGGAC GATGAGGGCG CGGGTCCGGC GCCGCGCGCG CCCGACGGCT CGCCTTTCCA GGAATTCTTC AACGATTTCT TCAACAAGCA GCAGGGCAAC AAAGGCGGCA ACCACAATGT CAGCTCGCTC GGCTCCGGCT TCGTCATCGA TCCGGCCGGC TATATCGTCA CCAACAACCA CGTGATCGAG GGCGCCGACG ATATCGAGAT CAATTTCGCC AATGGTTCGA AGCTCAAGGC GAAGCTGATC GGCACCGATA CGAAGACCGA TCTTTCGGTG CTGAAGGTCG AGCCGAAGAC GCCGCTGAAA TCGGTGAAAT TCGGCGATTC CAGCACGATG CGCATCGGCG ACTGGGTGAT GGCGATCGGC AATCCGTTCG GCTTCGGCGG TTCGGTGACG GTGGGTATCA TTTCCGGGCG TGGCCGCAAC ATCAATGCCG GTCCCTACGA CAACTTCATC CAGACGGATG CGGCGATCAA CAAGGGCAAT TCCGGCGGCC CGCTCTTCAA TATGAAGGGT GAAGTGATCG GCATCAACAC GGCGATCATT TCGCCGAGCG GCGGCTCGAT CGGTATCGGC TTCTCTGTGC CGTCGGAGCT TGCTTCCGGC GTCGTCGACC AGCTGCGCGA ATATGGCGAG ACGCGGCGCG GCTGGCTCGG CGTGCGCATC CAGCCGGTGA CCGACGATAT CGCCGACAGC CTCGGGCTCG ACACTGCAAA GGGCGCGCTG GTCGCCGGTG TCATCAAGGG CGGCCCCGTC GATGACGGCT CGATCAAGGC GGGCGACGTC ATTTTGAAAT TCGACGGCAA GACCGTCAGC GAAATGCGCG ACCTGCCGCG CGTCGTGGCG GAGAGCACCG TCGGCAAGGA AGTTGACGTC GTGGTGCTGC GCGACGGCAA GGAGCAGACC GTCAAGGTGA AGCTTGGCCG GCTCGAGGAC AGCGATCAGG CGGCAGCATC CGACGCGCCC GACGGTTCGC AGAACGACGG CGGCGTGATC ACCCCGGACC CCGGCGAGAA CAACGACATG GACCAGCCGG ATTCCGGCGA TCAGGCTAAG CCTGCACCTG ATACACCCGA CCAGCATAAG GGGCAGGTGT CGCCGGATGC GGCCACGCCG AAGAACGTGC TCGGGCTGTC GCTGTCGCTG TTGAGCGCCG AGACGCGCAA GGCCTTTGGC ATCGCCGAGA GCGTTGACGG CGTCGTCGTC ACAGAGGTGA CGCCCGGCTC CGCCTCGGCC GAAAAGGGGC TGAAGCCCGG CGACGTGATC GTCGAGGTTG CGCAGGAGTT TATGAAGTCG CCGGATGCTG TCGCCGCCAA GGTGCAGGCG CTGAAACAGG AGGGCCGCCG CAACGCTCAG CTGATGGTCG CATCGGCGAA TGGCGATCTG CGGTTCGTGG CGGTGCCGAT GGAATAG
|
Protein sequence | MAPTIRSPFR RTLALMASAA ILAHAGMNGV AYAQTSAEAT PPGVAAPAPA TPETAAPAPT PPAPAAPETA APTPTAPAAP QQTAPIQAAV PNNGPASVAD LAEGLLDAVV NISTSQNVKD DEGAGPAPRA PDGSPFQEFF NDFFNKQQGN KGGNHNVSSL GSGFVIDPAG YIVTNNHVIE GADDIEINFA NGSKLKAKLI GTDTKTDLSV LKVEPKTPLK SVKFGDSSTM RIGDWVMAIG NPFGFGGSVT VGIISGRGRN INAGPYDNFI QTDAAINKGN SGGPLFNMKG EVIGINTAII SPSGGSIGIG FSVPSELASG VVDQLREYGE TRRGWLGVRI QPVTDDIADS LGLDTAKGAL VAGVIKGGPV DDGSIKAGDV ILKFDGKTVS EMRDLPRVVA ESTVGKEVDV VVLRDGKEQT VKVKLGRLED SDQAAASDAP DGSQNDGGVI TPDPGENNDM DQPDSGDQAK PAPDTPDQHK GQVSPDAATP KNVLGLSLSL LSAETRKAFG IAESVDGVVV TEVTPGSASA EKGLKPGDVI VEVAQEFMKS PDAVAAKVQA LKQEGRRNAQ LMVASANGDL RFVAVPME
|
| |
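The gene sequence above is shown 5' to 3' on the coding strand (it begins with ATG and ends with the stop codon TAG), so it can be translated directly even though the gene lies on the minus strand of the replicon. The following is a minimal sketch, not the database's own method: it removes the spaces that separate the 10-base blocks, computes a GC fraction, and translates with the codon assignments shared by NCBI translation tables 1 and 11. Alternative start codons, which table 11 permits, are not handled, and the helper names are hypothetical.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (bases T, C, A, G; first base
# varies slowest). Tables 1 and 11 share these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last bases of the gene sequence as printed in this record.
print(translate("ATGGCCCCCA CGATTCGC"))   # first six codons
print(translate("GTGCCGATGG AATAG"))      # last four codons plus the stop
```

Applied to the full gene sequence, `translate` can be compared against the protein sequence listed above, and `gc_fraction` against the GC content field.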