Gene Rleg_2798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2798 
Symbol 
ID8013742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2778707 
End bp2780473 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content65% 
IMG OID644825369 
Productprotease Do 
Protein accessionYP_002976598 
Protein GI241205502 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.83064 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.454962 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCCCA CGATTCGCTC GCCCTTCAGA CGAACGCTCG CGCTTATGGC CAGCGCTGCA 
ATTCTTGCGC ATGCCGGCAT GAACGGGGTC GCGTACGCCC AAACCTCAGC TGAGGCGACA
CCGCCCGGGG TTGCCGCGCC CGCTCCCGCT ACTCCAGAAA CGGCTGCCCC TGCACCCACA
CCGCCGGCTC CGGCTGCCCC GGAAACGGCT GCTCCTACGC CTACCGCACC CGCCGCACCG
CAGCAGACCG CTCCAATCCA GGCCGCCGTG CCCAACAACG GCCCGGCTTC CGTCGCCGAT
CTCGCCGAGG GGCTGCTCGA CGCCGTGGTC AACATCTCGA CCTCGCAGAA TGTGAAGGAC
GATGAGGGCG CGGGTCCGGC GCCGCGCGCG CCCGACGGCT CGCCTTTCCA GGAATTCTTC
AACGATTTCT TCAACAAGCA GCAGGGCAAC AAAGGCGGCA ACCACAATGT CAGCTCGCTC
GGCTCCGGCT TCGTCATCGA TCCGGCCGGC TATATCGTCA CCAACAACCA CGTGATCGAG
GGCGCCGACG ATATCGAGAT CAATTTCGCC AATGGTTCGA AGCTCAAGGC GAAGCTGATC
GGCACCGATA CGAAGACCGA TCTTTCGGTG CTGAAGGTCG AGCCGAAGAC GCCGCTGAAA
TCGGTGAAAT TCGGCGATTC CAGCACGATG CGCATCGGCG ACTGGGTGAT GGCGATCGGC
AATCCGTTCG GCTTCGGCGG TTCGGTGACG GTGGGTATCA TTTCCGGGCG TGGCCGCAAC
ATCAATGCCG GTCCCTACGA CAACTTCATC CAGACGGATG CGGCGATCAA CAAGGGCAAT
TCCGGCGGCC CGCTCTTCAA TATGAAGGGT GAAGTGATCG GCATCAACAC GGCGATCATT
TCGCCGAGCG GCGGCTCGAT CGGTATCGGC TTCTCTGTGC CGTCGGAGCT TGCTTCCGGC
GTCGTCGACC AGCTGCGCGA ATATGGCGAG ACGCGGCGCG GCTGGCTCGG CGTGCGCATC
CAGCCGGTGA CCGACGATAT CGCCGACAGC CTCGGGCTCG ACACTGCAAA GGGCGCGCTG
GTCGCCGGTG TCATCAAGGG CGGCCCCGTC GATGACGGCT CGATCAAGGC GGGCGACGTC
ATTTTGAAAT TCGACGGCAA GACCGTCAGC GAAATGCGCG ACCTGCCGCG CGTCGTGGCG
GAGAGCACCG TCGGCAAGGA AGTTGACGTC GTGGTGCTGC GCGACGGCAA GGAGCAGACC
GTCAAGGTGA AGCTTGGCCG GCTCGAGGAC AGCGATCAGG CGGCAGCATC CGACGCGCCC
GACGGTTCGC AGAACGACGG CGGCGTGATC ACCCCGGACC CCGGCGAGAA CAACGACATG
GACCAGCCGG ATTCCGGCGA TCAGGCTAAG CCTGCACCTG ATACACCCGA CCAGCATAAG
GGGCAGGTGT CGCCGGATGC GGCCACGCCG AAGAACGTGC TCGGGCTGTC GCTGTCGCTG
TTGAGCGCCG AGACGCGCAA GGCCTTTGGC ATCGCCGAGA GCGTTGACGG CGTCGTCGTC
ACAGAGGTGA CGCCCGGCTC CGCCTCGGCC GAAAAGGGGC TGAAGCCCGG CGACGTGATC
GTCGAGGTTG CGCAGGAGTT TATGAAGTCG CCGGATGCTG TCGCCGCCAA GGTGCAGGCG
CTGAAACAGG AGGGCCGCCG CAACGCTCAG CTGATGGTCG CATCGGCGAA TGGCGATCTG
CGGTTCGTGG CGGTGCCGAT GGAATAG
 
Protein sequence
MAPTIRSPFR RTLALMASAA ILAHAGMNGV AYAQTSAEAT PPGVAAPAPA TPETAAPAPT 
PPAPAAPETA APTPTAPAAP QQTAPIQAAV PNNGPASVAD LAEGLLDAVV NISTSQNVKD
DEGAGPAPRA PDGSPFQEFF NDFFNKQQGN KGGNHNVSSL GSGFVIDPAG YIVTNNHVIE
GADDIEINFA NGSKLKAKLI GTDTKTDLSV LKVEPKTPLK SVKFGDSSTM RIGDWVMAIG
NPFGFGGSVT VGIISGRGRN INAGPYDNFI QTDAAINKGN SGGPLFNMKG EVIGINTAII
SPSGGSIGIG FSVPSELASG VVDQLREYGE TRRGWLGVRI QPVTDDIADS LGLDTAKGAL
VAGVIKGGPV DDGSIKAGDV ILKFDGKTVS EMRDLPRVVA ESTVGKEVDV VVLRDGKEQT
VKVKLGRLED SDQAAASDAP DGSQNDGGVI TPDPGENNDM DQPDSGDQAK PAPDTPDQHK
GQVSPDAATP KNVLGLSLSL LSAETRKAFG IAESVDGVVV TEVTPGSASA EKGLKPGDVI
VEVAQEFMKS PDAVAAKVQA LKQEGRRNAQ LMVASANGDL RFVAVPME