Gene Rleg_4209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4209 
Symbol 
ID8015901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4303146 
End bp4304468 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content62% 
IMG OID644826780 
Productcarboxyl-terminal protease 
Protein accessionYP_002977989 
Protein GI241206893 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.194144 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCGTA GGGCTTCTCT TGTTCTGGTC GGCGCATTGG TGGGTGCGAC CGCAATGAGC 
GTCATTTACT CGGCGGGTGT GCCGGCAGAA GCGGCCGGAT CCTCGACCTA CAAGGAACTT
TCGGTTTTCG GAGATGTCTT CGAGCGTGTG CGTGCGCAAT ATGTGACACC GCCTGCGGAA
GACAAGCTGA TCGAGAACGC CATCAACGGT ATGCTCTCCT CGCTCGATCC GCATTCGAGC
TACATGAATG CGAAGGACGC CGAGGACATG CGCACCCAGA CCAAGGGTGA GTTCGGCGGC
CTCGGCATCG AAGTCACGAT GGAAGACGAA CTCGTCAAGG TCATTACCCC GATTGACGAT
ACGCCCGCCG CCAAGGCCGG TGTTCTCGCC GGCGACTACA TCTCCGAGAT CGACGGCCAA
TCCGTGCGCG GCCTGAAGCT GGAAGACGCA GTCGAGAAGA TGCGCGGCGC CGTCAACACG
CCGATCAAGC TGACGCTGAT CCGCAAGGGT GCCGACAAGC CGATCGAGCT GACGATCGTC
CGTGACGTCG TCGCCGTCCA GGCAGTCAAG TCGCGTGTCG AGGACGATGT CGGTTATCTC
CGCATCATCT CCTTCACCGA GAAGACCTAT CCTGACATGG AAAAGGCGAT CAAGAAGATC
AAGGACACCG TTCCGGCCGA CAAGCTGAAG GGTTATGTTC TCGACCTGCG CCTCAATCCG
GGCGGTCTGC TCGACCAGGC GATCAACGTC TCCGATGCCC TGCTGCAGCG CGGCGAAGTC
GTTTCGACCC GCGGCCGCAA TCCGGATGAA ACCCGCCGCT TCAATGCCGG CCCGGGCGAC
CTGACGGATG GCAAGCCGGT GATCGTGCTG ATCAACGGCG GTTCGGCTTC CGCATCGGAA
ATCGTCGCCG GCGCTCTTCA GGATCTGCGC CGTGCCACCG TTCTCGGTAC GCGCTCCTTC
GGCAAGGGCT CCGTCCAGAC GATCATCCCG CTCGGCGAAA ACGGCGCGCT GCGCCTGACC
ACAGCGCTCT ACTACACGCC GTCGGGCCGC TCGATCCAGG GCACCGGCAT CACCCCCGAC
ATCAAGGTCG AGGAGCCGCT GCCGCAGGAA CTGCAGGGCA AGATGGTGAC CGAAGGCGAA
TCCAGCCTGC GCGGCCACAT CAAGGGCCAA AGCGAGACGG ACGAAGGTTC GGGCTCCGTT
GCCTACGTCC CGCCGGACCC GAAGGACGAC GTTCAGCTGA ATTACGCGCT CGACCTCCTG
CGCGGCAAGA AGACCGATCC GGCCTTCCCG CCGAATCCTG ACAAGGCCGT CGTCGCCAAG
TAA
 
Protein sequence
MIRRASLVLV GALVGATAMS VIYSAGVPAE AAGSSTYKEL SVFGDVFERV RAQYVTPPAE 
DKLIENAING MLSSLDPHSS YMNAKDAEDM RTQTKGEFGG LGIEVTMEDE LVKVITPIDD
TPAAKAGVLA GDYISEIDGQ SVRGLKLEDA VEKMRGAVNT PIKLTLIRKG ADKPIELTIV
RDVVAVQAVK SRVEDDVGYL RIISFTEKTY PDMEKAIKKI KDTVPADKLK GYVLDLRLNP
GGLLDQAINV SDALLQRGEV VSTRGRNPDE TRRFNAGPGD LTDGKPVIVL INGGSASASE
IVAGALQDLR RATVLGTRSF GKGSVQTIIP LGENGALRLT TALYYTPSGR SIQGTGITPD
IKVEEPLPQE LQGKMVTEGE SSLRGHIKGQ SETDEGSGSV AYVPPDPKDD VQLNYALDLL
RGKKTDPAFP PNPDKAVVAK