Gene Rleg_5453 details


Gene Information

Locus tag: Rleg_5453
Symbol:
ID: 8016762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012853
Strand:
Start bp: 31613
End bp: 33232
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 56%
IMG OID: 644827625
Product: Peptidase S53 propeptide
Protein accession: YP_002978825
Protein GI: 241518197
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4934] Predicted protease
TIGRFAM ID:
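
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the gene record above.
START_BP = 31613
END_BP = 33232
GENE_LENGTH_BP = 1620
PROTEIN_LENGTH_AA = 539


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one for the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Both checks agree with the record: 33232 - 31613 + 1 = 1620 bp, and 1620 / 3 - 1 = 539 aa.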


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.552713
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGATA GAAAGGTTTT CTCGAACAGT GTCGTGCCGC TTCCAGACCA TCCTGGCCTG 
ACCCACAATG GGTTGATGGT AAACGCGGTT GAGCCGGCGT CGACTACGCC GGTGGACGTT
ATGTTCTCAA TGGATATAGC GCCAGATTTA CGGAAGGAAC TAGAGAACAA GATTGGAGCC
GGGGAAACCG TTAGCCCAAA CGAACTCCAA TCGAAATATG GTGGCGATCC GGAAAACGCA
GGTGCCCTAA TTAAGTGGTT GAAGTCTCAA GGCTTTGAAA TTCTTGACGT GGCGGCAGAC
AAAAGCGCGG TATATGCCCG AGCTGCGGTG CCGGTAGTGG AACAGGTCCT CCAGGTTAAG
ATGGTACCAG TGACACGTGA CGGCATCACC TACATGTCGG CACAAAATCC CCCGAGTCTG
CCGGCCGAGA TTGGCGAACC CGTGCACGCG ATCTTGGGAC TTCAGCCGTT TCGTCGAGCA
CAGAAGCACT TTAGGAAGCG ATTTTCCAGA GTTGCAAATC GTAATCGTCT CCGGGCCGGC
TCACCTCAAC CAAACATCGA TAATTCTCCT CCGTACCTAA TCGCCGAGAT TCTAAGAGCC
TACGGCGCGC ATGGATTAGG CCTGACTGGA AACGGTCAGC AGATCGCGGT TCTCATCGAT
ACATTTCCGG CCGAGGCCGA CTTGAAAGCG TTTTGGAAAG CGAACGGACT CAAATGGGAT
GGGTCTCGCA TAAAGAAAAT CAATGTGGGC AATACACCCC TACCGCCGCC TGAAGGAGAG
GAGACACTTG ACGTATCATG GACTGGCGGG ATTGCACCGG AAGCCGAGAT AAGAATCTAC
GCCTCCGGCT CGCTGCAGTT TTCTGCCCTT GATAGGGCTC TGGACCGTAT CATTGCCGAT
GTGCCGGCAA ATCCCGGCTT GCGGCAGCTG TCGATCAGTC TCGGCTTGGG AGAGACGTAC
ATGGGCGGAC CAGACGGGGA GGTCGCAGCT CAGCATAGCA GATTCTTGAA ATTGGCCGCA
GCAGGTGTGA ATGTGTTCGT GTCTACCGGA GATGCGGGAT CCAACCCGGA CCCGACTGGG
CACAGTCCAA CCGGGCCGCT GCAAGCTGAA TACGAATCGA CGGATACCGC CGTGGTTGCA
GTGGGCGGCA CGACGCTTCA ACTAACACCT GACGGCTCGG TTTCGTCAGA GATTGGATGG
GCGGCCAGTG GAGGGGGTCG AAGCGTCTTG TTCTCGCGAC CCGTTTGGCA GGCTGGCGTA
GGAATTCCTC CGGGCACGGA TCGCCTTGTT CCAGATGTTT GTGCAGCTGC CGACCCAAAC
ACCGGCGCCT TTCTGGTGTT ACACGGACAG CCGACCGGAA TAGGTGGAAC AAGTTGGAGC
GCGCCAATGT GGGCGGGCTT CTGCGCGCTG ATAAATGAAG CGCGCCACAA GAATGGACAA
CCAGCTCTGC CCTATCTGAA CCCGCTCTTG TATCCTCTGT CAGGGACGGC AGCGTTTCGG
GATATCTCTC ACGGAACAAA TGGGGCTTAC ACAGCAAAGT CCGGTTATGA CCTTGTGACC
GGTCTCGGTG CTCCGAACTT GGCGAAGCTC ATCAGCACGC TGGTTGGGCA GGAAGTATAG
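
A GC content figure such as the one reported in the gene information could be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C among the A/C/G/T bases; the helper name `gc_content` is hypothetical, and the spaces and line breaks used to group the sequence into blocks of ten are ignored. Only the first line of the sequence is used here as sample input.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    # Drop the spaces and newlines that group the sequence into blocks.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, copied as formatted in the record.
first_line = "ATGGCTGATA GAAAGGTTTT CTCGAACAGT GTCGTGCCGC TTCCAGACCA TCCTGGCCTG"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```

Passing the full 1620 bp sequence instead of `first_line` would give a value to compare with the rounded figure in the record.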
 
Protein sequence
MADRKVFSNS VVPLPDHPGL THNGLMVNAV EPASTTPVDV MFSMDIAPDL RKELENKIGA 
GETVSPNELQ SKYGGDPENA GALIKWLKSQ GFEILDVAAD KSAVYARAAV PVVEQVLQVK
MVPVTRDGIT YMSAQNPPSL PAEIGEPVHA ILGLQPFRRA QKHFRKRFSR VANRNRLRAG
SPQPNIDNSP PYLIAEILRA YGAHGLGLTG NGQQIAVLID TFPAEADLKA FWKANGLKWD
GSRIKKINVG NTPLPPPEGE ETLDVSWTGG IAPEAEIRIY ASGSLQFSAL DRALDRIIAD
VPANPGLRQL SISLGLGETY MGGPDGEVAA QHSRFLKLAA AGVNVFVSTG DAGSNPDPTG
HSPTGPLQAE YESTDTAVVA VGGTTLQLTP DGSVSSEIGW AASGGGRSVL FSRPVWQAGV
GIPPGTDRLV PDVCAAADPN TGAFLVLHGQ PTGIGGTSWS APMWAGFCAL INEARHKNGQ
PALPYLNPLL YPLSGTAAFR DISHGTNGAY TAKSGYDLVT GLGAPNLAKL ISTLVGQEV
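
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the pipeline used to produce the record: it uses the standard codon assignments, which translation table 11 shares for internal codons, and it does not handle the alternative start codons that table 11 permits (not needed here, since the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    # Ignore the spaces and newlines used to group the sequence.
    dna = "".join(b for b in seq.upper() if b in "ACGT")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in the record.
print(translate("ATGGCTGATA GAAAGGTTTT CTCGAACAGT"))  # MADRKVFSNS
```

The output matches the first ten residues of the protein sequence listed above.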