Gene Rleg_6789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6789 
Symbol 
ID8022719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp225600 
End bp230264 
Gene Length4665 bp 
Protein Length1554 aa 
Translation table11 
GC content62% 
IMG OID644833656 
Productpeptidase C14 caspase catalytic subunit p20 
Protein accessionYP_002984790 
Protein GI241666706 
COG category[R] General function prediction only 
COG ID[COG4249] Uncharacterized protein containing caspase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGTAA TAGGGATTGC TCTGGCATGT ATCATGTTGC TGTCGCAGCC GGCTCTGGCG 
CAGTCTCCAC TCGACATCAT CAGGAGCTTG ACCGACGGCG TCACACGTGA TGCCGCGCCG
CCAGATGCTG GTAGCCCACC GTCTAAACCG ATACCGACTC TTCAATCACA GCTGACAAGA
TCGCAAGTGC AACGCCTGCA GCGTGCCCTG GAAGCGCTCG GTTACTATCA TGGTCCGATC
GACGGAAATG CCGGGGTCGA GACGTGGAGT TCGGTGGTAG CGTGGGCACG CGACCGGGGA
TGGGAGGCTC CAACAACCCT CCGTGTCGCG CATCTCAATT CTCTTGAAGC TGAACGCGCG
CAGGGACAGG TCGCAGAACA GGATCGTCCA GGCTCTGGTC CAGAAGATGG GACTATTCGT
CGCGTTCAAT CGGCCCTCTC ACGGCTCGGC TACTATTCGG GGCCGGTGAA TGGCCAGACA
GACCAAGCGA CCATGGCTGC CATGGCGGCC TGGGCAGCCG ATCGGAACTG GGAGGCGCCG
GCATCTATCC GCGAGGCACA TGCCGTCAAC ATGGAGGAGG AGCTTTCCCA TAGGACAGGC
CCGGTTGTCG GCGATGCGGC CTTGCCACCG TGGCAGCCGG ACGACACGGG GATAGGCAGC
AAAATCTGCG CCTCCACCGG CTACAGCAAG GTCTGCCTTC GTCTTGCCTG CGATGCCGAA
GCCGGTGTGG TCCTTGATTT CGAAGAGGTA GCGGGCTACT CGACACCGTC GGTCTCGAGC
GCGGGTCTGA CGGTGGATCG AGGCCCTGAG GCAACGCTTC CCCTCGATGG CATCCGCAAG
AGACCGGTCC TCGATGCCGG TCGCGATGCA GCGCTCATCG CCCACCTCTC CTCCGGGCGG
TCGCTGAACC TCAAGTTGGG GACATGGTCG ATGTCCTTCG GTCTCGCGCA ATTCGGCCCC
GAGTACGACC GGGTTGGAAA GGCATGTGCT ACCCTCCTCG CCAGGCTTCG TAGCGGCCAG
GATCCCTTCC CTGGCCAGTT CTCCCGTCTT GCCGACGAAC TGGCCGAGGC CACCGTGTCC
GAAGGAACCT GGACCTCGCA GCCGAATATG GATCTTGGCG GCGATGACAT TCGACATGGG
CTGACCGATC CGCTTCTGCG TGGTCTCACG GAAGGCGTAT GTATCGCGAT CTGCTCGGAA
ACCGACATGT GCCGAGCCTA TACGTTCAAG CCGAGCGGCG GGGGATGTTT CCTGAAGTCC
GCAAAGGGCT CGCCGAAACC CCACGCAGCC GCAAAAAGCG GCGTTTTCGA TGGCCGGAAA
GCGGGTTTAG CACCTCCACC CACCCGCGGT CCGGGACCGA TCGTGGATGG AGCGGTCGGG
TGGCACGAGG GCGAGCCGCT GGAAGGCTTC CAGGCCCGCG TGAAACAAGC TGCTCGCAGG
CTCGGCGGGT CATGCGAGGA GGAGAAGGAG ACGCTCCGGC GCCTGGCCGA AAAATTCCAA
TGGACACTAC CGCATCAGGG ACCGCTCCGC GCCGGAAACA GCTTTGCGAT CGAGTGGTCG
GGCAACACGC TCGAAGATCG CATACCGGTT TGGTTCATGG TCAAAGCAGA ACAGCCCGTC
CGTTTTAAAG GCAAGGGGCA TGTCGCGTTG GGTCCTGATG CACCCAATCC GTTCGCGATC
AAAACCGGTC TCGGCAAGAC ACGCGCCATG GTCGCTCTGG CGACGCGTGG CGCCGCGGCG
AGTGGTTCAG TTTCAGCCAT TCCGCTGCAA GCCGGGCCTT TGAACCTATC CGTCACATTG
GTGGGCTATC TGCGCGCCTG CGAGGAGGAG ATCGTTCTGA AGGAAGGGTC CGAGCGGCTC
GAAATCGCAC CGGCGCCCGC CGAGATCGTC CTCAACACCG CCGAGGGCCG CGCGGCGTTC
ACACATAGCA TCGATATCCC TAAATTCTCC CGCCGCATCC TCCTCAACGA TACCCGCTTT
CTTCTGCTGG ATGCGGCAAG CGGGACGGAA ATCGTCGAGC GGGCAGGCAC ACACCTGCGG
ATCTCGCCAA CCCACCGCTT CATCGGCGTC GAGCATAATG GCCGGCTCGA CATCGTCGAC
ATGGTCGATG GGCATACGGC TGCGACAGCG GATGCCGGCG ACCTTCGCTG GGCGCTCGGC
GACAGTGTCG TCTTCACCAC ACTCGCGCCT TGGGCGGAGG TGAATTTTTC CTCTACCTTC
GGCGATCATT TGCGCATACG GGAACAGATT ACCGGGCCGT CTTGCTGCAC CGCAGAACGC
GGTCAAACGC GCGTGGCGAT CGATCTGGAG AATGCTGCCT ACGCCATCTG GGGTGGCGCC
GGCTACCGCG TTGGTGCGCT CCAGAACCCG GATTACGCCT CGATCGCGAA TTCTTCGGGC
GCTTACAGTT CCGAAGGTGG CGAGACAATT CCCCTTCATT TTCATATGTT CTGGTCGCTT
GGCATGGTGT CGCCCGTCAG TGTCGCTCGG GAATTCGATG TTGCCGGCGG CTTCAAGACG
ACCAGTACCT GGGAAGACTG GGAGGTCGCG GAAGCCGACA GCCGCCGCCC CCGCGATTTT
ACGGAGAGCC TGTCGCGCAC GTTGGCCAAA ATTGAGCTTC AGGCAGTCAT GATCGATACC
GCCGTGGCAT CGAACGACCA AGGCTCCACC AGAGCCGGCA ACGACACGCC TCTCGCCGCT
GCCTTGCCCG AACAACTCTT GCGCCTCGGC GTCGCGCTCG ATCCAATGGT CGACGGTGAA
CGCCTCCTCG CGTCCTACGC CGCGGGTGAA AACAACGCCC TTCACGCGCT TGATCGCGAC
CAGCGGCTGA AACGATCCGC AGAGGCGATG GGGCGCTTCC GCCGCGAGGC GGAGCGTGCG
GGCTGGCGAT TCGACTGGGC TCTGCCCGTC GGGGAGGAGG GTCCCATCTC CGACTGTGAA
CATCTTCTGC TCGGAGAATC CTCAAGCTCG GGGGGCACGG GCTCGCTGCT AGCGCCTCGC
GATGTTGTCG AGGTCTCGAC CGTCCGCACG TCTCGCGGGG CGGTATGGGT GGCGCGTGCG
GAGTGTGTCG CGGGGGCGAC GTTCGGGAGC CTCAGGCCCT ATGCTGCCCT TTACGTCATG
GATATCGCGC GCCCCGCGCC GGCGGCGAGT GCGGCACTTC AGGCGGAAGG CGCATTTTTC
TTCGAAAACA ATGCCCATCG TCTGTGGTAC CAGCACGCTT TTCGTATCAA AGCCAATGAC
GACCTGCTGC TGACCTATGC GCCCGGCAAC GGCGTCATTA CCGTGCGTGA CCGCGCAACA
CAGAAGTTCC TCTGGATCGG CGAGAATCTT CCGAATGGCG ACCTCCTCGT CGACGCCTGG
TTGACGAAAG ACCGTCGCCA TGCGGTGCAA CTTAATTCTG ACGGCAACTT CTATATTCAC
GCCATTCTGG ACGATAGGCA GTCGCTTCTG TCCGGGCGCA TCGCCGACGA CGAGATCGCG
GTCTGGACCA GGGACTACTT TTACGACGCG ACAGCCGAGG CCGCCGCCCT CATCGATCTG
AAGTTTCCCG GTAGGGTTGG ACAGTATAGT CTCGACCGGT TCGGCATAGC GCGCCAGGTC
CCAGATCTGG CGCGCGCCGT TCTTGATCGA GGGCAGACGC CGCAGGCCGT CGCCGAGGTG
GGCGTACCGC CGTCGTTGAC AGGCGAGATT GCGCTCGAGG AAAACGGCAG TCGCGTAAGA
GCCGTATTGC ACTTCGACCC GGCTGAGACG GTGCATATGT CCGTTTTTCA GGACGGTGTT
CTGACCGGAA CGGTTGATGC AGCCACCATC GGAAATGCAG TGTCGATCGA GAGGCTCAAG
GACGCGCGTT GGGTGTCGGT CATCGGATTC AATTCTGCGG GTCTGGCGAG CCTCCCGGTA
TCGGCTGATC TCGGCGAGCC TCTTGCGCGC CGTGCGGTCA CCCGCGCGCT CGTCATCGGC
GTGAACACTT ACGAGGATGA GCGCCTCCGT TCGCTCAATT ACCCTCTGCG CGATGCAGGC
ACGGTGCTCG AAACGCTGAC CGAGCCGCTT GGTAAAGAGC CACCGTTCCG CGGCGAGGCA
GGGCCGAAGG ACAGGCGTGC CACACCCGAG GCCATACTAG AGGCGACGGC GCGACTGCTG
GACGGGCTGG TCCGTGGGGA CCACGCGGTC CTGTTTCTGG CCGGCCATGG CATGCAGGAT
CGCAATGGGC GTTTCTATTT CGCGACGTCT GCCACCGATC CAACGGATCT CGAACACACA
GCCTTGCCTT TCGACCGGCT GGCCGCGCTG TTCGAAAGGA CCGAGGCGCG CATCACTATC
CTTCTTGACG CCTGCCACTC CGGCGCCGCG GGGACCGGGG CTTTCACCAC CAACGACGAT
CTCGCCAACA GCCTCGTCGC CCTTAAATCC AATCTCACCA TTCTCGCCGC TGCGAAGGGG
CGACAGGAGT CGCTTGGGCG GCGGGAGGTC GGCGGCCTGT TCACCAACGC GGTCGTTACT
GTCCTCGGAA AGGAACGCGA TCGTTATGAC CACAATCACA ATGGTCGCAT CGAGGCCTCT
GAACTGTATC GCGGCGTCAA GGCGCTGGTT TTCGCGGCGA GCGACGGCAA GCAGACACCC
TGGATCATCA ACAGCCGATT GGTGGGAGAC TATGCCCTCT TTTAG
 
Protein sequence
MRVIGIALAC IMLLSQPALA QSPLDIIRSL TDGVTRDAAP PDAGSPPSKP IPTLQSQLTR 
SQVQRLQRAL EALGYYHGPI DGNAGVETWS SVVAWARDRG WEAPTTLRVA HLNSLEAERA
QGQVAEQDRP GSGPEDGTIR RVQSALSRLG YYSGPVNGQT DQATMAAMAA WAADRNWEAP
ASIREAHAVN MEEELSHRTG PVVGDAALPP WQPDDTGIGS KICASTGYSK VCLRLACDAE
AGVVLDFEEV AGYSTPSVSS AGLTVDRGPE ATLPLDGIRK RPVLDAGRDA ALIAHLSSGR
SLNLKLGTWS MSFGLAQFGP EYDRVGKACA TLLARLRSGQ DPFPGQFSRL ADELAEATVS
EGTWTSQPNM DLGGDDIRHG LTDPLLRGLT EGVCIAICSE TDMCRAYTFK PSGGGCFLKS
AKGSPKPHAA AKSGVFDGRK AGLAPPPTRG PGPIVDGAVG WHEGEPLEGF QARVKQAARR
LGGSCEEEKE TLRRLAEKFQ WTLPHQGPLR AGNSFAIEWS GNTLEDRIPV WFMVKAEQPV
RFKGKGHVAL GPDAPNPFAI KTGLGKTRAM VALATRGAAA SGSVSAIPLQ AGPLNLSVTL
VGYLRACEEE IVLKEGSERL EIAPAPAEIV LNTAEGRAAF THSIDIPKFS RRILLNDTRF
LLLDAASGTE IVERAGTHLR ISPTHRFIGV EHNGRLDIVD MVDGHTAATA DAGDLRWALG
DSVVFTTLAP WAEVNFSSTF GDHLRIREQI TGPSCCTAER GQTRVAIDLE NAAYAIWGGA
GYRVGALQNP DYASIANSSG AYSSEGGETI PLHFHMFWSL GMVSPVSVAR EFDVAGGFKT
TSTWEDWEVA EADSRRPRDF TESLSRTLAK IELQAVMIDT AVASNDQGST RAGNDTPLAA
ALPEQLLRLG VALDPMVDGE RLLASYAAGE NNALHALDRD QRLKRSAEAM GRFRREAERA
GWRFDWALPV GEEGPISDCE HLLLGESSSS GGTGSLLAPR DVVEVSTVRT SRGAVWVARA
ECVAGATFGS LRPYAALYVM DIARPAPAAS AALQAEGAFF FENNAHRLWY QHAFRIKAND
DLLLTYAPGN GVITVRDRAT QKFLWIGENL PNGDLLVDAW LTKDRRHAVQ LNSDGNFYIH
AILDDRQSLL SGRIADDEIA VWTRDYFYDA TAEAAALIDL KFPGRVGQYS LDRFGIARQV
PDLARAVLDR GQTPQAVAEV GVPPSLTGEI ALEENGSRVR AVLHFDPAET VHMSVFQDGV
LTGTVDAATI GNAVSIERLK DARWVSVIGF NSAGLASLPV SADLGEPLAR RAVTRALVIG
VNTYEDERLR SLNYPLRDAG TVLETLTEPL GKEPPFRGEA GPKDRRATPE AILEATARLL
DGLVRGDHAV LFLAGHGMQD RNGRFYFATS ATDPTDLEHT ALPFDRLAAL FERTEARITI
LLDACHSGAA GTGAFTTNDD LANSLVALKS NLTILAAAKG RQESLGRREV GGLFTNAVVT
VLGKERDRYD HNHNGRIEAS ELYRGVKALV FAASDGKQTP WIINSRLVGD YALF