Gene Rleg_7211 details

Gene Information

Locus tag: Rleg_7211
Symbol:
ID: 8022917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012858
Strand:
Start bp: 634758
End bp: 637283
Gene Length: 2526 bp
Protein Length: 841 aa
Translation table: 11
GC content: 65%
IMG OID: 644834044
Product: peptidase S1 and S6 chymotrypsin/Hap
Protein accession: YP_002985178
Protein GI: 241667094
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5640] Secreted trypsin-like serine protease
TIGRFAM ID:
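
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon. Neither assumption is stated in the record, and the function name is hypothetical.

```python
def check_gene_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the stop codon (both are assumptions, not stated facts).
    """
    span = end_bp - start_bp + 1            # inclusive coordinate span
    coding_codons = gene_length_bp // 3 - 1  # all codons minus the stop codon
    return {
        "span_matches_length": span == gene_length_bp,
        "length_is_multiple_of_3": gene_length_bp % 3 == 0,
        "protein_matches_codons": coding_codons == protein_length_aa,
    }


# Values copied from the Gene Information fields of this record.
result = check_gene_lengths(634758, 637283, 2526, 841)
print(result)
```

Under these assumptions all three checks hold for this record: 637283 - 634758 + 1 = 2526, and 2526 / 3 - 1 = 841.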


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0794016
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACGT CGATCAGCAG GGTCCTCAAT ATTGCGTCCG TCATGCTGCT TCTGGCCACT 
CCCGTGCTGG CGCAGCAGGA TACTGATTTC GCCGGCGAGG ATGGCGGGCG GGTCATCGGC
GGCCAGGCGG CCAAGAAGGG CGAATGGCCC TGGCAGGTGA AGATCCTGGC GCCCGATCCC
GAACAGCGCG GCCGCTTCGG CGGCCATTGC GGCGGCTCGC TGATTTCGCC GCGCTGGATA
TTGACGGCCG CCCATTGCGT CACCAGCGGC CGCTCCGGCA AGCAGGATCT CTTCGCCCGC
GACCTGCTGA TCGTCGAGGG TAAATCGAAG ATCGACAAGG TGATCGCCGT CGATGGGCCC
GATAAGCCGG GACTGGCCGT GGAAGACGTG ATCATTCACG AGGACTTCGA CCGCAAGGTC
TTCGCCAACG ACATCGCCCT CATCAAGCTG AGCGAGCCCG CCAAATCGAA GCCGGCGATC
CTTGCCTCGG CCTCGGACGA CGAAGTGGAG GCAGCCGGCC ACCCCGCCGT CGTCACCGGC
TGGGGCTACA CCAAGGCCGA TCACGGCTGG GACGACAAAT ACCTGCCGAC CGAGCTGCAG
GAAGTGGAAC TGCCGATCGT CCCGCGCGAG GACTGCCGCG CCGCCTATCG CGACAGTTCG
ATGCGCATGA ACCCGATCGA CGAGCGCAAT GTCTGCGCCG GTTATGCCGA AGGCGGCAAG
GACGCCTGCC AGGGCGACAG CGGCGGGCCG CTGGTCGCCC AGCGTCCCGA CAAGCGCTGG
ATCCAGCTCG GCATCGTCAG CTGGGGCGCC GGCTGCGCTG AGGCCGAGCA CTACGGCGTC
TATACCCGTG TTGCCGCGTT CCGCGACTGG ATCGCCGCCA AGACCGACGG CGACGTGCCT
GATGTTGAAG GGCCCGCTGC CAATGATCAG GTCGCGTCCA CCACCACCAG TAGCGGGATG
AAGCAGAGGA AGTCCGGACA GGAAGAGGCC AACCTCGCCA TCACCACCCC ACCCGCCGGC
GATACCGCGC CGGCTGGGAC GACCGAAACG CCAGACGCGC CGGCCAACGA TACCGCGCCA
GTTGACAAGC CCGTCGCCGA CGAACCCGCC GTGCGGCCAG CAGTCGTCCA GACGCCGGTG
ATCGAAAGCA AGCCCGGTGA CCGCGCGCTG TTGATCGGCA TCGACGACTA CGAGATGCGC
GAGGCGAAGC TGACCGGTTC CGCCACCGAT GTGAAGGCGA TGCAGGTCTT CCTCGCCAAG
ACGCTCGCCT ACCGCCCGGA GCAGATCCAC ACGCTGACCA ACCGCAAGGC GACCCGCGAG
GCGATCCTTG CCGAAATCGA CGACTGGCTG GTGCGCCAGT CGACGCCCGG AAGCCGCGTC
TTCCTCTATT TCAGCGGCCA GGGCTCTGAA GAAATGGGCG CCGAGGAAAC GACGAGCCCG
ACGCTGGTGG CCGTCGATGC CAAGCTGGTG CGCGAGGGCG GCAAGGTGAC GGTCACCAAC
CAGATCCGCG AAACCGAGAT CGCCGCAAGG CTGAACAGCC TCAAGGATCG CCGCGTCACG
CTGCTGATCG ATGCCTGCCA TGTCGGGCCC GGCAGCCGCA GCGCGGTGGC CGCACCATCC
GGCACCGTGC GTTGCCTCGG CCCGGCGCTG GCAGGGCTCG AACCGCCGAA CAAATCAGGC
AAGGAAGCGA AATTCTCCTT CGGCGGCGAA AACGCCATGG TCTGGTCGGC TGTCAACGCA
GGGCAGTGGG CGCTGGTCGA TAGCGAGGCC AAGCCACCGC TTGGCGTCTT CACCCGCCGC
TTCATCGAAG GCGTGCAGGA TGGCGTGGCG CGCGCAGCCG ACAAGCCGAA TGTCAGCAAT
GCCGCTTTGC TCGATTATGT CCGGCGTAAA TCCGACGAAT ATTGCCGGAC GCATGCGGGA
GATTGCCGCT TCACGCCGGT GCCGCAATTT TATGGCCAGC CGGATGCGCT CGGCCGGGAT
GTCATCACCG GCGACGAGGC GAAGACGGCG GTCGCCGCCG TCGAGAATAC GCTGAAGAGC
GACAATGAGG CGGGCGTCGC CGTCGACGTG CTGCCCGGCA CTGCGGTCAG CATCGGCGAC
AAGGTGGCGA TGCGTGTGTC CACCAAAAAG TCCGGTTACC TGATCCTGGT CGATATCGAT
GCCTCCGGCA AGCTGACACA GCTTTTTCCT AACAAGCGCT CGATGGGGCT GAAACCATCA
GCCAAAAGCG GCGACAACCG GCTCGATCCG GCCCGGCCGG TCGTCGTGCC CGATGCGCGC
AATCCCTATA CCGGCTTCGA ATATGTAGTG GAGGGACCGG CCGGCGTCGG CATGGTCGTT
GCCATCCTCA GCGACAAGCC GATCGAAGTG CTCGACCTGC CCGACGTGCC GACGCCGCTC
GTCGGCCAGC GCGCCGCCTT CAACTATGTC TACGATCTCG CCCGAAGCCT CAGGATCGTC
GGCGACGACG AAACCGGCGC CCAAGGCAAA TGGTCGTTCG ATTCCAAATT CTATCGCATC
CGCTGA
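
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence can be measured. As a minimal sketch, the hypothetical helpers below strip the whitespace and compute a GC percentage, which could then be compared with the GC content field. The short input is a toy string, not this gene.

```python
def clean_sequence(sequence_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(sequence_text.split()).upper()


def gc_percent(sequence_text):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Toy input in the same block layout as the record (not the real gene).
toy = "ATGC GGCC"
print(len(clean_sequence(toy)))  # 8
print(gc_percent(toy))           # 75.0
```

Pasting the full gene sequence into `gc_percent` would give a value to compare with the reported 65%, allowing for rounding in the record.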
 
Protein sequence
MSTSISRVLN IASVMLLLAT PVLAQQDTDF AGEDGGRVIG GQAAKKGEWP WQVKILAPDP 
EQRGRFGGHC GGSLISPRWI LTAAHCVTSG RSGKQDLFAR DLLIVEGKSK IDKVIAVDGP
DKPGLAVEDV IIHEDFDRKV FANDIALIKL SEPAKSKPAI LASASDDEVE AAGHPAVVTG
WGYTKADHGW DDKYLPTELQ EVELPIVPRE DCRAAYRDSS MRMNPIDERN VCAGYAEGGK
DACQGDSGGP LVAQRPDKRW IQLGIVSWGA GCAEAEHYGV YTRVAAFRDW IAAKTDGDVP
DVEGPAANDQ VASTTTSSGM KQRKSGQEEA NLAITTPPAG DTAPAGTTET PDAPANDTAP
VDKPVADEPA VRPAVVQTPV IESKPGDRAL LIGIDDYEMR EAKLTGSATD VKAMQVFLAK
TLAYRPEQIH TLTNRKATRE AILAEIDDWL VRQSTPGSRV FLYFSGQGSE EMGAEETTSP
TLVAVDAKLV REGGKVTVTN QIRETEIAAR LNSLKDRRVT LLIDACHVGP GSRSAVAAPS
GTVRCLGPAL AGLEPPNKSG KEAKFSFGGE NAMVWSAVNA GQWALVDSEA KPPLGVFTRR
FIEGVQDGVA RAADKPNVSN AALLDYVRRK SDEYCRTHAG DCRFTPVPQF YGQPDALGRD
VITGDEAKTA VAAVENTLKS DNEAGVAVDV LPGTAVSIGD KVAMRVSTKK SGYLILVDID
ASGKLTQLFP NKRSMGLKPS AKSGDNRLDP ARPVVVPDAR NPYTGFEYVV EGPAGVGMVV
AILSDKPIEV LDLPDVPTPL VGQRAAFNYV YDLARSLRIV GDDETGAQGK WSFDSKFYRI
R