Gene Information |
Locus tag | Rleg_7211 |
Symbol | |
ID | 8022917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 634758 |
End bp | 637283 |
Gene Length | 2526 bp |
Protein Length | 841 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644834044 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_002985178 |
Protein GI | 241667094 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5640] Secreted trypsin-like serine protease |
TIGRFAM ID | |
| |
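The coordinate and length fields in the table above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(634758, 637283)
print(length, protein_length(length))  # prints: 2526 841
```

Both results match the reported Gene Length (2526 bp) and Protein Length (841 aa).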
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0794016 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGT CGATCAGCAG GGTCCTCAAT ATTGCGTCCG TCATGCTGCT TCTGGCCACT CCCGTGCTGG CGCAGCAGGA TACTGATTTC GCCGGCGAGG ATGGCGGGCG GGTCATCGGC GGCCAGGCGG CCAAGAAGGG CGAATGGCCC TGGCAGGTGA AGATCCTGGC GCCCGATCCC GAACAGCGCG GCCGCTTCGG CGGCCATTGC GGCGGCTCGC TGATTTCGCC GCGCTGGATA TTGACGGCCG CCCATTGCGT CACCAGCGGC CGCTCCGGCA AGCAGGATCT CTTCGCCCGC GACCTGCTGA TCGTCGAGGG TAAATCGAAG ATCGACAAGG TGATCGCCGT CGATGGGCCC GATAAGCCGG GACTGGCCGT GGAAGACGTG ATCATTCACG AGGACTTCGA CCGCAAGGTC TTCGCCAACG ACATCGCCCT CATCAAGCTG AGCGAGCCCG CCAAATCGAA GCCGGCGATC CTTGCCTCGG CCTCGGACGA CGAAGTGGAG GCAGCCGGCC ACCCCGCCGT CGTCACCGGC TGGGGCTACA CCAAGGCCGA TCACGGCTGG GACGACAAAT ACCTGCCGAC CGAGCTGCAG GAAGTGGAAC TGCCGATCGT CCCGCGCGAG GACTGCCGCG CCGCCTATCG CGACAGTTCG ATGCGCATGA ACCCGATCGA CGAGCGCAAT GTCTGCGCCG GTTATGCCGA AGGCGGCAAG GACGCCTGCC AGGGCGACAG CGGCGGGCCG CTGGTCGCCC AGCGTCCCGA CAAGCGCTGG ATCCAGCTCG GCATCGTCAG CTGGGGCGCC GGCTGCGCTG AGGCCGAGCA CTACGGCGTC TATACCCGTG TTGCCGCGTT CCGCGACTGG ATCGCCGCCA AGACCGACGG CGACGTGCCT GATGTTGAAG GGCCCGCTGC CAATGATCAG GTCGCGTCCA CCACCACCAG TAGCGGGATG AAGCAGAGGA AGTCCGGACA GGAAGAGGCC AACCTCGCCA TCACCACCCC ACCCGCCGGC GATACCGCGC CGGCTGGGAC GACCGAAACG CCAGACGCGC CGGCCAACGA TACCGCGCCA GTTGACAAGC CCGTCGCCGA CGAACCCGCC GTGCGGCCAG CAGTCGTCCA GACGCCGGTG ATCGAAAGCA AGCCCGGTGA CCGCGCGCTG TTGATCGGCA TCGACGACTA CGAGATGCGC GAGGCGAAGC TGACCGGTTC CGCCACCGAT GTGAAGGCGA TGCAGGTCTT CCTCGCCAAG ACGCTCGCCT ACCGCCCGGA GCAGATCCAC ACGCTGACCA ACCGCAAGGC GACCCGCGAG GCGATCCTTG CCGAAATCGA CGACTGGCTG GTGCGCCAGT CGACGCCCGG AAGCCGCGTC TTCCTCTATT TCAGCGGCCA GGGCTCTGAA GAAATGGGCG CCGAGGAAAC GACGAGCCCG ACGCTGGTGG CCGTCGATGC CAAGCTGGTG CGCGAGGGCG GCAAGGTGAC GGTCACCAAC CAGATCCGCG AAACCGAGAT CGCCGCAAGG CTGAACAGCC TCAAGGATCG CCGCGTCACG CTGCTGATCG ATGCCTGCCA TGTCGGGCCC GGCAGCCGCA GCGCGGTGGC CGCACCATCC GGCACCGTGC GTTGCCTCGG CCCGGCGCTG GCAGGGCTCG AACCGCCGAA CAAATCAGGC AAGGAAGCGA AATTCTCCTT CGGCGGCGAA AACGCCATGG TCTGGTCGGC TGTCAACGCA GGGCAGTGGG CGCTGGTCGA TAGCGAGGCC AAGCCACCGC TTGGCGTCTT CACCCGCCGC 
TTCATCGAAG GCGTGCAGGA TGGCGTGGCG CGCGCAGCCG ACAAGCCGAA TGTCAGCAAT GCCGCTTTGC TCGATTATGT CCGGCGTAAA TCCGACGAAT ATTGCCGGAC GCATGCGGGA GATTGCCGCT TCACGCCGGT GCCGCAATTT TATGGCCAGC CGGATGCGCT CGGCCGGGAT GTCATCACCG GCGACGAGGC GAAGACGGCG GTCGCCGCCG TCGAGAATAC GCTGAAGAGC GACAATGAGG CGGGCGTCGC CGTCGACGTG CTGCCCGGCA CTGCGGTCAG CATCGGCGAC AAGGTGGCGA TGCGTGTGTC CACCAAAAAG TCCGGTTACC TGATCCTGGT CGATATCGAT GCCTCCGGCA AGCTGACACA GCTTTTTCCT AACAAGCGCT CGATGGGGCT GAAACCATCA GCCAAAAGCG GCGACAACCG GCTCGATCCG GCCCGGCCGG TCGTCGTGCC CGATGCGCGC AATCCCTATA CCGGCTTCGA ATATGTAGTG GAGGGACCGG CCGGCGTCGG CATGGTCGTT GCCATCCTCA GCGACAAGCC GATCGAAGTG CTCGACCTGC CCGACGTGCC GACGCCGCTC GTCGGCCAGC GCGCCGCCTT CAACTATGTC TACGATCTCG CCCGAAGCCT CAGGATCGTC GGCGACGACG AAACCGGCGC CCAAGGCAAA TGGTCGTTCG ATTCCAAATT CTATCGCATC CGCTGA
|
Protein sequence | MSTSISRVLN IASVMLLLAT PVLAQQDTDF AGEDGGRVIG GQAAKKGEWP WQVKILAPDP EQRGRFGGHC GGSLISPRWI LTAAHCVTSG RSGKQDLFAR DLLIVEGKSK IDKVIAVDGP DKPGLAVEDV IIHEDFDRKV FANDIALIKL SEPAKSKPAI LASASDDEVE AAGHPAVVTG WGYTKADHGW DDKYLPTELQ EVELPIVPRE DCRAAYRDSS MRMNPIDERN VCAGYAEGGK DACQGDSGGP LVAQRPDKRW IQLGIVSWGA GCAEAEHYGV YTRVAAFRDW IAAKTDGDVP DVEGPAANDQ VASTTTSSGM KQRKSGQEEA NLAITTPPAG DTAPAGTTET PDAPANDTAP VDKPVADEPA VRPAVVQTPV IESKPGDRAL LIGIDDYEMR EAKLTGSATD VKAMQVFLAK TLAYRPEQIH TLTNRKATRE AILAEIDDWL VRQSTPGSRV FLYFSGQGSE EMGAEETTSP TLVAVDAKLV REGGKVTVTN QIRETEIAAR LNSLKDRRVT LLIDACHVGP GSRSAVAAPS GTVRCLGPAL AGLEPPNKSG KEAKFSFGGE NAMVWSAVNA GQWALVDSEA KPPLGVFTRR FIEGVQDGVA RAADKPNVSN AALLDYVRRK SDEYCRTHAG DCRFTPVPQF YGQPDALGRD VITGDEAKTA VAAVENTLKS DNEAGVAVDV LPGTAVSIGD KVAMRVSTKK SGYLILVDID ASGKLTQLFP NKRSMGLKPS AKSGDNRLDP ARPVVVPDAR NPYTGFEYVV EGPAGVGMVV AILSDKPIEV LDLPDVPTPL VGQRAAFNYV YDLARSLRIV GDDETGAQGK WSFDSKFYRI R
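The relationship between the gene sequence and the protein sequence can be spot-checked with a minimal sketch. It assumes the sequence is pasted with the 10-base spacing shown above and that the start codon is ATG, as it is here. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits, so the standard codon layout is used. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

# Standard codon layout (bases ordered T, C, A, G); amino acid assignments
# are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAGCACGT CGATCAGCAG GGTCCTCAAT"
print(translate(first_blocks))  # prints: MSTSISRVLN
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.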
|
| |