Gene Information |
Locus tag | Rleg2_5799 |
Symbol | |
ID | 6977188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 206401 |
End bp | 210012 |
Gene Length | 3612 bp |
Protein Length | 1203 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643393254 |
Product | 5-oxoprolinase (ATP-hydrolyzing) |
Protein accession | YP_002278072 |
Protein GI | 209546182 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit |
TIGRFAM ID | |
|
|
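The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 206401
end_bp = 210012

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # prints: 3612 1203
```

Both results match the Gene Length (3612 bp) and Protein Length (1203 aa) fields.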
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0523355 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGGAC AGTGGGATTT CTGGATCGAT CGCGGCGGCA CCTTCACCGA TATCGTCGCA CGCCGTCCCG ACGGTGCGCT GATCGCCCAC AAGCTGCTTT CGGAAAATCC GGAAGCTTAT CGCGACCCGG CCGTGCACGG CATTCGCGAA TTGCTGGGGC TCAAAGCCGG TGACCCGGTC CCGTCCGAGC GCATCGGCGC GGTCAAGATG GGAACGACGG TCGCCACCAA TGCCCTTCTG GAACGCAAAG GCGATCCGAC CCTCCTGGTG ACGACCAAAG GTTTCCGCGA TGCGCTGGAG ATCGGTTACC AGGCCCGCGC CGATATCTTC GCCAAGAAGA TCGTCAAGCC GGAACTGCTC TATGCCGGCG TCATCGAGGC CGACGAGCGC GTGCTGGCCG ATGGCACGGT GGAACGCCCG CTCGACGAGG ATAGGCTGCG ACGAGCGCTC GAAGCCGCCT ATGCCGAGGG GCTGCGCGCC GTCGCCATCG TCTTCATGCA CGCCTACCGC TATCCCGAGC ATGAGCAGCG GGCCGCCGCC ATTGCGCGCG CGATCGGCTT CACACAGATC TCTCCCTCGC ACGTCGTTTC CCCGCTAATC AAGCTCGTCG GCCGCGGCGA CACCGCCGTC GTTGACGCCT ATCTGTCTCC GGTTCTCCGG CGCTATGTCG ATCAGGTCGC CACCGAACTC GGCGCCGTCG AGGGACAAGG CCCGAAGCTG ATGTTCATGC AGTCCTCCGG CGGGCTGACC GACGCGCATC TGTTCCAGGG CAAGGACGCG ATCCTCTCGG GCCCCGCCGG CGGCGTGGTC GGCGCGGTCG AGGTGTCGCG CATTGCCGGC TTCGGCCAGA TGATCGGTTT CGACATGGGC GGCACCTCGA CGGATGTGTC GCATTATGAC GGTGAACTGG AGCGCGCCTT CGAGACCGAA GTCGCCGGCG TGCGCATGCG CGCGCCGATG ATGAAGATCC ACACGGTGGC CGCCGGCGGC GGTTCGATCC TGAGCTTTGA CGGTTCGCGC TTCCGCGTCG GCCCTGAATC GGCCGGCGCC ACGCCCGGCC CGAAATCCTA TCGCCGCGGC GGCCCGCTCA CCGTCACCGA CGCCAACATC ATGACCGGCA AGCTGCTGCC GGAATTCTTC CCGGCGATCT TCGGCCCGGG ACAGGACCAG CCGCTCGATG CCGAGGCGGT GCACGCCGCC TTTGCCGAGA TGGCAAAGAC GATCGGCGGC GGCCGGACGG CTGAGGACGC CGCCGACGGT TTCCTCGCCA TCGCCGTCGA GAACATGGCC AATGCCATCA AGAAGATCTC GGTCCAGCGC GGTTATGACG TTTCGGGCTA TGCGCTCACC TGTTTCGGCG GCGCCGGCGG CCAGCATGCC TGCCTCGTCG CCGACAGCCT CGGCATGAAA AGAGTGCTGA TCCACCCCTT CTCCGGCATC CTTTCGGCCT ATGGCATGGG TCTTGCCGAT ATCCGCGCCA CCCGCCAGCG CGCGGTGCTG ACCGAACTCG CTACGGCGTT GGTGACGATC GGCGAGATCA GGGCCAGGTT GGAGGCGGAG GTGCGCGAGG AGTTGACGCT GCAAGGCGTC GAGACCGCCG ACATGGAGGT CGTCACCCGG CTGCACCTGC AATATAAGGG CACCGACACC GCCCTACCCG TCGCCTTCGG GCCGCAGGAA GAGATGGTGC AAGCCTTTGC CGTCGCCCAT AAGAAACAGT TCGGCTTCAT CTTCGAAGAC CGGCCCGTTG TCGTCGATTC CATCGAAGTC GAGGGCATCG GCGGCGGCGC CGATATCGAG 
GAAACCTATA GGGAGGCGAA AGTCTTCGAG CCGGAAGCGC TGCGCACGAC CCGCTTCTAT TCCGGCGGAA CATGGCAGGA CGCCGGCATC TTCAAGCGCG AAGCGCTGAA GCCGGGCGCC ATCCTCAAAG GGCCTGCCCT CATCATCGAA GCGCATCAGA CGATCGTCGT CGAAGCCGGC TGGCAGGCGC GGCTCACCGG CCACGATCAT ATCGTCCTCA CCCGCGAGAT CCCGCTTGCC CGCCATGCCG CGATCGGCAC CAGCGCCGAT CCGGTCATGC TCGAAGTCTT CAACAATCTG TTCATGGCGA TTGCCGAGCA GATGGGCGTG ACGCTGCAGA ACACCGCGCA TTCGGTCAAT ATCAAGGAAC GGCTCGATTT CTCCTGCGCC GTCTTCGACC GCACCGGTGC GCTCGTCGCC AATGCGCCGC ATATGCCCGT GCATCTCGGC TCGATGGACC GTTCGGTCGA AACGATCATC CGCCTCAACG AAGACCGTAT CCGCCGGGGC GACGTCTTTG CGCTCAACGC GCCTTATAAT GGCGGCACGC ATCTGCCCGA CATCACCGTC GTCACCCCGG TTTTCGACGA TGCCGGAGCG GTGATCCTGT TTTACGTCGC CTCGCGTGGC CATCATGCCG ATATCGGCGG CAAGGCGCCG GGCTCGATGA CGCCGCGGGC GACGAAGGTC GACGAGGAAG GCGTGCTGAT CGACAATTTC CTGCTGGTCG ACAAAGGCCG TTTCCGCGAA GCGGACTTCG CGGCGATGCT GCAGGATCAT CCCTACCCCG CCCGCAATCC GGCCCAGAAC CTCGCCGACG TGAAGGCGCA GATCGCCGCC AACGAAAAGG GCGTGCAGGA GTTGCGCAAG ATGGTTTCCC ATTTCGGGCT CGACGTCGTC GAAGCCTATA TGGGCCATGT GCAGGACAAT GCCGAGGAAA GTGTGCGCCG GGTGATCGCC CGCCTCAGCG ACAGCGAATT TACCTATCCC ACCGATCAGG GCGCCGTCAT CAAGGTGAAG ATCACCGTCG ACAGGCAGGC ACGTGAGGTG ACGGTCGATT TCACCGGCAC GAGCGCGCAG CAGCCGACCA ATTTCAATGC GCCGGAACCG GTGACGCGCG CCGCCGTGCT CTATGTCTTC CGCGTCATGG TCGAGCAGCC GATCCCGATG AATGCCGGCT GCCTGCGGCC GATCCGGATC ATCGTGCCGG ACGGATCGAT GCTGCGCCCG GCCTATCCGG CCGCCGTCGT CGCCGGCAAT GTCGAGACCA GCCAGCATGT GACGAATGCG CTGTTCGGGG CGCTCGGAAC GCTCGCGGCA GCCCAGGGCT CGATGAACAA CCTGACCTTC GGCAATGCTG CCTATCAATA TTACGAGACG ATCTGCGCCG GCGGCCCCGC TGGCCTGCTC AACGATGGCA CCGGCTTTAG CGGCGCCGAT GGCGTGCACA CGCATATGAC CAATTCGCGG CTGACCGATC CTGAAGTGCT GGAATTCCGC TTTCCCGTCG TGCTCGAGGA TTTCCACATC CGCCGCGGCT CCGGCGGCAA GGGTCAGTAT AGCTCCGGCG GCGGCACCGA GCGCACCATC CGCTTCCTGG AGACGATGGA TTGCAGCATT CTCTCCTCGC ATCGCACGAT CCGACCGTTC GGCCTCTTGG GCGGGGAAGA CGGGCAATTG GGAAAAACCG AAATCCGTCG CGCCGACGGC AAGGTCGAAC GGCTGGAGGG CGCTGACCAG GCAATGCTCG TCGCCGGCGA CGCCGTGATC GTCACGACGC CGACCGGCGG CGGCTACGGC AAACCGGTTT AG
|
Protein sequence | MGGQWDFWID RGGTFTDIVA RRPDGALIAH KLLSENPEAY RDPAVHGIRE LLGLKAGDPV PSERIGAVKM GTTVATNALL ERKGDPTLLV TTKGFRDALE IGYQARADIF AKKIVKPELL YAGVIEADER VLADGTVERP LDEDRLRRAL EAAYAEGLRA VAIVFMHAYR YPEHEQRAAA IARAIGFTQI SPSHVVSPLI KLVGRGDTAV VDAYLSPVLR RYVDQVATEL GAVEGQGPKL MFMQSSGGLT DAHLFQGKDA ILSGPAGGVV GAVEVSRIAG FGQMIGFDMG GTSTDVSHYD GELERAFETE VAGVRMRAPM MKIHTVAAGG GSILSFDGSR FRVGPESAGA TPGPKSYRRG GPLTVTDANI MTGKLLPEFF PAIFGPGQDQ PLDAEAVHAA FAEMAKTIGG GRTAEDAADG FLAIAVENMA NAIKKISVQR GYDVSGYALT CFGGAGGQHA CLVADSLGMK RVLIHPFSGI LSAYGMGLAD IRATRQRAVL TELATALVTI GEIRARLEAE VREELTLQGV ETADMEVVTR LHLQYKGTDT ALPVAFGPQE EMVQAFAVAH KKQFGFIFED RPVVVDSIEV EGIGGGADIE ETYREAKVFE PEALRTTRFY SGGTWQDAGI FKREALKPGA ILKGPALIIE AHQTIVVEAG WQARLTGHDH IVLTREIPLA RHAAIGTSAD PVMLEVFNNL FMAIAEQMGV TLQNTAHSVN IKERLDFSCA VFDRTGALVA NAPHMPVHLG SMDRSVETII RLNEDRIRRG DVFALNAPYN GGTHLPDITV VTPVFDDAGA VILFYVASRG HHADIGGKAP GSMTPRATKV DEEGVLIDNF LLVDKGRFRE ADFAAMLQDH PYPARNPAQN LADVKAQIAA NEKGVQELRK MVSHFGLDVV EAYMGHVQDN AEESVRRVIA RLSDSEFTYP TDQGAVIKVK ITVDRQAREV TVDFTGTSAQ QPTNFNAPEP VTRAAVLYVF RVMVEQPIPM NAGCLRPIRI IVPDGSMLRP AYPAAVVAGN VETSQHVTNA LFGALGTLAA AQGSMNNLTF GNAAYQYYET ICAGGPAGLL NDGTGFSGAD GVHTHMTNSR LTDPEVLEFR FPVVLEDFHI RRGSGGKGQY SSGGGTERTI RFLETMDCSI LSSHRTIRPF GLLGGEDGQL GKTEIRRADG KVERLEGADQ AMLVAGDAVI VTTPTGGGYG KPV
|
| |
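The GC content field can be recomputed from a gene sequence given in the space-grouped layout used above. The following is a minimal sketch; the helper name `gc_percent` is hypothetical, and it assumes the sequence contains only the letters A, C, G and T plus whitespace.

```python
def gc_percent(grouped_sequence: str) -> float:
    """Return the percentage of G and C bases in a whitespace-grouped DNA string."""
    # Drop the spaces and line breaks that separate the 10-base groups.
    seq = "".join(grouped_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100 * gc / len(seq)


# Example on the first two 10-base groups of the gene sequence only.
first_groups = "ATGGGCGGAC AGTGGGATTT"
print(gc_percent(first_groups))
```

Passing the full Gene sequence field instead of the two-group example gives the value to compare against the GC content row.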