Gene Rleg2_6261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6261 
Symbol 
ID6983334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011370 
Strand
Start bp203740 
End bp205473 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content57% 
IMG OID643399270 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_002284026 
Protein GI209552110 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0857847 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.425917 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAACA GAAATATCGA TCCTATCAGC TTCGCGGTCA TAAAGAGCGC GCTGGACACA 
ATCGTCGACG ACATGGCCTA CGCGGTGATG CGCACTGCCC GCTCGCCGAT CGTACGTGAC
ATTCTGGATT ACTCGGCGAC GCTTTGCGAT CGCGAAGGCC GGATCCTGAC CCAGGCGAAA
ACAGTAGCTC TACATCTCGG TGCGGTACCG GACGCGATGG AAGTTATCAC CAGCCGCTTC
TCGGCGACTG CCCGACCTGG TGACGTATTC ATCTTGAATG ATCCCTATCA GGGTGGCATG
CACCTCCCCG ACATCTTCAT GTTTAAGCCG CTGTTCTTCC GGGACAAGCT TGAAGGGTTC
TCCGTGGTCA TCTTCCATCA CTGCGACGTC GGTGGCCGTG TCCCAGGTTC CAACGCGGCA
GATTCTACCG AAATCTTCCA GGAAGGCATC CGGATTCCGC CTGTGAAGCT CTACGACAAA
GGTGAGCCCA ACAACTGGAT CTTCGACATC ATCAGGGAGA ACGTTCGTCT TCCCGATCTC
GTTATCGGCG ACCTAGAATC GCAGCTTGCC ACCTGCAATA TTGGCGAGCG TGAATATCTA
AAGCTCTTCG AACGCCACGG ATCTGAGGTT CTCAATGAGT ACTTCGACGA GCTCATGGAC
TATGGCGAGC AGATGACCCG GAAGGCGATC TCTTCTTGGC CGGACGGCGA CTACGAGTTC
ACCGACTATG TCGATGGCGA TGGTTTTAGC ACCGCACCGA TCCCTATCAA GTGCAAGATG
ACGGTTGCAG GCGATCATCT GACGGTGGAT TTCGAAGGCA CATCGCCGCA GGTTCGCGGC
GCAATCAATC CGACCTTCTC TTTTACGAAG TCTGCCACTT ACCTCACCAT CCGATGCGCC
CTCGATCAAG ACGTTCCGAA CAATGCTGGC GTTTATCGCG CGATCACCGT CAAGGCTCCG
CTCGGGACTA TCCTCAATCC AATCTCCCCG GCGCCAGTCG CGGCCCGCGC TCTGACCGGA
TACCGTGTCA TGGATACCGT GATGGGCGCC TTGGCTCAGG TGGCTCCGAA AAAGGTGATC
GCCGCCAGTG AGGGGGGGAA CACCGTTATT GCGTTCGGCG GCTACGACAA GAAGTCCGGA
GAGCCCTTCA TCCTCGTTGA TATGATCAAC GGAGCTTGGG GCGGCCGCTT CAATAAGGAC
GGTATCGAAG GCGTGACCAA CCCGGGGCAA AACCTCTCAA ACCTGCCTGT CGAGAGCCTC
GAGGCACGGT ACCCTCTGCG CATTGACGAA TACTGCCTCC GTGACGACTC CTGTGGTGCG
GGCGAATTTC GTGGCGGACT TGGTCTCGCC CGACAGTATC GGTTCCTGGC CGACGAAGCG
ATTCTTCAAA TTCGTGCGGA CCGATACGCC CATGCTCCAT ACGGGTTGTT CGGAGGGGAA
GCGGCAGCAT TCACCCGAAA CCTGCTCGAC CCGGGTAGCG AAGGCGAAGT CCTCCTTCCA
TCGAAGGTCA CACGGCAGGT TGAAAAAGGT CTCGTCTTCC GTCACGAGCA GTCGGGTGGT
GGTGGTTACG GCGACCCGCT GAAGCGCTCC CTTGAGCTGA TCTCCAAGGA TCTCGGCAAC
GGAAAGATCT CGCGTCGTTA CGCAGAGGAA AAGCATGCTG TTGTCTTTGT CGGAGATGGC
TTTGAGATCG ATCGCGCCGG AACAGAAGCG GCCCGCGACG TACGCTCCGC ATAG
 
Protein sequence
MTNRNIDPIS FAVIKSALDT IVDDMAYAVM RTARSPIVRD ILDYSATLCD REGRILTQAK 
TVALHLGAVP DAMEVITSRF SATARPGDVF ILNDPYQGGM HLPDIFMFKP LFFRDKLEGF
SVVIFHHCDV GGRVPGSNAA DSTEIFQEGI RIPPVKLYDK GEPNNWIFDI IRENVRLPDL
VIGDLESQLA TCNIGEREYL KLFERHGSEV LNEYFDELMD YGEQMTRKAI SSWPDGDYEF
TDYVDGDGFS TAPIPIKCKM TVAGDHLTVD FEGTSPQVRG AINPTFSFTK SATYLTIRCA
LDQDVPNNAG VYRAITVKAP LGTILNPISP APVAARALTG YRVMDTVMGA LAQVAPKKVI
AASEGGNTVI AFGGYDKKSG EPFILVDMIN GAWGGRFNKD GIEGVTNPGQ NLSNLPVESL
EARYPLRIDE YCLRDDSCGA GEFRGGLGLA RQYRFLADEA ILQIRADRYA HAPYGLFGGE
AAAFTRNLLD PGSEGEVLLP SKVTRQVEKG LVFRHEQSGG GGYGDPLKRS LELISKDLGN
GKISRRYAEE KHAVVFVGDG FEIDRAGTEA ARDVRSA