Gene Rleg_1241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1241 
Symbol 
ID8012346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1216270 
End bp1217343 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content65% 
IMG OID644823822 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_002975072 
Protein GI241203976 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.853192 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.000530935 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCCAGT CTGGGAAAAA CGGCCTGACA TATAGCGATG CGGGCGTCGA CATCGATGCC 
GGCAACCTCC TCGTCGAGAA GATCAAGCCG GCGGTGCGCT CGACCCGCCG TCCCGGCGCC
GACGGCGAGA TCGGCGGCTT CGGCGGGCTT TTCGATCTCA AGGCGGCAGG TTTTACCGAT
CCGGTTCTCG TCGCCGCCAA TGACGGCGTC GGCACCAAGC TGAAGATCGC GATCGATGCC
GACTATCACG ACACCGTCGG GATCGACCTC GTCGCCATGT GCGTCAACGA TCTCGTGGTA
CAGGGCGCCG AGCCGCTGTT CTTCCTGGAT TATTTCGCGA CCGGCAAGCT CGACCCCGAT
CAGGGTGCGG CCATCGTCGG CGGCATTGCC GCCGGCTGCC GCGAGGCCGG CTGCGCGCTG
ATCGGCGGCG AGACGGCCGA AATGCCCGGC ATGTATTCCT CCGGCGACTA CGATCTCGCC
GGCTTTGCTG TCGGTGCGGC CGAACGCGGC AAGCTCCTGC CATCCGGCGA TATTGCCGAG
GGTGACGTGA TCCTCGGCCT TGCCTCCTCC GGCGTTCATT CCAACGGCTT CTCGCTGGTG
CGCAAGATCG TCGAATTGTC CGGCCTCGAT TGGGATGCAC CGGCGCCGTT CGCCGAGGGC
AAGAAGCTCG GCGAAGCCCT GCTCGAGCCG ACACGCATCT ATGTGAAGCC GCTTCTGAAG
GCGATCCGCG AGACCGGCGC CATCAAGGCG CTGGCGCACA TCACCGGCGG CGGCTTCCCG
GAAAATATTC CGCGCGTGCT GCCGAAGCAT CTGGCGGCCG AGATCGATCT TGCCGCCGTC
AAGGTGCCGC CGGTGTTTTC ATGGCTCGCC AAGACGGGCG GCGTCGAATC CAAGGAAATG
CTGCGCACCT TCAACTGCGG TGTCGGCATG ATCGCCGTTG TTGCCGGCGA GAATGTCGCG
ACGGTTTCCG CGGCTCTCGA AGCCGAGGGC GAGACAGTTA TCACGCTCGG CCGCATGATC
GCCCGCGAAG AAGGCGCTGC CGGCACGGTC TACAAGGGCA CGCTTGCCAT ATGA
 
Protein sequence
MSQSGKNGLT YSDAGVDIDA GNLLVEKIKP AVRSTRRPGA DGEIGGFGGL FDLKAAGFTD 
PVLVAANDGV GTKLKIAIDA DYHDTVGIDL VAMCVNDLVV QGAEPLFFLD YFATGKLDPD
QGAAIVGGIA AGCREAGCAL IGGETAEMPG MYSSGDYDLA GFAVGAAERG KLLPSGDIAE
GDVILGLASS GVHSNGFSLV RKIVELSGLD WDAPAPFAEG KKLGEALLEP TRIYVKPLLK
AIRETGAIKA LAHITGGGFP ENIPRVLPKH LAAEIDLAAV KVPPVFSWLA KTGGVESKEM
LRTFNCGVGM IAVVAGENVA TVSAALEAEG ETVITLGRMI AREEGAAGTV YKGTLAI