Gene Information |
Locus tag | Rleg2_1095 |
Symbol | |
ID | 6979814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 1114597 |
End bp | 1115670 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643395807 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_002280615 |
Protein GI | 209548698 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.154044 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00642232 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCAGT CTGGGAAAAA CGGCCTGACC TACAGCGACG CGGGCGTCGA CATCGATGCC GGCAACCTCC TCGTCGAGAA GATCAAACCG GCGGTGCGCT CGACCCGCCG CCCCGGCGCT GACGGCGAGA TCGGCGGCTT CGGCGGGCTT TTCGATCTCA AGGCCGCCGG CTTTAACGAC CCGGTTCTCG TTGCCGCCAA TGACGGCGTC GGCACCAAGC TGAAGATCGC CATCGATGCC GATTATCACG ACACCGTCGG CATCGACCTC GTCGCCATGT GCGTCAACGA TCTCGTGGTG CAGGGCGCCG AGCCGCTGTT TTTCCTCGAT TATTTCGCCA CCGGCAAGCT CGACCCCGAC CAGGGTGCTG CGATCGTCGG CGGCATCGCC GCCGGCTGCC GGCAGGCCGG CTGCGCGCTG ATCGGCGGCG AGACGGCCGA AATGCCCGGC ATGTATTCCT CCGGCGACTA TGATCTCGCC GGTTTTGCCG TCGGCGCTGC CGAACGCGGC AAGCTGCTGC CCTCGGGCGA TATCGCCGAG GGCGATGTGA TCCTCGGCCT CGCCTCCTCC GGCGTGCATT CCAACGGTTT CTCGCTGGTG CGCAAGATCG TCGAACTCTC CGGCCTCGGC TGGGATGCGC CGGCGCCATT TGCCAGCGAT AAGAAGCTCG GCGAGGCCCT GCTCGAGCCG ACGCGCATCT ATGTGAAGCC GCTTCTGAAG GCGATCCGCG AGACCGGCGC CATCAAGGCG CTGGCCCACA TCACCGGCGG CGGCTTCCCG GAAAACATCC CGCGCGTGCT GCCGAAGCAT CTGGCGGCCG AGATCGATCT TGCCGCCGTC AAGGCTCCGC CGGTGTTTTC GTGGCTCGCC AGGACGGGCG GCGTCGAAAC CAAGGAGATG CTGCGCACCT TCAACTGCGG CGTCGGCATG ATCGCCGTCG TCGCTAGCGA GAATGTCGCG GCGGTTTCCG CCGCACTCGA GGCCGAGGGC GAAACCGTCG TCACGCTCGG CCGCATGATC GCCCGAGACG AGGGTGCAGC CGGCACGGTC TATCAGGGCA CGCTTGCCCT ATGA
Protein sequence | MSQSGKNGLT YSDAGVDIDA GNLLVEKIKP AVRSTRRPGA DGEIGGFGGL FDLKAAGFND PVLVAANDGV GTKLKIAIDA DYHDTVGIDL VAMCVNDLVV QGAEPLFFLD YFATGKLDPD QGAAIVGGIA AGCRQAGCAL IGGETAEMPG MYSSGDYDLA GFAVGAAERG KLLPSGDIAE GDVILGLASS GVHSNGFSLV RKIVELSGLG WDAPAPFASD KKLGEALLEP TRIYVKPLLK AIRETGAIKA LAHITGGGFP ENIPRVLPKH LAAEIDLAAV KAPPVFSWLA RTGGVETKEM LRTFNCGVGM IAVVASENVA AVSAALEAEG ETVVTLGRMI ARDEGAAGTV YQGTLAL
| |
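The length and composition fields in this record can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene sequence includes its stop codon (it ends in TGA here). The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; spaces between 10-base blocks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the Gene Information table of this record.
length = gene_length(1114597, 1115670)
print(length)                  # 1074
print(protein_length(length))  # 357
```

Passing the Gene sequence value to `gc_fraction` gives a figure that can be compared with the GC content field. The strand field does not affect this comparison, because the G+C fraction is the same on either strand.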