Gene Information |
Locus tag | Avin_36890 |
Symbol | purM |
ID | 7762583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3744719 |
End bp | 3745774 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643806556 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_002800810 |
Protein GI | 226945737 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
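The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon (both assumptions are consistent with the reported values, but the record does not state them); the function names are hypothetical:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3744719, 3745774)
print(length, protein_length(length))  # prints: 1056 351
```

Both results agree with the Gene Length (1056 bp) and Protein Length (351 aa) fields.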
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGC AACCCTCCCT GAGCTACAAG GACGCCGGTG TCGACATCGA CGCAGGTGAA GCGCTGGTCG AACGCATCAA GGGCGTCGCC AGGCGCACCG CGCGCCCCGA AGTGATGGGT GGCCTGGGCG GCTTCGGCGC CCTGTGCGAA ATCCCGGCCG GCTACAGGCA ACCGGTACTG GTTTCCGGTA CCGACGGTGT CGGCACCAAG CTGCGCCTGG CGATGAACCT CGGCAAGCAC GACAGCATCG GCATCGACCT GGTGGCCATG TGCGTCAACG ATCTGGTGGT ATGCGGCGCC GAACCGCTGT TCTTCCTCGA CTACTACGCC ACCGGCAAGC TCAACGTCGA CGTCGCCGCC CGCGTGGTCG CCGGCATCGG CGAGGGCTGC GAGATGGCCG GCTGCGCGCT GGTCGGTGGC GAGACAGCCG AGATGCCCGG CATGTACGAG GGCGAGGACT ACGACCTGGC CGGCTTCTGC GTTGGCGTGG TGGAGAAGAG CGAAATCATC GACGGCGCGA AGGTCGCCGC CGGCGACGCG CTGATCGCCC TGCCCTCCTC CGGCCCGCAC TCCAACGGCT ACTCGCTGAT CCGCAAGATC CTCGAGCTGA GCGGCACCGA CGTGGCCGGC GCCACCCTGG ACGGCAAGCC GCTGGCCGAC CTGCTGATGG CGCCGACGCG CATCTACGTC AAGCCGCTGC TCAAGCTCAT CCGGGAAACC GGCGCGGTCA AGGCCATGGC CCACATCACC GGTGGCGGCC TCACCGAGAA TATCCCGCGC GTGCTGCCGC AGGGCACCCG GGCGGTGATC GACGTGGCCA GTTGGACCCG CCCGGCGGTG TTCGACTGGC TGCAGGAAAA AGGCAACGTC GACGAGCGCG AAATGCACCG TGTGCTCAAC TGCGGCGTCG GCATGGTCGT CTGCGTCGCT CGGGACAAGG TCGAGCAGGC CCTCGCCGTG CTGCGCGCGG CAGGCGAGCA GCCCTGGCTG ATCGGCGACA TCGCCGCCGG CGACGGCGCC GAGCGCGTCC AGTTGCACAA CCTGAAGGCG CACTGA
|
Protein sequence | MSKQPSLSYK DAGVDIDAGE ALVERIKGVA RRTARPEVMG GLGGFGALCE IPAGYRQPVL VSGTDGVGTK LRLAMNLGKH DSIGIDLVAM CVNDLVVCGA EPLFFLDYYA TGKLNVDVAA RVVAGIGEGC EMAGCALVGG ETAEMPGMYE GEDYDLAGFC VGVVEKSEII DGAKVAAGDA LIALPSSGPH SNGYSLIRKI LELSGTDVAG ATLDGKPLAD LLMAPTRIYV KPLLKLIRET GAVKAMAHIT GGGLTENIPR VLPQGTRAVI DVASWTRPAV FDWLQEKGNV DEREMHRVLN CGVGMVVCVA RDKVEQALAV LRAAGEQPWL IGDIAAGDGA ERVQLHNLKA H
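The gene and protein sequences can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method: it strips the spaces that separate the 10-character blocks, computes percent GC, and translates codon by codon until the first stop. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in the alternative start codons it permits, which does not matter here because the gene starts with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGAGCAAGC AACCCTCCCT GAGCTACAAG"))  # prints: MSKQPSLSYK
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (spaces removed) checks the whole record; rounding `gc_content` of the full gene sequence gives a value to compare with the reported 68%.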