Gene Information |
Locus tag | Mext_2063 |
Symbol | |
ID | 5834840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 2298813 |
End bp | 2299880 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641367861 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_001639530 |
Protein GI | 163851487 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
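The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that the record does not state, though it is consistent with the listed gene length) and that the final codon is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2298813
end_bp = 2299880
protein_length_aa = 355

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# The last codon is assumed to be the stop codon, which is not translated.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1068
print(remainder)                # 0
print(expected_protein_length)  # 355
```

Under these assumptions the 1068 bp gene length and the 355 aa protein length agree with each other.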
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.187732 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGGCGC AGGACGGAAA CGGGCTCACC TACGCGCAGG CCGGCGTCGA CATCGACGCG GGCAACGCGC TCGTCGAGAC GATCAAGCCG CTGGTGCGCG CCACGCGCCG GCCGGGGGCG GATGCGGAGA TCGGCGGCTT CGGCGGCCTT TTCGACCTCA AGGCCGCCGG CTTCAAGGAT GCGATCCTGG TCGCTGCCAA TGACGGCGTC GGGACCAAGG TGAAGATCGC CATCGAGACC GGACGCCACC ACACGATCGG CATCGATCTC GTGGCGATGT GCGTCAACGA CATCATCGTC CAGGGCGCCG AGCCGCTGTT CTTCCTCGAC TACTACGCCA CTGGCAAACT CGTGCCGGGC GTCGGCGCCG ACATCGTGCG CGGCATCGCC GAAGGCTGCC GGCAAGCGGG CTGCGCGCTG ATCGGCGGCG AGACCGCCGA GATGCCGGGC CTCTATGACG GCTCCGACTA CGATCTCGCC GGCTTCTCGG TGGGCGCGGC CGAGCGCGGC ACGCTGCTGC CGCGCCCCGG CATCCTGCCC GGCGACGTCG TGCTCGGCCT GCCCTCCTCA GGCGTGCACT CGAACGGGTT CTCGCTGGTG CGGCGGATCG TGGCCAAGAC CGGCCTCGGC TATGACGCCG ACGCGCCGTT CGCGCCGGGC CGCAGCCTCG GCGAGGCGCT GCTGGAGCCG ACCCGGATCT ACGTGAAGCC GCTGCTCGCC GCCCTGAAGC GGGCCGGTGG CATCCAGGCG TTCGCCCACA TCACCGGCGG CGGCTTCCCC GACAACCTCC CCCGCGTCCT GCCCGACGGC GTCGGCATCG CAATCGACCT CTCGGCGATC GCCGTGCCGC CGGTCTTCGG CTGGCTGGCG CGCGAGGGCG GGGTCGCGGA AGCGGAGATG CTGCGCACCT TCAACTGCGG CATCGGCATG GTGGTGGTCG CCGCCGCCGA CGCCGCCGAC GCCGTGGCGG ACGCGCTGAC CGAGGCCGGC GAGGCGCCGG TCCGGCTCGG GCACATCACC GAGCGCGGGG CGGAAGCCGT GACCTTCACG GGGCAGCTCG CCCTGTGA
Protein sequence | MTAQDGNGLT YAQAGVDIDA GNALVETIKP LVRATRRPGA DAEIGGFGGL FDLKAAGFKD AILVAANDGV GTKVKIAIET GRHHTIGIDL VAMCVNDIIV QGAEPLFFLD YYATGKLVPG VGADIVRGIA EGCRQAGCAL IGGETAEMPG LYDGSDYDLA GFSVGAAERG TLLPRPGILP GDVVLGLPSS GVHSNGFSLV RRIVAKTGLG YDADAPFAPG RSLGEALLEP TRIYVKPLLA ALKRAGGIQA FAHITGGGFP DNLPRVLPDG VGIAIDLSAI AVPPVFGWLA REGGVAEAEM LRTFNCGIGM VVVAAADAAD AVADALTEAG EAPVRLGHIT ERGAEAVTFT GQLAL
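The sequences above are printed in space-separated groups of ten, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be normalized and checked against the GC content and translation table fields. The function names are hypothetical helpers, not part of any database tool, and the stop codon set is the standard one for translation table 11.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits the record's sequence into groups."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_with_stop_in_frame(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Illustration on the first 20 bases of the gene sequence only.
prefix = clean_sequence("ATGACGGCGC AGGACGGAAA")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.6
```

Passing the full gene sequence string through `clean_sequence` and then `gc_fraction` would give a value to compare with the GC content field, and `ends_with_stop_in_frame` would check that the sequence is a whole number of codons ending in a stop codon.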