Gene Avin_36890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_36890 
SymbolpurM 
ID7762583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3744719 
End bp3745774 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content68% 
IMG OID643806556 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_002800810 
Protein GI226945737 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGC AACCCTCCCT GAGCTACAAG GACGCCGGTG TCGACATCGA CGCAGGTGAA 
GCGCTGGTCG AACGCATCAA GGGCGTCGCC AGGCGCACCG CGCGCCCCGA AGTGATGGGT
GGCCTGGGCG GCTTCGGCGC CCTGTGCGAA ATCCCGGCCG GCTACAGGCA ACCGGTACTG
GTTTCCGGTA CCGACGGTGT CGGCACCAAG CTGCGCCTGG CGATGAACCT CGGCAAGCAC
GACAGCATCG GCATCGACCT GGTGGCCATG TGCGTCAACG ATCTGGTGGT ATGCGGCGCC
GAACCGCTGT TCTTCCTCGA CTACTACGCC ACCGGCAAGC TCAACGTCGA CGTCGCCGCC
CGCGTGGTCG CCGGCATCGG CGAGGGCTGC GAGATGGCCG GCTGCGCGCT GGTCGGTGGC
GAGACAGCCG AGATGCCCGG CATGTACGAG GGCGAGGACT ACGACCTGGC CGGCTTCTGC
GTTGGCGTGG TGGAGAAGAG CGAAATCATC GACGGCGCGA AGGTCGCCGC CGGCGACGCG
CTGATCGCCC TGCCCTCCTC CGGCCCGCAC TCCAACGGCT ACTCGCTGAT CCGCAAGATC
CTCGAGCTGA GCGGCACCGA CGTGGCCGGC GCCACCCTGG ACGGCAAGCC GCTGGCCGAC
CTGCTGATGG CGCCGACGCG CATCTACGTC AAGCCGCTGC TCAAGCTCAT CCGGGAAACC
GGCGCGGTCA AGGCCATGGC CCACATCACC GGTGGCGGCC TCACCGAGAA TATCCCGCGC
GTGCTGCCGC AGGGCACCCG GGCGGTGATC GACGTGGCCA GTTGGACCCG CCCGGCGGTG
TTCGACTGGC TGCAGGAAAA AGGCAACGTC GACGAGCGCG AAATGCACCG TGTGCTCAAC
TGCGGCGTCG GCATGGTCGT CTGCGTCGCT CGGGACAAGG TCGAGCAGGC CCTCGCCGTG
CTGCGCGCGG CAGGCGAGCA GCCCTGGCTG ATCGGCGACA TCGCCGCCGG CGACGGCGCC
GAGCGCGTCC AGTTGCACAA CCTGAAGGCG CACTGA
 
Protein sequence
MSKQPSLSYK DAGVDIDAGE ALVERIKGVA RRTARPEVMG GLGGFGALCE IPAGYRQPVL 
VSGTDGVGTK LRLAMNLGKH DSIGIDLVAM CVNDLVVCGA EPLFFLDYYA TGKLNVDVAA
RVVAGIGEGC EMAGCALVGG ETAEMPGMYE GEDYDLAGFC VGVVEKSEII DGAKVAAGDA
LIALPSSGPH SNGYSLIRKI LELSGTDVAG ATLDGKPLAD LLMAPTRIYV KPLLKLIRET
GAVKAMAHIT GGGLTENIPR VLPQGTRAVI DVASWTRPAV FDWLQEKGNV DEREMHRVLN
CGVGMVVCVA RDKVEQALAV LRAAGEQPWL IGDIAAGDGA ERVQLHNLKA H