Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_06880 |
Symbol | purD |
ID | 7759641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 652983 |
End bp | 654263 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643803609 |
Product | phosphoribosylamine--glycine ligase |
Protein accession | YP_002797913 |
Protein GI | 226942840 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0151] Phosphoribosylamine-glycine ligase |
TIGRFAM ID | [TIGR00877] phosphoribosylamine--glycine ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.285428 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTAC TGATCATCGG CAGCGGCGGT CGCGAACACG CCCTGGCCTG GAAGGTGGCG CAGGATGCGC GCGTCGAGAA GGTCTTCGTC GCGCCCGGCA ACGCCGGCAC AGCTACCGAA GCCAAGTGCG AGAACGTCGC CATCGATGTG CTGGCCCTCG AGGAACTGGC CGACTTCGCC GAGAAGCATG TGCAACTGAC CATCGTCGGT CCCGAGGCCC CGCTGGTGAA GGGCGTGGTC GATCTGTTCC GCGCGCGCGG CCTGGACATC TTCGGCCCGA CCGCCGGCGC CGCCCAACTG GAAGGCTCCA AGGCCTTTAC CAAGGACTTT CTGGCCCGTC ATGACATCCC CACGGCTGCC TACCGGAACT TCACCGAGGT CGAGCCGGCC CTCGCCTACC TGCGCGAGCG GGGCGCGCCC ATCGTGGTCA AGGCCGACGG CCTGGCCGCC GGCAAGGGCG TGATCGTCGC CACGACCCTC GAGGAAGCCG AGGCCGCCGT GCGCGACATG CTCTCCGGCA ACGCCTTCGG CGACGCCGGT GCCCGCGTGG TGATCGAGGA GTTCCTCGAT GGCGAGGAAG CCAGCTTCAT CGTCATGGTG GACGGCCAGA ACGTGCTGCC GATGGCCACC AGCCAGGACC ACAAGCGCGT CGGCGACGGC GACAGCGGTC CCAACACCGG CGGCATGGGC GCCTACTCCC CGGCGCCGGT GGTCACCCCC GAGGTGCACC GGCGGGTGAT GGACGAGATC ATCCATCCGA CCGTGCGCGG CATGGCCGCC GAGGGCAACG TCTACACCGG CTTCCTGTAT GCCGGCCTGA TGATCGACAA ATCGGGCGCA CCCAAGGTCA TCGAATTCAA CTGCCGCTTC GGCGACCCGG AAACCCAGCC GGTGATGCTG CGCCTGGAGT CCTCGCTGGT GCTTCTGGTC GAAGCGGCGC TCGCCAGGGC GCTGGACAAG GTCGAGGCCC GCTGGGACCC GCGCCCGACC GTGGGCGTGG TGCTGGCCGC CGGCGGCTAC CCGGGCGATT ATGCCAAGGG TGACGTCATC GAGGGCCTGG ACGACGCCGC GCAGCTCGAC GGCAAGGTGT TCCACGCCGG CACCGCGCTC AAGGACGGGC GCATCGTCAC CAGCGGCGGC CGCGTGCTCT GCGCCACCGC CATCGGCCCG AGCGTATCCT CCGCGCAGCA GCAGGCCTAC CGCCTGGCGG AGAAGATCCG CTGGAGCGGT TGCTTCTACC GCAAGGACAT CGGCTACCGC GCCATTGCCC GGGAAGGCTG A
|
Protein sequence | MNVLIIGSGG REHALAWKVA QDARVEKVFV APGNAGTATE AKCENVAIDV LALEELADFA EKHVQLTIVG PEAPLVKGVV DLFRARGLDI FGPTAGAAQL EGSKAFTKDF LARHDIPTAA YRNFTEVEPA LAYLRERGAP IVVKADGLAA GKGVIVATTL EEAEAAVRDM LSGNAFGDAG ARVVIEEFLD GEEASFIVMV DGQNVLPMAT SQDHKRVGDG DSGPNTGGMG AYSPAPVVTP EVHRRVMDEI IHPTVRGMAA EGNVYTGFLY AGLMIDKSGA PKVIEFNCRF GDPETQPVML RLESSLVLLV EAALARALDK VEARWDPRPT VGVVLAAGGY PGDYAKGDVI EGLDDAAQLD GKVFHAGTAL KDGRIVTSGG RVLCATAIGP SVSSAQQQAY RLAEKIRWSG CFYRKDIGYR AIAREG
|
| |