Gene Information |
Locus tag | Avin_01200 |
Symbol | purK |
ID | 7759087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 121234 |
End bp | 122319 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643803046 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_002797362 |
Protein GI | 226942289 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
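The coordinate and length fields in this table are mutually consistent, which a short sketch can check. It assumes 1-based inclusive coordinates (implied by the reported 1086 bp length) and that the stop codon counts toward the gene length but not the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(121234, 122319)
print(length)                  # 1086
print(protein_length(length))  # 361
```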
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCG GCGTGATCGG TGGCGGCCAA CTGGGCCGCA TGCTGGCCCT GGCGGGCACT CCGCTGGGCA TGGACTTCGC CTTCCTCGAC CCGGCGCCGG ATGCCTGCGC GGCCGCGCTC GGCGAGCACA TCCGCGCCGA TTACGGCGAT CAGGACCATC TGCGCCAACT GGCCGACGAG GTTGATCTGG TCACCTTCGA ATTCGAGAGC GTGCCGGCCG AGACCGTGGC CTTCCTCTCC CAGTTCGTTC CCGTCTATCC CTCCGCCGAG TCCCTGCGCA TCGCCCGCGA CCGCTGGTTC GAGAAGAGCC TGTTCAGGAG TCTCGGCATT CCCACCCCGG AATTCGCCAA CATCCACTCG CAGGTCGATC TCGACGCCGC CGTGGCCGAG ATCGGCCTGC CGGCGGTGCT CAAGACCCGC ACCCTGGGCT ACGACGGCAA GGGCCAGAAG GTCCTGCGCC AGCCTGCGGA CGTCGCTGGC GCCTTCGCCG AGCTGGGCAG CGTGCCCTGC ATTCTCGAAG GCTTCGTGCC CTTCAGCGGC GAGGTGTCGC TGATCGCCGT GCGCGGCCGC GACGGCGAGA CGCGCTTCTA CCCGCTGGTG CACAACACTC ACGAGAGCGG CATCCTGCGC CTGTCGGTGG CCAGCAGCGG GCACCCGCTG CAGGCGCTGG CCGAGGACTA TGTCGGCCGG GTGCTCGAGA AGCTGGACTA CGTCGGCGTG CTGGCCTTCG AGTTCTTCGA GGTCGACGGC GGCCTCAAGG CCAACGAGAT CGCCCCCAGG GTGCACAACT CCGGGCACTG GACCATCGAA GGCGCCGAGT GCGGCCAGTT CGAGAACCAT CTGCGCGCGG TGGCCGGCCT GCCGCTGGGC TCCACCGCCA AGGTGGGCGA GAGCGCCATG CTCAACTTCA TCGGCCAGGT GCCGCCGCTG GACAGGCTGG CGGCCGTTGC CGACTGCCAC CCGCATCACT ACGGCAAGGC CTTCAAGGCC GGGCGCAAGG TCGGCCACGC CACTCTGCGC AGCAAGGACC TGCCGACTCT CGAGGCGCGC ATCCGGGAAG TCGAGGCGCT GATCGCCGGG AACTGA
|
Protein sequence | MKIGVIGGGQ LGRMLALAGT PLGMDFAFLD PAPDACAAAL GEHIRADYGD QDHLRQLADE VDLVTFEFES VPAETVAFLS QFVPVYPSAE SLRIARDRWF EKSLFRSLGI PTPEFANIHS QVDLDAAVAE IGLPAVLKTR TLGYDGKGQK VLRQPADVAG AFAELGSVPC ILEGFVPFSG EVSLIAVRGR DGETRFYPLV HNTHESGILR LSVASSGHPL QALAEDYVGR VLEKLDYVGV LAFEFFEVDG GLKANEIAPR VHNSGHWTIE GAECGQFENH LRAVAGLPLG STAKVGESAM LNFIGQVPPL DRLAAVADCH PHHYGKAFKA GRKVGHATLR SKDLPTLEAR IREVEALIAG N
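The sequences above are printed in blocks of 10 characters, so the spaces have to be stripped before any computation. The following is a minimal sketch of how the GC content and the protein translation could be recomputed from the gene sequence. It assumes the codon assignments of the standard genetic code, which translation table 11 shares for elongation codons, and it does not handle alternative start codons (this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAAGATCG GCGTGATCGG TGGCGGCCAA"
print(translate(first_blocks))  # MKIGVIGGGQ
```

Applying `gc_fraction` to the full gene sequence and multiplying by 100 should give a value comparable to the GC content field, and `translate` on the full sequence can be compared against the listed protein sequence.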