Gene Avin_01200 details

Gene Information

Locus tag: Avin_01200
Symbol: purK
ID: 7759087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 121234
End bp: 122319
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 69%
IMG OID: 643803046
Product: phosphoribosylaminoimidazole carboxylase ATPase subunit
Protein accession: YP_002797362
Protein GI: 226942289
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
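
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 121234
end_bp = 122319

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1086
print(protein_length)  # 361
```

Both results agree with the Gene Length and Protein Length fields of the record.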


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATCG GCGTGATCGG TGGCGGCCAA CTGGGCCGCA TGCTGGCCCT GGCGGGCACT 
CCGCTGGGCA TGGACTTCGC CTTCCTCGAC CCGGCGCCGG ATGCCTGCGC GGCCGCGCTC
GGCGAGCACA TCCGCGCCGA TTACGGCGAT CAGGACCATC TGCGCCAACT GGCCGACGAG
GTTGATCTGG TCACCTTCGA ATTCGAGAGC GTGCCGGCCG AGACCGTGGC CTTCCTCTCC
CAGTTCGTTC CCGTCTATCC CTCCGCCGAG TCCCTGCGCA TCGCCCGCGA CCGCTGGTTC
GAGAAGAGCC TGTTCAGGAG TCTCGGCATT CCCACCCCGG AATTCGCCAA CATCCACTCG
CAGGTCGATC TCGACGCCGC CGTGGCCGAG ATCGGCCTGC CGGCGGTGCT CAAGACCCGC
ACCCTGGGCT ACGACGGCAA GGGCCAGAAG GTCCTGCGCC AGCCTGCGGA CGTCGCTGGC
GCCTTCGCCG AGCTGGGCAG CGTGCCCTGC ATTCTCGAAG GCTTCGTGCC CTTCAGCGGC
GAGGTGTCGC TGATCGCCGT GCGCGGCCGC GACGGCGAGA CGCGCTTCTA CCCGCTGGTG
CACAACACTC ACGAGAGCGG CATCCTGCGC CTGTCGGTGG CCAGCAGCGG GCACCCGCTG
CAGGCGCTGG CCGAGGACTA TGTCGGCCGG GTGCTCGAGA AGCTGGACTA CGTCGGCGTG
CTGGCCTTCG AGTTCTTCGA GGTCGACGGC GGCCTCAAGG CCAACGAGAT CGCCCCCAGG
GTGCACAACT CCGGGCACTG GACCATCGAA GGCGCCGAGT GCGGCCAGTT CGAGAACCAT
CTGCGCGCGG TGGCCGGCCT GCCGCTGGGC TCCACCGCCA AGGTGGGCGA GAGCGCCATG
CTCAACTTCA TCGGCCAGGT GCCGCCGCTG GACAGGCTGG CGGCCGTTGC CGACTGCCAC
CCGCATCACT ACGGCAAGGC CTTCAAGGCC GGGCGCAAGG TCGGCCACGC CACTCTGCGC
AGCAAGGACC TGCCGACTCT CGAGGCGCGC ATCCGGGAAG TCGAGGCGCT GATCGCCGGG
AACTGA
 
Protein sequence
MKIGVIGGGQ LGRMLALAGT PLGMDFAFLD PAPDACAAAL GEHIRADYGD QDHLRQLADE 
VDLVTFEFES VPAETVAFLS QFVPVYPSAE SLRIARDRWF EKSLFRSLGI PTPEFANIHS
QVDLDAAVAE IGLPAVLKTR TLGYDGKGQK VLRQPADVAG AFAELGSVPC ILEGFVPFSG
EVSLIAVRGR DGETRFYPLV HNTHESGILR LSVASSGHPL QALAEDYVGR VLEKLDYVGV
LAFEFFEVDG GLKANEIAPR VHNSGHWTIE GAECGQFENH LRAVAGLPLG STAKVGESAM
LNFIGQVPPL DRLAAVADCH PHHYGKAFKA GRKVGHATLR SKDLPTLEAR IREVEALIAG
N
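
The gene and protein sequences above are related by translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code. As a minimal sketch, the helpers below (`clean`, `gc_content` and `translate` are hypothetical names, not part of any database tool) strip the 10-base block spacing used in this listing, compute a GC fraction, and translate codons up to the first stop codon. Only the first and last rows of the gene sequence are used here, to keep the example short.

```python
from itertools import product

# Codon assignments of the standard genetic code, in TCAG order.
# Translation table 11 shares these assignments; it differs in which
# codons may act as start codons, which this sketch does not model.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence, copied from the listing above.
first_row = "ATGAAGATCG GCGTGATCGG TGGCGGCCAA CTGGGCCGCA TGCTGGCCCT GGCGGGCACT"
last_row = "AACTGA"

print(translate(first_row))  # MKIGVIGGGQLGRMLALAGT
print(translate(last_row))   # N
```

The first row translates to the first 20 residues of the protein sequence, and the last row yields the final residue N followed by the TGA stop codon. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.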