Gene Avin_06890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_06890 
SymbolpurH 
ID7759642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp654412 
End bp656019 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content68% 
IMG OID643803610 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002797914 
Protein GI226942841 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.956299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGACC AGACTACCCG CCTTCCCGTC CGCCGCGCGC TGATCAGCGT GTCCGACAAG 
ACCGGCGTCG TCGACTTCGC CCGTGAGCTC GCCGCCCTCG GCGTCGAGAT CCTTTCCACC
GGCGGCACCT TCAAGCTGCT GCGTGAGCAC GGCGTCGACG CCGTGGAAGT AGCCGACTAC
ACCGGTTTCC CGGAAATGAT GGACGGTCGG GTGAAGACCC TGCATCCGAA GATCCACGGC
GGCATCCTCG GCCGCCGCGA TCTCGACGCA GCGGTCATGG CCGAGCACGG CATCCAGCCG
ATCGATCTGG TCGCGGTCAA CCTCTACCCC TTCGCCGCCA CCGTGGCCAG GCCCGGCTGC
ACCCTCGCCG AGGCCATCGA GAACATCGAC ATCGGCGGGC CGACCATGGT CCGCTCGGCG
GCGAAGAACC ACAAGGACGT CGCCATCGTG GTCAACGCCG CCGACTATGC CGGCGTGCTC
GAGAGCCTGA AGAACGGCGG CCTGACCTAC GCCCAGCGCT TCGATCTGGC GCTCAGGGCC
TTCGAGCACA CCGCAGCCTA CGACGGCATG ATCGCCAACT ACCTGGGCAC CATCGACCAG
GGCGCCGAAA CCCTTACCAC CGAAGGCCGT GCCGCGTTCC CGCGTACCTT CAACAGCCAG
TTCGTCAAGG CTCAGGACAT GCGCTACGGC GAGAACCCGC ACCAGCAGGC GGCCTTCTAC
GTCGAGACCA GCCCGGCCGA GGCCAGCGTG GCCACCGCCC GCCAGTTGCA GGGCAAGGAG
CTGTCCTACA ACAACGTGGC CGACACCGAT GCCGCGCTGG AGTGCGTGAA GAGCTTCGTC
AAGCCGGCCT GCGTCATCGT CAAGCACGCC AACCCCTGCG GCGTCGCCGT GGTACCGGAA
GACGAAGGCG GCATCCGCAA GGCCTATGAC CTGGCCTACG CCACCGACAG CGAGTCCGCC
TTCGGCGGCA TCATCGCCTT CAACCGCGAA CTGGACGGCG CGACCGCCAG GGCCATCGTC
GAGCGCCAGT TCGTCGAAGT GATCATCGCC CCCAGCGTTT CCGCCGAAGC CCGTGAGGCG
GTGGCGGCCA AGGCCAACGT GCGCCTGCTC GAATGCGGCC AGTGGCCGGC CGAGCGCGCC
GATGGCCTGG ATTTCAAGCG CGTCAACGGC GGCCTGCTGG TGCAGAGCCG CGACATCGGC
ATGATCGCCG AGGCCGACCT CAAGGTCGTC ACCCGGCGCG CGCCGACCGA GCGGGAAATC
CACGACCTGA TCTTCGCCTG GAAGGTGGCC AAGTTCGTCA AGTCCAACGC CATCGTCTAT
GCCAGGAACC GCCAGACCAT CGGCGTCGGC GCCGGCCAGA TGAGCCGCGT CAACTCCGCA
CGCATCGCCG CGATCAAGGC CGAGCACGCC GGGCTCGAGG TCGCGGGGGC GGTGATGGCG
AGCGATGCCT TCTTCCCCTT CCGCGATGGC ATCGACAATG CGGCCAAGGC CGGCATCACC
GCGGTGATCC AGCCGGGCGG CTCGATGCGC GACAACGAGG TGATCGCCGC GGCCGACGAG
GCGGGCATGG CCATGGTGTT CACCGGCATG CGCCACTTCA GGCATTGA
 
Protein sequence
MTDQTTRLPV RRALISVSDK TGVVDFAREL AALGVEILST GGTFKLLREH GVDAVEVADY 
TGFPEMMDGR VKTLHPKIHG GILGRRDLDA AVMAEHGIQP IDLVAVNLYP FAATVARPGC
TLAEAIENID IGGPTMVRSA AKNHKDVAIV VNAADYAGVL ESLKNGGLTY AQRFDLALRA
FEHTAAYDGM IANYLGTIDQ GAETLTTEGR AAFPRTFNSQ FVKAQDMRYG ENPHQQAAFY
VETSPAEASV ATARQLQGKE LSYNNVADTD AALECVKSFV KPACVIVKHA NPCGVAVVPE
DEGGIRKAYD LAYATDSESA FGGIIAFNRE LDGATARAIV ERQFVEVIIA PSVSAEAREA
VAAKANVRLL ECGQWPAERA DGLDFKRVNG GLLVQSRDIG MIAEADLKVV TRRAPTEREI
HDLIFAWKVA KFVKSNAIVY ARNRQTIGVG AGQMSRVNSA RIAAIKAEHA GLEVAGAVMA
SDAFFPFRDG IDNAAKAGIT AVIQPGGSMR DNEVIAAADE AGMAMVFTGM RHFRH