Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_06890 |
Symbol | purH |
ID | 7759642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 654412 |
End bp | 656019 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643803610 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002797914 |
Protein GI | 226942841 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.956299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACC AGACTACCCG CCTTCCCGTC CGCCGCGCGC TGATCAGCGT GTCCGACAAG ACCGGCGTCG TCGACTTCGC CCGTGAGCTC GCCGCCCTCG GCGTCGAGAT CCTTTCCACC GGCGGCACCT TCAAGCTGCT GCGTGAGCAC GGCGTCGACG CCGTGGAAGT AGCCGACTAC ACCGGTTTCC CGGAAATGAT GGACGGTCGG GTGAAGACCC TGCATCCGAA GATCCACGGC GGCATCCTCG GCCGCCGCGA TCTCGACGCA GCGGTCATGG CCGAGCACGG CATCCAGCCG ATCGATCTGG TCGCGGTCAA CCTCTACCCC TTCGCCGCCA CCGTGGCCAG GCCCGGCTGC ACCCTCGCCG AGGCCATCGA GAACATCGAC ATCGGCGGGC CGACCATGGT CCGCTCGGCG GCGAAGAACC ACAAGGACGT CGCCATCGTG GTCAACGCCG CCGACTATGC CGGCGTGCTC GAGAGCCTGA AGAACGGCGG CCTGACCTAC GCCCAGCGCT TCGATCTGGC GCTCAGGGCC TTCGAGCACA CCGCAGCCTA CGACGGCATG ATCGCCAACT ACCTGGGCAC CATCGACCAG GGCGCCGAAA CCCTTACCAC CGAAGGCCGT GCCGCGTTCC CGCGTACCTT CAACAGCCAG TTCGTCAAGG CTCAGGACAT GCGCTACGGC GAGAACCCGC ACCAGCAGGC GGCCTTCTAC GTCGAGACCA GCCCGGCCGA GGCCAGCGTG GCCACCGCCC GCCAGTTGCA GGGCAAGGAG CTGTCCTACA ACAACGTGGC CGACACCGAT GCCGCGCTGG AGTGCGTGAA GAGCTTCGTC AAGCCGGCCT GCGTCATCGT CAAGCACGCC AACCCCTGCG GCGTCGCCGT GGTACCGGAA GACGAAGGCG GCATCCGCAA GGCCTATGAC CTGGCCTACG CCACCGACAG CGAGTCCGCC TTCGGCGGCA TCATCGCCTT CAACCGCGAA CTGGACGGCG CGACCGCCAG GGCCATCGTC GAGCGCCAGT TCGTCGAAGT GATCATCGCC CCCAGCGTTT CCGCCGAAGC CCGTGAGGCG GTGGCGGCCA AGGCCAACGT GCGCCTGCTC GAATGCGGCC AGTGGCCGGC CGAGCGCGCC GATGGCCTGG ATTTCAAGCG CGTCAACGGC GGCCTGCTGG TGCAGAGCCG CGACATCGGC ATGATCGCCG AGGCCGACCT CAAGGTCGTC ACCCGGCGCG CGCCGACCGA GCGGGAAATC CACGACCTGA TCTTCGCCTG GAAGGTGGCC AAGTTCGTCA AGTCCAACGC CATCGTCTAT GCCAGGAACC GCCAGACCAT CGGCGTCGGC GCCGGCCAGA TGAGCCGCGT CAACTCCGCA CGCATCGCCG CGATCAAGGC CGAGCACGCC GGGCTCGAGG TCGCGGGGGC GGTGATGGCG AGCGATGCCT TCTTCCCCTT CCGCGATGGC ATCGACAATG CGGCCAAGGC CGGCATCACC GCGGTGATCC AGCCGGGCGG CTCGATGCGC GACAACGAGG TGATCGCCGC GGCCGACGAG GCGGGCATGG CCATGGTGTT CACCGGCATG CGCCACTTCA GGCATTGA
|
Protein sequence | MTDQTTRLPV RRALISVSDK TGVVDFAREL AALGVEILST GGTFKLLREH GVDAVEVADY TGFPEMMDGR VKTLHPKIHG GILGRRDLDA AVMAEHGIQP IDLVAVNLYP FAATVARPGC TLAEAIENID IGGPTMVRSA AKNHKDVAIV VNAADYAGVL ESLKNGGLTY AQRFDLALRA FEHTAAYDGM IANYLGTIDQ GAETLTTEGR AAFPRTFNSQ FVKAQDMRYG ENPHQQAAFY VETSPAEASV ATARQLQGKE LSYNNVADTD AALECVKSFV KPACVIVKHA NPCGVAVVPE DEGGIRKAYD LAYATDSESA FGGIIAFNRE LDGATARAIV ERQFVEVIIA PSVSAEAREA VAAKANVRLL ECGQWPAERA DGLDFKRVNG GLLVQSRDIG MIAEADLKVV TRRAPTEREI HDLIFAWKVA KFVKSNAIVY ARNRQTIGVG AGQMSRVNSA RIAAIKAEHA GLEVAGAVMA SDAFFPFRDG IDNAAKAGIT AVIQPGGSMR DNEVIAAADE AGMAMVFTGM RHFRH
|
| |