Gene Information |
Locus tag | Avi_4292 |
Symbol | purH |
ID | 7386535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 3606692 |
End bp | 3608251 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643652951 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002551122 |
Protein GI | 222150165 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
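
The coordinate and length fields above are mutually consistent, which a short check can confirm. The following is a minimal sketch; the function names are hypothetical, and it assumes inclusive 1-based coordinates and a CDS that ends with a single stop codon, neither of which the record states explicitly.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start/end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values copied from the record for Avi_4292 (purH).
start_bp, end_bp = 3606692, 3608251
length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1560 519
```

Both results match the "Gene Length" and "Protein Length" fields reported in the record.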
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGCTGTCGG TGTTCGACAA GAGCGGCATT GTCGATCTTG CCCGGGCCTT GAACGATATG GGTGTGCGGC TGCTATCAAC CGGCGGCACC TACAAGGCGC TGATCGAGGC AGGCCTGCCC GCCACCGACG TGTCAGACGT GACCGGCTTT CCGGAAATCA TGGATGGTCG GGTGAAGACC CTGCATCCTG CCGTGCATGG CGGCCTGCTT GCCATCCGCG ATGATGAAGA CCATGTGAAG GCTATGCAGG CCCATAAGAT CGAGGCCATC GATCTCGCCG TCATCAATCT TTATCCGTTT GAGGCGGTTC TGGCAGCGGG CGGCGACTAT CCGACCACGG TCGAAAATAT CGATATCGGC GGCCCGGCGA TGATCCGTGC CTCTGCCAAG AACCACGCCT ATGTTACCGT GGTGACTGAC CCTGCCGATT ACGCCCAGCT TCTGGACGCG CTGAAAGCGG ACGATTGTCA CACGCCTTAT GCGCTGCGCC AGCAGTTCGC CGCCCGCGCC TATGCCCGGA CCGCCGCTTA TGACGCAACG ATCTCCAACT GGTTTGCCGA GGCGCTTGCC ATCGAGACGC CGCGCAACCG GGTGATTGGC GGCAGCCTGC GCGAAGAAAT GCGCTATGGC GAGAACCCGC ACCAGAAAGC AGGCTTCTAC GTCAATGGCG ATCAGCGTCC CGGGGTCGCA ACCGCCACGC TTTTGCAGGG CAAGCAGCTT TCCTATAACA ATATCAATGA TACGGATGCC GCCTTCGAAC TGGTGTCGGA ATTCCTGCCT GAAAACGGTC CGGCCTGCGC CATTATCAAG CACGCCAATC CATGCGGTGT CGCGGTCGGT AAGACGCTGG CCGATGCCTA TCGCCGGGCA CTGGCCTGCG ACAGCGTCTC GGCTTTCGGC GGCATTATCG CGCTGAACCA GACCCTGGAT GCGGAAACCG CTGAAGAGAT CGTCAAGCTG TTTACCGAGG TGATCATCGC CCCTGACGTC ACGGAAGAGG CAAAGGCCAT TATTGCCCGC AAGGCCAATC TGCGGCTGTT GACCACTGGC GGTCTGGCCG ACCCACGCGC GCCTGGCCTG ACGGCCAAAA CGGTATCGGG TGGCCTGCTG GTGCAAAGCC GCGACAATCT GGTGGTAGAA GATCTGGACC TGAAGGTCGT CACCAAGCGC GCACCGACCG CAGCCGAGCT GGAAGACATG AAGCTGGCCT TTAAGATCGC CAAGCATGTG AAATCCAACG CTGTCATCTA TGCCAAGGAC GGCCAGGCTG TCGGCATTGG CGCGGGCCAG ATGAGCCGGG TGGATTCCGC CCGGATCGCC GCGATGAAAG CCGAAGATGC TGCCAAGGCC ATGGGATTGG CCGAGCCGCT GACCCGTGGC TCTGCCGTTG CCTCCGAAGC GTTCTACCCG TTTGCCGATG GATTGCTGGC TGCCATTGCC GCCGGTGCGA CGGCGGTGAT CCAGCCGGGC GGTTCCATGC GCGATGCCGA GGTGATTGCC GCCGCCGACG AGCACGGCGT CGCCATGGTC TTTACCGGCG TGCGCCACTT CCGGCATTGA
|
Protein sequence | MLSVFDKSGI VDLARALNDM GVRLLSTGGT YKALIEAGLP ATDVSDVTGF PEIMDGRVKT LHPAVHGGLL AIRDDEDHVK AMQAHKIEAI DLAVINLYPF EAVLAAGGDY PTTVENIDIG GPAMIRASAK NHAYVTVVTD PADYAQLLDA LKADDCHTPY ALRQQFAARA YARTAAYDAT ISNWFAEALA IETPRNRVIG GSLREEMRYG ENPHQKAGFY VNGDQRPGVA TATLLQGKQL SYNNINDTDA AFELVSEFLP ENGPACAIIK HANPCGVAVG KTLADAYRRA LACDSVSAFG GIIALNQTLD AETAEEIVKL FTEVIIAPDV TEEAKAIIAR KANLRLLTTG GLADPRAPGL TAKTVSGGLL VQSRDNLVVE DLDLKVVTKR APTAAELEDM KLAFKIAKHV KSNAVIYAKD GQAVGIGAGQ MSRVDSARIA AMKAEDAAKA MGLAEPLTRG SAVASEAFYP FADGLLAAIA AGATAVIQPG GSMRDAEVIA AADEHGVAMV FTGVRHFRH
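
The gene sequence begins with CTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code) CTG is one of the permitted alternative start codons, and an initiating codon is read as methionine. The sketch below illustrates this on the first three 10-base blocks of the gene sequence. It is a self-contained illustration rather than the database's own pipeline; the names `clean_sequence` and `translate_prefix` are hypothetical. Because the displayed sequence translates directly, it appears to be given in coding orientation even though the gene lies on the minus strand.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (identical for tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(raw: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(raw.split()).upper()


def translate_prefix(nt: str, at_cds_start: bool = True) -> str:
    """Translate in frame 0, stopping at the first stop codon.

    If at_cds_start is True, a valid start codon in first position is
    emitted as 'M' regardless of its usual meaning.
    """
    protein = []
    usable = len(nt) - len(nt) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        codon = nt[pos:pos + 3]
        if pos == 0 and at_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_blocks = clean_sequence("CTGCTGTCGG TGTTCGACAA GAGCGGCATT")
print(translate_prefix(first_blocks))  # MLSVFDKSGI
```

The output agrees with the first block of the protein sequence in the record. Passing the full cleaned gene sequence to the same function would be the natural way to compare against the complete 519 aa protein.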
|
| |
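
The "GC content" field can be recomputed from the gene sequence as the fraction of G and C bases. This is a minimal sketch with a hypothetical helper name; it assumes the database computes GC content over the gene sequence alone and rounds to a whole percent, which the record does not confirm.

```python
def gc_content(raw: str) -> float:
    """Fraction of G/C bases in a sequence, ignoring whitespace between blocks."""
    seq = "".join(raw.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Demonstrated on the first three blocks only; pass the full gene
# sequence to compare against the percentage reported in the record.
fragment = "CTGCTGTCGG TGTTCGACAA GAGCGGCATT"
print(f"{gc_content(fragment):.1%}")
```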