Gene Avi_4292 details

Gene Information

Locus tag: Avi_4292
Symbol: purH
ID: 7386535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011989
Strand:
Start bp: 3606692
End bp: 3608251
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 62%
IMG OID: 643652951
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_002551122
Protein GI: 222150165
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
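
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the coordinates are 1-based and inclusive and that the gene length includes a single untranslated stop codon; the variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 3606692
end_bp = 3608251

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1560
print(protein_length)  # 519
```

Under these assumptions the computed values agree with the listed Gene Length (1560 bp) and Protein Length (519 aa).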


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
CTGCTGTCGG TGTTCGACAA GAGCGGCATT GTCGATCTTG CCCGGGCCTT GAACGATATG 
GGTGTGCGGC TGCTATCAAC CGGCGGCACC TACAAGGCGC TGATCGAGGC AGGCCTGCCC
GCCACCGACG TGTCAGACGT GACCGGCTTT CCGGAAATCA TGGATGGTCG GGTGAAGACC
CTGCATCCTG CCGTGCATGG CGGCCTGCTT GCCATCCGCG ATGATGAAGA CCATGTGAAG
GCTATGCAGG CCCATAAGAT CGAGGCCATC GATCTCGCCG TCATCAATCT TTATCCGTTT
GAGGCGGTTC TGGCAGCGGG CGGCGACTAT CCGACCACGG TCGAAAATAT CGATATCGGC
GGCCCGGCGA TGATCCGTGC CTCTGCCAAG AACCACGCCT ATGTTACCGT GGTGACTGAC
CCTGCCGATT ACGCCCAGCT TCTGGACGCG CTGAAAGCGG ACGATTGTCA CACGCCTTAT
GCGCTGCGCC AGCAGTTCGC CGCCCGCGCC TATGCCCGGA CCGCCGCTTA TGACGCAACG
ATCTCCAACT GGTTTGCCGA GGCGCTTGCC ATCGAGACGC CGCGCAACCG GGTGATTGGC
GGCAGCCTGC GCGAAGAAAT GCGCTATGGC GAGAACCCGC ACCAGAAAGC AGGCTTCTAC
GTCAATGGCG ATCAGCGTCC CGGGGTCGCA ACCGCCACGC TTTTGCAGGG CAAGCAGCTT
TCCTATAACA ATATCAATGA TACGGATGCC GCCTTCGAAC TGGTGTCGGA ATTCCTGCCT
GAAAACGGTC CGGCCTGCGC CATTATCAAG CACGCCAATC CATGCGGTGT CGCGGTCGGT
AAGACGCTGG CCGATGCCTA TCGCCGGGCA CTGGCCTGCG ACAGCGTCTC GGCTTTCGGC
GGCATTATCG CGCTGAACCA GACCCTGGAT GCGGAAACCG CTGAAGAGAT CGTCAAGCTG
TTTACCGAGG TGATCATCGC CCCTGACGTC ACGGAAGAGG CAAAGGCCAT TATTGCCCGC
AAGGCCAATC TGCGGCTGTT GACCACTGGC GGTCTGGCCG ACCCACGCGC GCCTGGCCTG
ACGGCCAAAA CGGTATCGGG TGGCCTGCTG GTGCAAAGCC GCGACAATCT GGTGGTAGAA
GATCTGGACC TGAAGGTCGT CACCAAGCGC GCACCGACCG CAGCCGAGCT GGAAGACATG
AAGCTGGCCT TTAAGATCGC CAAGCATGTG AAATCCAACG CTGTCATCTA TGCCAAGGAC
GGCCAGGCTG TCGGCATTGG CGCGGGCCAG ATGAGCCGGG TGGATTCCGC CCGGATCGCC
GCGATGAAAG CCGAAGATGC TGCCAAGGCC ATGGGATTGG CCGAGCCGCT GACCCGTGGC
TCTGCCGTTG CCTCCGAAGC GTTCTACCCG TTTGCCGATG GATTGCTGGC TGCCATTGCC
GCCGGTGCGA CGGCGGTGAT CCAGCCGGGC GGTTCCATGC GCGATGCCGA GGTGATTGCC
GCCGCCGACG AGCACGGCGT CGCCATGGTC TTTACCGGCG TGCGCCACTT CCGGCATTGA
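
GC content, as reported in the Gene Information section, is the share of G and C bases in a sequence. A minimal sketch of that calculation follows, assuming the sequence is supplied as text with the spaces and line breaks used in this record. The function name `gc_percent` is hypothetical, and the example uses only the first ten bases, so its result is not the whole-gene figure.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)

# First 10 bases of the gene sequence above (illustration only).
print(gc_percent("CTGCTGTCGG"))  # 70.0
```

Passing the full gene sequence instead of the ten-base fragment would give the value to compare against the listed GC content.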
 
Protein sequence
MLSVFDKSGI VDLARALNDM GVRLLSTGGT YKALIEAGLP ATDVSDVTGF PEIMDGRVKT 
LHPAVHGGLL AIRDDEDHVK AMQAHKIEAI DLAVINLYPF EAVLAAGGDY PTTVENIDIG
GPAMIRASAK NHAYVTVVTD PADYAQLLDA LKADDCHTPY ALRQQFAARA YARTAAYDAT
ISNWFAEALA IETPRNRVIG GSLREEMRYG ENPHQKAGFY VNGDQRPGVA TATLLQGKQL
SYNNINDTDA AFELVSEFLP ENGPACAIIK HANPCGVAVG KTLADAYRRA LACDSVSAFG
GIIALNQTLD AETAEEIVKL FTEVIIAPDV TEEAKAIIAR KANLRLLTTG GLADPRAPGL
TAKTVSGGLL VQSRDNLVVE DLDLKVVTKR APTAAELEDM KLAFKIAKHV KSNAVIYAKD
GQAVGIGAGQ MSRVDSARIA AMKAEDAAKA MGLAEPLTRG SAVASEAFYP FADGLLAAIA
AGATAVIQPG GSMRDAEVIA AADEHGVAMV FTGVRHFRH
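
The protein sequence corresponds to the gene sequence read with translation table 11. The sketch below is illustrative and is not the database's own pipeline. It builds the codon table from the conventional TCAG ordering (the amino acid assignments in table 11 match the standard code) and assumes the first codon is read as methionine because it acts as the start codon; CTG is one of the alternative start codons in table 11. The function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is the start codon and is read as M.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)

# First 30 bases of the gene sequence above.
print(translate("CTGCTGTCGG TGTTCGACAA GAGCGGCATT"))  # MLSVFDKSGI
```

The output matches the first ten residues of the protein sequence listed above.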