Gene RSc0504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSc0504 
SymbolpurH 
ID1219308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003295 
Strand
Start bp537887 
End bp539461 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content67% 
IMG OID637236862 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionNP_518625 
Protein GI17545223 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.150632 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCAGC AAGCTCTGCT TTCCGTTTCC GACAAGACCG GCATCGTCGA CTTTGCCCGC 
GCGCTGCACG GCGCGGGCGT CAAGCTCCTC TCCACCGGCG GCACCGCCAA GCTGCTCGCC
GAGTCCGGCC TGCCCGTGAC GGAAGTGGCC GACTACACCG GCTTCCCGGA AATGCTCGAC
GGCCGCGTCA AGACCTTGCA CCCGAAGGTG CACGGCGGCA TCCTCGCCCG CCGCGACCTG
CCCGAACACA TGGCGGCGCT GTCCCAGCAC GGCATCCCGA CCATCGACCT GCTGGTGGTG
AACCTGTATC CGTTCCAGCA GACCGTTGCC AAGGATGAAT GCACGCTGGC CGATGCCATC
GAGAACATCG ACATCGGCGG CCCGACCATG CTGCGCTCGG CCGCCAAGAA CCACCGCGAC
GTGACGGTGA TCGTCGATCC GGCCGACTAC GCCACCGTGC TGGCCGAGAT GCAGGCCAAC
GGCAACACGG TCGGCTACGA CACCAACTTC ATGCTGGCCA AGAAAGTGTT CGCGCACACC
GCGCAGTACG ACGGCGCCAT CACCAACTAC CTGACCAGCC TGGGACAGGA CAAGTCGCAC
AGCACCCGCA GCCAGTATCC GCAGACGCTG AACCTGGCCT TCGAGCAGGT GCAGGAGATG
CGCTACGGCG AGAACCCGCA CCAGTCCGCC GCCTTCTACC GCGACCTGAA GGCCGTGGAC
GGCGCGCTGG CCAACTACGC GCAGCTGCAG GGCAAGGAGC TGTCGTACAA CAACATCGCC
GATGCCGATG CGGCGTGGGA GTGCGTGAAA TCGTTCGACC CGGCCAAGGG CGCGGCGTGC
GTCATCATCA AGCACGCCAA CCCGTGCGGC GTGGCCATCG GCGGCACCGC GCAGGAAGCC
TACGAGAAGG CCTTCAAGAC CGACTCGACC TCGGCCTTCG GCGGCATCAT CGCCTTTAAC
GTGCCGCTGG ACGAAGCGGC CGCGCAGGTG GTGGCCAAGC AGTTCGTCGA AGTGCTGATC
GCACCGAGCT TCTCCGCGGG CGCGCGCACG GTATTCGCGG CCAAGCAGAA CGTGCGCCTG
CTGGAAATTC CGCTGGGCAA GGGCGTCAAC GCCTACGACT TCAAGCGCGT CGGCGGCGGC
CTGCTGGTGC AGAGCCCGGA TGCCAAGAAC GTGCAGTCGG CCGAACTGCG CGTGGTCACC
AAGCGCCACC CGACCCCGAA GGAGATGGAC GACCTGCTGT TCGCCTGGCG CGTCGCCAAG
TTCGTCAAGT CCAACGCCAT CGTGTTCTGC GGCGGCGGCA TGACGCTGGG CGTGGGCGCC
GGCCAGATGA GCCGCGTGGA CTCGGCCCGC ATCGCCAGCA TCAAGGCGCA GAACGCGGGC
CTGATGCTGT CGGGCTCGGC GGTGGCATCG GACGCCTTCT TCCCGTTCCG CGACGGCCTG
GACGTGGTGG TCGACGCCGG CGCCTCGTGC GTGATCCAGC CGGGCGGCTC GGTGCGCGAT
GACGAAGTGA TCGCCGCCGC CGACGAGCGC AACGTGGCCA TGATCTTCAC CGGCACGCGC
CACTTCCGCC ACTAA
 
Protein sequence
MIQQALLSVS DKTGIVDFAR ALHGAGVKLL STGGTAKLLA ESGLPVTEVA DYTGFPEMLD 
GRVKTLHPKV HGGILARRDL PEHMAALSQH GIPTIDLLVV NLYPFQQTVA KDECTLADAI
ENIDIGGPTM LRSAAKNHRD VTVIVDPADY ATVLAEMQAN GNTVGYDTNF MLAKKVFAHT
AQYDGAITNY LTSLGQDKSH STRSQYPQTL NLAFEQVQEM RYGENPHQSA AFYRDLKAVD
GALANYAQLQ GKELSYNNIA DADAAWECVK SFDPAKGAAC VIIKHANPCG VAIGGTAQEA
YEKAFKTDST SAFGGIIAFN VPLDEAAAQV VAKQFVEVLI APSFSAGART VFAAKQNVRL
LEIPLGKGVN AYDFKRVGGG LLVQSPDAKN VQSAELRVVT KRHPTPKEMD DLLFAWRVAK
FVKSNAIVFC GGGMTLGVGA GQMSRVDSAR IASIKAQNAG LMLSGSAVAS DAFFPFRDGL
DVVVDAGASC VIQPGGSVRD DEVIAAADER NVAMIFTGTR HFRH