Gene SeAg_B4420 details


Gene Information

Locus tag: SeAg_B4420
Symbol: purH
ID: 6792746
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 4310655
End bp: 4312244
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 58%
IMG OID: 642778514
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_002149084
Protein GI: 197248309
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
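
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive and that the CDS ends in a single, untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4310655
end_bp = 4312244

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1590 529
```

Under these assumptions the result agrees with the listed gene length of 1590 bp and protein length of 529 aa.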


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000744921
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACAAC GTCGTCCAGT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA GGCCGGTATC 
ATCGAATTCG CCCAGGCACT TTCCGCACGC GGTGTGGAGC TGCTGTCTAC GGGGGGCACC
GCCCGCCTGT TAGCAGAAAA AGGCCTGCCG GTGACCGAAG TTTCCGATTA CACCGGTTTT
CCGGAAATGA TGGATGGACG CGTAAAGACC CTGCATCCAA AAGTCCACGG CGGCATCCTC
GGTCGCCGCG GCCAGGACGA TGCCATTATG GAACAGCACC ACATCGCCCC TATCGATATG
GTTGTCGTTA ACCTGTATCC GTTCGCCGAA ACCGTAGCAC GCGAAGGCTG CTCGCTGGAA
GATGCGGTAG AGAACATCGA TATCGGCGGC CCGACCATGG TGCGCTCTGC TGCGAAGAAC
CATAAAGACG TCGCCATCGT GGTGAAGAGC AGCGACTACG ACGCCATTAT TAAAGAGATG
GATGCTAACG AAGGTTCTCT GACCCTCGAC ACCCGTTTCG ATCTCGCGAT TAAAGCCTTC
GAACACACCG CCGCCTACGA CAGCATGATC GCTAACTACT TCGGCAGCAT GGTTCCGGCC
TACCACGGTG AAAGCAAAGA AGCCGCCGGT CGCTTCCCGC GTACGCTGAA TCTGAACTTC
ATTAAGAAGC AGGATATGCG CTACGGCGAG AACAGCCACC AGCAGGCTGC CTTCTATATA
GAAGAGAATG TGAAAGAAGC GTCCGTTGCC ACCGCACAAC AGGTTCAGGG CAAAGCGCTT
TCCTACAACA ACATCGCCGA TACCGACGCG GCGCTGGAGT GTGTGAAAGC GTTCAACGAG
CCAGCCTGCG TAATCGTCAA GCACGCTAAC CCGTGCGGCG TGGCGGTAAG TACCTCTATT
CTCGACGCTT ACGATCGCGC CTATAAAACC GACCCGACCT CCGCGTTCGG CGGCATCATC
GCCTTTAACC GCGAGCTGGA TGCTGAAACC GCGCAGGCCA TCATCTCCCG CCAGTTCGTG
GAAGTGATCA TCGCCCCATC TGCGACCGAA GAAGCGCTGA AGATCACTGC CGCTAAACAG
AACGTTCGCG TCCTGACCTG TGGCCAGTGG GCACAGCGCG TACCGGGCCT GGATTTCAAA
CGCGTTAACG GCGGCCTGCT GGTTCAGGAC AGGGATCTGG GTATGGTGAG CGAAGCTGAA
CTGCGCGTGG TCTCTAAACG CCAGCCGACC GAGCAGGAAC TGCGCGATGC GCTGTTCTGC
TGGAAGGTAG CCAAGTTCGT GAAATCCAAC GCCATTGTGT ATGCCAAAGA GAACATGACT
ATCGGCATTG GCGCAGGCCA GATGAGCCGC GTGTACTCCG CGAAAATCGC CGGGATTAAA
GCGGCTGACG AAGGTCTGGA AGTGAAAGGC TCCGCGATGG CCTCTGACGC CTTCTTCCCG
TTCCGTGACG GTATTGATGC CGCTGCCGCT GTCGGCGTGA GCTGCGTTAT CCAGCCTGGC
GGTTCGATTC GCGACGAAGA GGTGATTGCC GCCGCCGACG AACACGGCAT TGCGATGATC
TTCACCGACA TGCGCCACTT CCGCCATTAA
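
The gene sequence is listed in space-separated blocks of ten bases, so it needs to be cleaned before any length or GC calculation. The following is a minimal sketch of one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the listing is used as sample input here.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing above, used as sample input.
first_line = "ATGCAACAAC GTCGTCCAGT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA GGCCGGTATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the complete listing to the same helpers should give a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.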
 
Protein sequence
MQQRRPVRRA LLSVSDKAGI IEFAQALSAR GVELLSTGGT ARLLAEKGLP VTEVSDYTGF 
PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM EQHHIAPIDM VVVNLYPFAE TVAREGCSLE
DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DANEGSLTLD TRFDLAIKAF
EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKQDMRYGE NSHQQAAFYI
EENVKEASVA TAQQVQGKAL SYNNIADTDA ALECVKAFNE PACVIVKHAN PCGVAVSTSI
LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSATE EALKITAAKQ
NVRVLTCGQW AQRVPGLDFK RVNGGLLVQD RDLGMVSEAE LRVVSKRQPT EQELRDALFC
WKVAKFVKSN AIVYAKENMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SAMASDAFFP
FRDGIDAAAA VGVSCVIQPG GSIRDEEVIA AADEHGIAMI FTDMRHFRH
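
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one possible way to do this without external libraries. It relies on the fact that NCBI translation table 11 assigns the same amino acids as the standard code (the tables differ only in permitted start codons); `translate` is a hypothetical helper that stops at the first stop codon and does not handle ambiguous bases or alternative start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Table 11 uses the same
# assignments as the standard genetic code; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 30 bases of the gene sequence listed above.
print(translate("ATGCAACAAC GTCGTCCAGT CCGCCGCGCT"))  # MQQRRPVRRA
print(translate("TTCACCGACA TGCGCCACTT CCGCCATTAA"))  # FTDMRHFRH
```

The two outputs match the first ten and last nine residues of the protein sequence shown above.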