Gene SeHA_C4507 details


Gene Information

Locus tag: SeHA_C4507
Symbol: purH
ID: 6490266
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 4387866
End bp: 4389455
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 57%
IMG OID: 642744581
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_002048161
Protein GI: 194449397
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
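
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which agrees with the stated gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4387866
END_BP = 4389455
GENE_LENGTH_BP = 1590
PROTEIN_LENGTH_AA = 529


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1590
print(expected_protein_length(GENE_LENGTH_BP))  # 529
```

Both results agree with the Gene Length and Protein Length fields of the record.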


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0348151
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.014905
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACAAC GTCGTCCAGT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA GGCCGGTATC 
ATCGAATTCG CCCAGGCACT TTCCGCACGC GGTGTGGAGC TGCTGTCTAC GGGGGGCACC
GCCCGCCTGT TAGCAGAAAA AGGCCTGCCG GTGACCGAAG TTTCCGATTA CACCGGTTTC
CCGGAAATGA TGGATGGACG CGTAAAGACC CTGCATCCAA AAATCCACGG CGGCATCCTC
GGTCGTCGCG GCCAGGACGA TGCCATTATG GAACAGCACC ACATCGCCCC TATCGATATG
GTTGTCGTTA ACCTGTATCC GTTCGCCGAG ACCGTGGCAC GCGAAGGCTG CTCGCTGGAA
GATGCGGTAG AGAACATTGA TATCGGCGGC CCGACCATGG TGCGCTCTGC TGCTAAGAAC
CATAAAGACG TGGCCATCGT GGTGAAGAGC AGCGACTACG ACGCCATTAT TAAAGAGATG
GATGCTAACG AAGGTTCTCT GACCCTCGAC ACCCGTTTCG ATCTCGCGAT TAAAGCCTTC
GAACACACCG CCGCCTACGA CAGCATGATC GCTAACTACT TCGGCAGCAT GGTTCCGGCC
TACCACGGTG AAAGCAAAGA AGCCGCCGGT CGCTTCCCGC GTACGCTGAA TCTGAACTTC
ATTAAGAAGC AGGATATGCG CTATGGCGAG AACAGCCACC AGCAGGCAGC CTTCTATATA
GAAGAAAATG TGAAAGAAGC GTCCGTTGCC ACCGCACAGC AGGTTCAGGG CAAAGCGCTT
TCTTACAACA ACATCGCTGA TACCGACGCG GCGCTGGAGT GTGTGAAAGA GTTCAACGAG
CCAGCCTGCG TAATCGTCAA GCACGCTAAC CCGTGCGGCG TGGCGGTGAG CACCACTATT
CTCGACGCTT ACGACCGTGC GTATAAAACC GACCCAACCT CCGCGTTCGG CGGCATTATC
GCCTTCAACC GCGAACTGGA TGCCGAAACC GCGCAGGCCA TTATCTCCCG CCAGTTCGTG
GAAGTGATCA TCGCCCCGTC CGCAACCGAA GAGGCGCTGA AAATCACCGC CGCCAAGCAG
AATGTGCGCG TACTGACCTG TGGTCAGTGG GCACAGCGCG TACCTGGTCT GGACTTCAAA
CGCGTTAACG GCGGCCTGCT GGTTCAGGAC AGGGACCTGG GTATGGTGAG CGAAGCTGAA
CTGCGCGTGG TTTCCAAACG CCAGCCGACC GAGCAAGAGC TGCGCGATGC GCTGTTCTGC
TGGAAAGTGG CAAAGTTCGT GAAATCTAAC GCCATTGTTT ACGCCAAAGA GAATATGACC
ATCGGCATAG GCGCAGGCCA GATGAGCCGC GTCTACTCCG CGAAAATCGC TGGCATTAAA
GCGGCTGACG AAGGTCTGGA AGTGAAAGGT TCCGCGATGG CCTCTGACGC CTTCTTCCCG
TTCCGCGATG GTATTGATGC CGCTGCCGCC GTGGGCGTGA GCTGCGTGAT CCAGCCTGGC
GGTTCTATCC GTGATGATGA AGTCATTGCC GCCGCCGACG AACACGGTAT TGCGATGATC
TTCACCGACA TGCGCCACTT CCGCCATTAA
 
Protein sequence
MQQRRPVRRA LLSVSDKAGI IEFAQALSAR GVELLSTGGT ARLLAEKGLP VTEVSDYTGF 
PEMMDGRVKT LHPKIHGGIL GRRGQDDAIM EQHHIAPIDM VVVNLYPFAE TVAREGCSLE
DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DANEGSLTLD TRFDLAIKAF
EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKQDMRYGE NSHQQAAFYI
EENVKEASVA TAQQVQGKAL SYNNIADTDA ALECVKEFNE PACVIVKHAN PCGVAVSTTI
LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSATE EALKITAAKQ
NVRVLTCGQW AQRVPGLDFK RVNGGLLVQD RDLGMVSEAE LRVVSKRQPT EQELRDALFC
WKVAKFVKSN AIVYAKENMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SAMASDAFFP
FRDGIDAAAA VGVSCVIQPG GSIRDDEVIA AADEHGIAMI FTDMRHFRH
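
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not a full implementation of translation table 11: the codon table is a deliberately partial, hand-written subset covering only the codons in the first 30 bases and the last 9 bases of the gene sequence above, and the names `PARTIAL_CODON_TABLE` and `translate` are hypothetical.

```python
# Partial codon table: only the codons needed for the fragments below.
# This is an illustrative subset, NOT the complete translation table 11.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "CAA": "Q", "CGT": "R", "CCA": "P",
    "GTC": "V", "CGC": "R", "GCT": "A", "CAT": "H",
    "TAA": "*",  # stop codon
}


def translate(dna: str) -> str:
    """Translate a DNA fragment codon by codon, stopping at a stop codon.

    Spaces (as used in the 10-base blocks of the record) are ignored.
    Raises KeyError for any codon outside the partial table.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = PARTIAL_CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCAACAAC GTCGTCCAGT CCGCCGCGCT"))  # MQQRRPVRRA
# Last 9 bases of the gene sequence, ending in the stop codon TAA.
print(translate("CGCCATTAA"))  # RH
```

The first result matches the opening block of the protein sequence (MQQRRPVRRA) and the second matches its final two residues. Translating the whole gene would require the complete codon table.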