Gene SeSA_A4387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A4387 
SymbolpurH 
ID6516108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp4258468 
End bp4260057 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content57% 
IMG OID642749338 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002117077 
Protein GI194738357 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00813885 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.752552 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACAAC GTCGTCCAGT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA GGCCGGTATC 
ATCGAATTCG CCCAGGCACT TTCCGCACGC GGTGTGGAAC TGCTGTCTAC GGGGGGCACC
GCCCGCCTGT TAGCAGAAAA AGGCCTGCCG GTGACCGAAG TTTCCGATTA CACCGGTTTC
CCGGAAATGA TGGATGGACG CGTAAAGACC CTGCATCCAA AAGTACACGG CGGCATCCTC
GGCCGTCGCG GCCAGGACGA TGCCATTATG GAACAGCACC ACATCGCCCC TATCGATATG
GTTGTCGTTA ACCTGTATCC GTTCGCCGAA ACCGTTGCAC GCGTTGGTTG CTCGCTGGAA
GATGCAGTAG AGAACATCGA TATCGGCGGC CCGACCATGG TGCGCTCCGC CGCGAAGAAC
CATAAAGACG TGGCCATCGT GGTGAAGAGC AGCGACTACG ACGCCATTAT TAAAGAGATG
GACGCTAACG AAGGTTCTTT GACCCTCGAC ACCCGTTTCG ATCTCGCGAT TAAAGCCTTC
GAACACACCG CCGCCTACGA CAGCATGATC GCTAACTACT TCGGCAGCAT GGTTCCGGCC
TACCACGGTG AAAGCAAAGA AGCCGCCGGT CGCTTCCCGC GTACGCTGAA CCTGAACTTC
ATTAAGAAGC AGGATATGCG CTACGGCGAG AACAGCCACC AGCAAGCTGC CTTCTATATA
GAAGAGAATG TGAAAGAAGC GTCCGTTGCC ACCGCACAAC AGGTGCAGGG CAAAGCGCTC
TCTTATAACA ACATCGCTGA TACCGACGCG GCGCTGGAGT GTGTGAAAGA GTTCAACGAG
CCAGCCTGCG TAATCGTCAA GCACGCTAAC CCGTGCGGCG TGGCGGTAAG TACCTCTATT
CTCGACGCTT ACGATCGCGC CTATAAAACC GACCCGACCT CCGCGTTCGG CGGCATTATC
GCCTTTAACC GTGAGCTGGA TGCGGAAACC GCACAGGCCA TCATTTCTCG TCAGTTTGTC
GAAGTGATCA TCGCCCCATC CGCAAGCGAA GAAGCGCTGA AAATCACCGC TGCCAAGCAG
AACGTCCGTG TTCTGACCTG CGGCCAATGG GCAAGCCGCG TTCCGGGCCT GGATTTCAAA
CGCGTTAACG GTGGCCTGCT GGTTCAGGAC AGGGATCTGG GTATGGTGAG TGAAGCTGAA
CTGCGCGTGG TGTCCAAACG CCAGCCGACC GAGCAGGAGC TGCGTGACGC GCTGTTCTGC
TGGAAGGTAG CCAAGTTCGT GAAATCCAAC GCTATTGTGT ATGCCAAAGA GAACATGACC
ATCGGTATAG GCGCAGGCCA GATGAGCCGC GTGTACTCCG CCAAAATCGC CGGGATTAAA
GCCGCTGATG AAGGTCTGGA AGTGAAAGGC TCAGCCATGG CTTCCGACGC GTTCTTCCCG
TTCCGCGATG GTATTGATGC CGCTGCCGCT GTCGGCGTGA GCTGCGTTAT CCAGCCAGGC
GGTTCTATCC GTGATGATGA AGTCATTGCC GCTGCCGATG AACACGGTAT TGCGATGATC
TTCACCGACA TGCGCCACTT CCGCCATTAA
 
Protein sequence
MQQRRPVRRA LLSVSDKAGI IEFAQALSAR GVELLSTGGT ARLLAEKGLP VTEVSDYTGF 
PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM EQHHIAPIDM VVVNLYPFAE TVARVGCSLE
DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DANEGSLTLD TRFDLAIKAF
EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKQDMRYGE NSHQQAAFYI
EENVKEASVA TAQQVQGKAL SYNNIADTDA ALECVKEFNE PACVIVKHAN PCGVAVSTSI
LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSASE EALKITAAKQ
NVRVLTCGQW ASRVPGLDFK RVNGGLLVQD RDLGMVSEAE LRVVSKRQPT EQELRDALFC
WKVAKFVKSN AIVYAKENMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SAMASDAFFP
FRDGIDAAAA VGVSCVIQPG GSIRDDEVIA AADEHGIAMI FTDMRHFRH