Gene SeD_A4583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4583 
SymbolpurH 
ID6870984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4422902 
End bp4424491 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content57% 
IMG OID642787490 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002218092 
Protein GI198244601 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00178575 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACAAC GTCGTCCAGT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA GGCCGGTATC 
ATCGAATTCG CCCAGGCACT TTCCGCACGC GGTGTGGAGC TGCTGTCTAC GGGGGGCACC
GCCCGCCTGT TAGCAGAAAA AGGCCTGGCG GTGACCGAAG TTTCCGATTA CACCGGTTTC
CCGGAAATGA TGGATGGACG CGTAAAGACC CTGCATCCAA AAGTCCACGG CGGCATCCTC
GGTCGTCGCG GCCAGGACGA TGCCATTATG GAACAGCACC ACATCGCCCC TATCGATATG
GTTGTCGTTA ACCTGTATCC GTTCGCCGAA ACCGTGGCAC GCGAAGGCTG CTCGCTGGAA
GATGCGGTAG AGAACATTGA TATCGGCGGC CCGACCATGG TGCGCTCTGC TGCTAAGAAC
CATAAAGACG TCGCCATCGT GGTGAAGAGC AGCGACTACG ACGCCATTAT TAAAGAGATG
GATGCTAACG AAGGTTCTCT GACCCTCGAC ACCCGTTTCG ATCTCGCGAT TAAAGCCTTC
GAACACACCG CCGCCTACGA CAGCATGATC GCTAACTACT TCGGCAGCAT GGTTCCGGCC
TACCACGGTG AAAGCAAAGA AGCCGCCGGT CGCTTCCCGC GTACGCTGAA TCTGAACTTC
ATTAAGAAGC AGGATATGCG CTATGGCGAG AACAGCCACC AGCAGGCAGC CTTCTATATA
GAAGAAAATG TGAAAGAAGC GTCCGTTGCC ACCGCACAGC AGGTTCAGGG CAAAGCGCTT
TCTTACAACA ACATCGCTGA TACCGACGCG GCGCTGGAGT GTGTGAAAGA GTTCAACGAG
CCAGCCTGCG TAATCGTCAA GCACGCTAAC CCGTGCGGCG TGGCGGTGAG CACCACTATT
CTCGACGCTT ACGACCGTGC GTATAAAACC GACCCAACCT CCGCGTTCGG CGGCATTATC
GCCTTCAACC GCGAACTGGA TGCCGAAACC GCGCAGGCCA TCATCTCCCG CCAGTTCGTG
GAAGTGATCA TCGCCCCGTC CGCGACCGAA GATGCGCTGA AAATCACGGC TGCCAAGCAG
AATGTGCGCG TACTGACCTG TGGTCAGTGG GCACAGCGCG TACCTGGTCT GGACTTCAAA
CGCGTTAACG GCGGCCTGCT GGTTCAGGAT CGTGACCTGG GTATGGTGAG CGAAGCTGAA
CTGCGCGTGG TTTCCAAACG CCAGCCGACC GAGCAGGAGC TGCGTGACGC GCTGTTCTGC
TGGAAGGTAG CCAAGTTCGT GAAATCCAAC GCCATTGTGT ATGCCAAAGA GAACATGACT
ATCGGCATAG GCGCAGGCCA GATGAGCCGC GTGTACTCCG CGAAAATCGC TGGCATTAAA
GCGGCTGACG AAGGTCTGGA AGTGAAAGGC TCCGCGATGG CCTCTGACGC CTTCTTCCCG
TTCCGTGACG GTATTGATGC CGCTGCCGCT GTCGGCGTGA GCTGCGTTAT CCAGCCTGGC
GGCTCTATCC GTGATGATGA AGTCATTGCC GCCGCCGACG AACACGGCAT TGCGATGATC
TTCACCGACA TGCGCCACTT CCGCCATTAA
 
Protein sequence
MQQRRPVRRA LLSVSDKAGI IEFAQALSAR GVELLSTGGT ARLLAEKGLA VTEVSDYTGF 
PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM EQHHIAPIDM VVVNLYPFAE TVAREGCSLE
DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DANEGSLTLD TRFDLAIKAF
EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKQDMRYGE NSHQQAAFYI
EENVKEASVA TAQQVQGKAL SYNNIADTDA ALECVKEFNE PACVIVKHAN PCGVAVSTTI
LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSATE DALKITAAKQ
NVRVLTCGQW AQRVPGLDFK RVNGGLLVQD RDLGMVSEAE LRVVSKRQPT EQELRDALFC
WKVAKFVKSN AIVYAKENMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SAMASDAFFP
FRDGIDAAAA VGVSCVIQPG GSIRDDEVIA AADEHGIAMI FTDMRHFRH