Gene Spro_0293 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_0293 
SymbolpurH 
ID5607182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp334269 
End bp335858 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content59% 
IMG OID640935792 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001476531 
Protein GI157368542 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000389639 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0060748 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAACAAC CTCGTCCAAT CCGCCGGGCC CTGCTCAGCG TCTCTGACAA AGCCGGTATC 
GTTGAATTCG CCGAAGCGCT GTCCCAGCGT GGCGTTGAAC TGCTCTCCAC CGGTGGCACC
GCCCGCCTGC TGGCAGATGC CGGCCTGCCT GTTACCGAAG TTTCCGACTA CACCGGCTTC
CCGGAAATGA TGGACGGACG AGTAAAGACC CTGCACCCCA AAGTACACGG CGGTATTCTC
GGCCGCCGCG GCCAGGACGA CGCCATCATG GGTCAGCATG ACATCAAGCC GATCGACATG
GTGGTGGTAA ACCTCTATCC GTTCGCCCAG ACCGTGGCGC GCCCGAACTG CTCACTGGAA
GACGCGGTCG AGAACATCGA CATCGGCGGC CCAACCATGG TGCGTTCCGC GGCCAAGAAC
CACAAAGACG TCGCCATCGT GGTAAAGAGC AGCGACTACG CCGCTATTAT TACCGAGATG
GATAACAACG ACGGTTCACT GCAATACACC ACCCGTTTCG ATCTGGCCAT CAAAGCCTTC
GAGCACACCG CCGCTTACGA CAGCATGATC GCCAACTACT TCGGCGCGCT GGTTCCGGCC
TACCACGGCG ATACCGAACA ACCTGCCGGT CGTTTCCCTC GCACCCTGAA CCTCAACTAT
ATAAAGAAGC AGGATATGCG CTACGGTGAG AACAGCCACC AGCAAGCAGC CTTCTATATA
GAAGAGAACG TTCAGGAAGC CTCTGTCGCC ACCGCGGAAC AACTGCAAGG CAAAGCGCTG
TCCTACAACA ACATCGCCGA CACCGACGCC GCACTGGAAT GTGTGAAGGA ATTCGCCGAG
CCGGCCTGCG TGATCGTCAA GCACGCCAAC CCATGCGGTG TGGCGATCGG CGATGATATT
CTGTCTGCCT ATGAGCGCGC CTATCAAACC GACCCGACCT CTGCTTTCGG CGGCATCATC
GCCTTTAACC GCGAACTGGA CGCCGCTACC GCACAGGCCA TTATCAGCCG TCAGTTTGTG
GAAGTGATTA TCGCGCCGAG CATCAGTCAG GAAGCTCGCT CCCTGTTGGC AGCCAAACAG
AACGTGCGCG TACTGGCCTG CGGCCAATGG CAGCAACGTA TTGCCGCTCT CGACTTCAAA
CGTGTCAACG GTGGCCTGCT GGTGCAAGAC CGCGATCTGG GTATGGTGAG CGAAGGCGAC
CTGCGCGTGG TATCTGAACG TCAGCCGACC GCGCAGGAAC TGCGTGATGC GCTGTTCTGT
TGGAAAGTCG CCAAGTTCGT GAAGTCCAAC GCTATCGTCT ATGCACGTGA CAACATGACC
ATCGGCATAG GCGCCGGGCA AATGAGCCGC GTTTACTCTG CCAAGATCGC CGGGATCAAA
GCCGCGGACG AAGGCCTGGA AGTCAAAGGC TCCGCCATGG CGTCTGACGC TTTCTTCCCG
TTCCGTGATG GCATCGATGC CGCCGCAGCG GTGGGCATCA GCTGCGTGAT CCAGCCAGGC
GGTTCGATCC GCGATGATGA AGTGATTGCC GCCGCCAATG AGCACGGCAT CGCAATGATC
TTCACCGACA TGCGCCACTT CCGCCATTAA
 
Protein sequence
MQQPRPIRRA LLSVSDKAGI VEFAEALSQR GVELLSTGGT ARLLADAGLP VTEVSDYTGF 
PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM GQHDIKPIDM VVVNLYPFAQ TVARPNCSLE
DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYAAIITEM DNNDGSLQYT TRFDLAIKAF
EHTAAYDSMI ANYFGALVPA YHGDTEQPAG RFPRTLNLNY IKKQDMRYGE NSHQQAAFYI
EENVQEASVA TAEQLQGKAL SYNNIADTDA ALECVKEFAE PACVIVKHAN PCGVAIGDDI
LSAYERAYQT DPTSAFGGII AFNRELDAAT AQAIISRQFV EVIIAPSISQ EARSLLAAKQ
NVRVLACGQW QQRIAALDFK RVNGGLLVQD RDLGMVSEGD LRVVSERQPT AQELRDALFC
WKVAKFVKSN AIVYARDNMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SAMASDAFFP
FRDGIDAAAA VGISCVIQPG GSIRDDEVIA AANEHGIAMI FTDMRHFRH