Gene Ent638_0216 details

Gene Information

Locus tag: Ent638_0216
Symbol: purH
ID: 5110767
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 248731
End bp: 250383
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 54%
IMG OID: 640490378
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001174957
Protein GI: 146309883
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
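
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below shows that relationship. It assumes the coordinates are 1-based and inclusive (the record does not say so, but the stated length of 1653 bp is consistent with that convention) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 248731
END_BP = 250383


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp)                  # 1653
    print(protein_length(length_bp))  # 550
```

Both results match the Gene Length and Protein Length fields of the record.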


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000146667
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0046283
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGTTTTT TGCGAAAAAT TCATCTAACA CTCTCTGTCA TCGTGAAATC CAGGGGATTT 
ACCATGCAAC AACGTCGTCC AGTCCGCCGC GCTCTGCTCA GTGTTTCTGA CAAAGCCGGT
ATCGTCGAAT TCGCTCAGGC ACTTTCTGCA CGTGGTGTAG AACTGCTATC CACAGGCGGC
ACCGCTCGCC TGTTAGCAGA TAAAGGTCTG CCGGTAACCG AAGTGTCCGA TTACACCGGT
TTCCCGGAAA TGATGGATGG ACGCGTAAAG ACCCTGCATC CGAAAGTACA CGGCGGCATT
CTCGGTCGTC GCGGCCAGGA CGACGGCATC ATGGAACAAC ACGACATCGC CCCGATCGAT
ATGGTGGTCG TTAACCTTTA TCCGTTCGCC CAAACCGTCG CACGCGAAAA CTGCTCACTG
GAAGACGCCG TTGAGAACAT TGATATCGGT GGCCCGACCA TGGTGCGCTC CGCGGCGAAG
AACCATAAAG ATGTTGCCAT CGTAGTAAAG AGCAGTGACT ACGACGTCAT TATTAAAGAA
ATGGATGCCA ACGAAGGTTC TCTTCTGCTG GCGACCCGTT TCGACCTCGC CATCAAAGCG
TTTGAACACA CCGCCGCTTA CGACAGCATG ATCGCCAACT ACTTTGGTAG CCTGGTTCCG
GCCTATCACG GCGAAAGCAA CGAACCTTCA GGTCGTTTCC CGCGTACCCT CAATCTGAAC
TTCATTAAGA AGCAGGATAT GCGTTACGGC GAAAACAGCC ACCAGAACGC AGCCTTCTAT
ATAGAAGAAG AAATTAAAGA GGCGTCCGTC GCCACTGCTC AACAAGTTCA AGGCAAAGCG
CTCTCTTATA ACAACATCGC CGATACCGAT GCGGCGCTGG AGTGTGTGAA AGAGTTCAGC
GAGCCGGCAT GCGTCATCGT GAAACATGCC AATCCGTGTG GCGTTGCCGT CAGCACGTCT
ATTCTTGAAG CCTACGACCG GGCTTACAAA ACCGATCCGA CGTCCGCGTT CGGCGGCATT
ATCGCGTTTA ACCGTGAACT TGATGCCGAG ACGGCACAGG CAATCATCTC CCGTCAGTTT
GTCGAAGTGA TCATCGCGCC TTCCGCAACA GAAGAAGCCC TGAAAATCAC CGCAGCCAAA
CAAAACGTTC GCGTGCTGGT TTGTGGTCAG TGGGCTAAGC GCGTTCCAGG TCTGGATTTC
AAACGTGTTA ATGGCGGCCT GCTGGTTCAG GATCGTGATT TGGGCATGGT GACTGCGGGC
GGCCTGCGTT TCGTGACTCA ACGTCAGCCA ACCGAACAAG AACTGCGTGA CGCGCTGTTC
TGCTGGAAGG TCGCCAAATT TGTTAAATCC AACGCGATTG TGTATTCGAA AGAGAATATG
ACGATCGGCA TAGGCGCAGG CCAGATGAGC CGCGTCTACT CTGCCAAAAT CGCCGGTATT
AAAGCCAGCG ACGAAGGCCT GGAAGTAAAA GGCTCCGCAA TGGCATCTGA CGCCTTCTTC
CCGTTCCGCG ACGGTATTGA TGCAGCAGCA GCCGTTGGCG TGACCTGTGT TATCCAGCCG
GGCGGATCCA TTCGTGATGA TGAAGTCATC GCCGCCGCTG ACGAACACGG CATCGCCATG
ATCTTCACCG ACATGCGTCA CTTCCGCCAT TAA
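
A GC content figure like the one reported for this gene can be recomputed from a sequence laid out in space-separated blocks of ten bases, as above. The sketch below is one possible way to do that. It assumes GC content means the percentage of G and C bases among all bases in the gene sequence; the record does not state the exact method or rounding used. `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces between 10-base blocks, line breaks) is ignored.
    """
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence, used as a small example.
    example = "ATGAGTTTTT TGCGAAAAAT"
    print(round(gc_content(example), 1))
```

Passing the full gene sequence as one string, line breaks included, gives the figure for the whole gene, which can then be compared with the reported value.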
 
Protein sequence
MSFLRKIHLT LSVIVKSRGF TMQQRRPVRR ALLSVSDKAG IVEFAQALSA RGVELLSTGG 
TARLLADKGL PVTEVSDYTG FPEMMDGRVK TLHPKVHGGI LGRRGQDDGI MEQHDIAPID
MVVVNLYPFA QTVARENCSL EDAVENIDIG GPTMVRSAAK NHKDVAIVVK SSDYDVIIKE
MDANEGSLLL ATRFDLAIKA FEHTAAYDSM IANYFGSLVP AYHGESNEPS GRFPRTLNLN
FIKKQDMRYG ENSHQNAAFY IEEEIKEASV ATAQQVQGKA LSYNNIADTD AALECVKEFS
EPACVIVKHA NPCGVAVSTS ILEAYDRAYK TDPTSAFGGI IAFNRELDAE TAQAIISRQF
VEVIIAPSAT EEALKITAAK QNVRVLVCGQ WAKRVPGLDF KRVNGGLLVQ DRDLGMVTAG
GLRFVTQRQP TEQELRDALF CWKVAKFVKS NAIVYSKENM TIGIGAGQMS RVYSAKIAGI
KASDEGLEVK GSAMASDAFF PFRDGIDAAA AVGVTCVIQP GGSIRDDEVI AAADEHGIAM
IFTDMRHFRH