Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_0216 |
Symbol | purH |
ID | 5110767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 248731 |
End bp | 250383 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640490378 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001174957 |
Protein GI | 146309883 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000146667 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0046283 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGTTTTT TGCGAAAAAT TCATCTAACA CTCTCTGTCA TCGTGAAATC CAGGGGATTT ACCATGCAAC AACGTCGTCC AGTCCGCCGC GCTCTGCTCA GTGTTTCTGA CAAAGCCGGT ATCGTCGAAT TCGCTCAGGC ACTTTCTGCA CGTGGTGTAG AACTGCTATC CACAGGCGGC ACCGCTCGCC TGTTAGCAGA TAAAGGTCTG CCGGTAACCG AAGTGTCCGA TTACACCGGT TTCCCGGAAA TGATGGATGG ACGCGTAAAG ACCCTGCATC CGAAAGTACA CGGCGGCATT CTCGGTCGTC GCGGCCAGGA CGACGGCATC ATGGAACAAC ACGACATCGC CCCGATCGAT ATGGTGGTCG TTAACCTTTA TCCGTTCGCC CAAACCGTCG CACGCGAAAA CTGCTCACTG GAAGACGCCG TTGAGAACAT TGATATCGGT GGCCCGACCA TGGTGCGCTC CGCGGCGAAG AACCATAAAG ATGTTGCCAT CGTAGTAAAG AGCAGTGACT ACGACGTCAT TATTAAAGAA ATGGATGCCA ACGAAGGTTC TCTTCTGCTG GCGACCCGTT TCGACCTCGC CATCAAAGCG TTTGAACACA CCGCCGCTTA CGACAGCATG ATCGCCAACT ACTTTGGTAG CCTGGTTCCG GCCTATCACG GCGAAAGCAA CGAACCTTCA GGTCGTTTCC CGCGTACCCT CAATCTGAAC TTCATTAAGA AGCAGGATAT GCGTTACGGC GAAAACAGCC ACCAGAACGC AGCCTTCTAT ATAGAAGAAG AAATTAAAGA GGCGTCCGTC GCCACTGCTC AACAAGTTCA AGGCAAAGCG CTCTCTTATA ACAACATCGC CGATACCGAT GCGGCGCTGG AGTGTGTGAA AGAGTTCAGC GAGCCGGCAT GCGTCATCGT GAAACATGCC AATCCGTGTG GCGTTGCCGT CAGCACGTCT ATTCTTGAAG CCTACGACCG GGCTTACAAA ACCGATCCGA CGTCCGCGTT CGGCGGCATT ATCGCGTTTA ACCGTGAACT TGATGCCGAG ACGGCACAGG CAATCATCTC CCGTCAGTTT GTCGAAGTGA TCATCGCGCC TTCCGCAACA GAAGAAGCCC TGAAAATCAC CGCAGCCAAA CAAAACGTTC GCGTGCTGGT TTGTGGTCAG TGGGCTAAGC GCGTTCCAGG TCTGGATTTC AAACGTGTTA ATGGCGGCCT GCTGGTTCAG GATCGTGATT TGGGCATGGT GACTGCGGGC GGCCTGCGTT TCGTGACTCA ACGTCAGCCA ACCGAACAAG AACTGCGTGA CGCGCTGTTC TGCTGGAAGG TCGCCAAATT TGTTAAATCC AACGCGATTG TGTATTCGAA AGAGAATATG ACGATCGGCA TAGGCGCAGG CCAGATGAGC CGCGTCTACT CTGCCAAAAT CGCCGGTATT AAAGCCAGCG ACGAAGGCCT GGAAGTAAAA GGCTCCGCAA TGGCATCTGA CGCCTTCTTC CCGTTCCGCG ACGGTATTGA TGCAGCAGCA GCCGTTGGCG TGACCTGTGT TATCCAGCCG GGCGGATCCA TTCGTGATGA TGAAGTCATC GCCGCCGCTG ACGAACACGG CATCGCCATG ATCTTCACCG ACATGCGTCA CTTCCGCCAT TAA
|
Protein sequence | MSFLRKIHLT LSVIVKSRGF TMQQRRPVRR ALLSVSDKAG IVEFAQALSA RGVELLSTGG TARLLADKGL PVTEVSDYTG FPEMMDGRVK TLHPKVHGGI LGRRGQDDGI MEQHDIAPID MVVVNLYPFA QTVARENCSL EDAVENIDIG GPTMVRSAAK NHKDVAIVVK SSDYDVIIKE MDANEGSLLL ATRFDLAIKA FEHTAAYDSM IANYFGSLVP AYHGESNEPS GRFPRTLNLN FIKKQDMRYG ENSHQNAAFY IEEEIKEASV ATAQQVQGKA LSYNNIADTD AALECVKEFS EPACVIVKHA NPCGVAVSTS ILEAYDRAYK TDPTSAFGGI IAFNRELDAE TAQAIISRQF VEVIIAPSAT EEALKITAAK QNVRVLVCGQ WAKRVPGLDF KRVNGGLLVQ DRDLGMVTAG GLRFVTQRQP TEQELRDALF CWKVAKFVKS NAIVYSKENM TIGIGAGQMS RVYSAKIAGI KASDEGLEVK GSAMASDAFF PFRDGIDAAA AVGVTCVIQP GGSIRDDEVI AAADEHGIAM IFTDMRHFRH
|
| |