Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03836 |
Symbol | purH |
ID | 8113507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 4112056 |
End bp | 4113645 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644849995 |
Product | hypothetical protein |
Protein accession | YP_003001568 |
Protein GI | 251787264 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000322464 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAAC GTCGTCCAGT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA AGCCGGTATC GTCGAATTCG CCCAGGCACT TTCCGCACGC GGTGTGGAGC TGCTGTCTAC AGGGGGCACT GCCCGTCTGT TAGCAGAAAA AGGTCTGCCG GTAACCGAAG TTTCCGATTA CACCGGTTTC CCGGAGATGA TGGATGGACG CGTGAAGACC CTGCATCCGA AAGTACATGG TGGCATTCTG GGCCGTCGCG GCCAGGACGA TGCCATTATG GAAGAACATC AGATCCAGCC TATCGATATG GTGGTTGTTA ACCTGTATCC GTTCGCCCAG ACCGTGGCCC GTGAAGGTTG CTCGCTGGAA GATGCGGTTG AGAACATCGA TATCGGCGGC CCAACGATGG TGCGCTCCGC CGCCAAGAAC CATAAAGATG TCGCAATCGT GGTGAAGAGC AGCGACTATG ACGCCATTAT TAAAGAGATG GATGACAACG AAGGATCGCT GACGCTTGCA ACCCGTTTCG ACCTCGCCAT CAAAGCCTTC GAACACACTG CCGCCTACGA CAGCATGATT GCCAACTACT TCGGCAGCAT GGTTCCGGCT TACCACGGTG AAAGCAAAGA AGCCGCCGGT CGCTTCCCAC GCACGCTGAA CCTGAACTTC ATTAAGAAGC TGGATATGCG TTACGGCGAG AACAGCCACC AGCAGGCTGC CTTCTATATA GAAGAGAATG TGAAAGAAGC CTCCGTTGCT ACCGCAACCC AGGTTCAGGG TAAAGCCCTC TCTTATAACA ACATCGCCGA TACCGATGCG GCGCTGGAGT GCGTGAAAGA GTTCGCCGAG CCGGCATGTG TGATTGTGAA GCACGCCAAC CCTTGCGGCG TGGCTATCGG CAATTCCATT CTTGATGCTT ACGATCGCGC GTACAAAACC GACCCAACCT CCGCATTCGG CGGCATCATT GCCTTTAACC GCGAGCTGGA TGCGGAAACC GCACAGGCCA TCATTTCTCG TCAGTTTGTC GAAGTGATTA TTGCGCCGTC CGCCAGCGAA GAAGCCCTGA AAATCACCGC CGCCAAACAG AACGTACGCG TTCTGACCTG CGGTCAGTGG GGCGAGCGTG TTCCGGGCCT CGATTTCAAA CGCGTGAACG GCGGTCTGCT GGTTCAGGAT CGTGACCTGG GGATGGTCGG TGCGGAAGAA CTGCGCGTGG TGACCAAACG TCAGCCGAGC GAACAGGAAC TGCGTGATGC GCTGTTCTGC TGGAAGGTGG CGAAGTTTGT GAAATCCAAC GCTATCGTCT ATGCCAAAAA CAATATGACT ATCGGCATTG GCGCGGGCCA GATGAGCCGC GTGTACTCCG CAAAAATCGC CGGTATTAAA GCGGCCGATG AAGGCCTGGA AGTGAAAGGT TCCTCGATGG CTTCTGACGC GTTCTTCCCG TTCCGCGACG GTATTGATGC CGCCGCCGCT GCGGGCGTGA CCTGCGTAAT CCAGCCTGGC GGTTCTATCC GTGATGACGA AGTGATTGCC GCCGCCGACG AGCACGGTAT TGCGATGCTC TTCACCGACA TGCGCCACTT CCGCCATTAA
|
Protein sequence | MQQRRPVRRA LLSVSDKAGI VEFAQALSAR GVELLSTGGT ARLLAEKGLP VTEVSDYTGF PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM EEHQIQPIDM VVVNLYPFAQ TVAREGCSLE DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DDNEGSLTLA TRFDLAIKAF EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKLDMRYGE NSHQQAAFYI EENVKEASVA TATQVQGKAL SYNNIADTDA ALECVKEFAE PACVIVKHAN PCGVAIGNSI LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSASE EALKITAAKQ NVRVLTCGQW GERVPGLDFK RVNGGLLVQD RDLGMVGAEE LRVVTKRQPS EQELRDALFC WKVAKFVKSN AIVYAKNNMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SSMASDAFFP FRDGIDAAAA AGVTCVIQPG GSIRDDEVIA AADEHGIAML FTDMRHFRH
|
| |