Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2957 |
Symbol | hisA |
ID | 6966991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2732025 |
End bp | 2732762 |
Gene Length | 738 bp |
Protein Length | 245 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643386797 |
Product | 1-(5-phosphoribosyl)-5-[(5- phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase |
Protein accession | YP_002271265 |
Protein GI | 209400912 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0000000399894 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTATTC CGGCATTAGA TTTAATCGAC GGCACTGTGG TGCGTCTCCA TCAGGGCGAT TACGGCAAAC AACGCGATTA CGGTAACGAC CCGCTGCCGC GCTTACAGGA TTACGCCGCA CAAGGTGCCG AAGTGCTGCA TTTGGTGGAT CTGACCGGGG CAAAAGATCC GGCAAAACGT CAAATCCCGC TGATTAAAAC CCTGGTCGCT GGCGTTAACG TTCCGGTGCA GGTTGGCGGC GGCGTGCGTA GCGAAGAAGA CGTGGCGGCG TTACTGGAAG CTGGGGTTGC GCGCGTGGTG GTTGGCTCCA CTGCGGTGAA ATCACCAGAA ATGGTGAAAG GCTGGTTTGA ACGTTTCGGT GCCGATGCCT TAGTGCTGGC GCTGGATGTC CGTATTGACG AGCAAGGCAA CAAGCAGGTG GCGGTCAGCG GCTGGCAAGA GAATTCGGGC GTGTCACTGG AACAACTGGT GGAAACCTAT CTGCCCGTCG GCCTGAAACA TGTGCTGTGT ACCGATATCT CGCGCGACGG CACGCTGGCA GGCTCTAACG TCTCTTTATA TGAAGAAGTG TGCGCCAGAT ATCCGCAGGT GGCATTTCAG TCATCCGGCG GCATTGGCGA CATTAACGAT GTTGCTGCCC TGCGTGGCAC TGGCGTGCGC GGCGTAATAG TTGGTCGTGC ATTACTGGAA GGTAAATTCA CCGTGAAGGA GGCCATCTCA TGCTGGCAAA ACGCATAA
|
Protein sequence | MIIPALDLID GTVVRLHQGD YGKQRDYGND PLPRLQDYAA QGAEVLHLVD LTGAKDPAKR QIPLIKTLVA GVNVPVQVGG GVRSEEDVAA LLEAGVARVV VGSTAVKSPE MVKGWFERFG ADALVLALDV RIDEQGNKQV AVSGWQENSG VSLEQLVETY LPVGLKHVLC TDISRDGTLA GSNVSLYEEV CARYPQVAFQ SSGGIGDIND VAALRGTGVR GVIVGRALLE GKFTVKEAIS CWQNA
|
| |