Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1036 |
Symbol | hisA |
ID | 6143881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1057238 |
End bp | 1057975 |
Gene Length | 738 bp |
Protein Length | 245 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615923 |
Product | 1-(5-phosphoribosyl)-5-[(5- phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase |
Protein accession | YP_001743115 |
Protein GI | 170680218 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.202251 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTATTC CGGCATTAGA TTTAATCGAC GGCACCGTGG TGCGTCTCCA TCAGGGCGAT TATGGCAAAC AACGCGATTA CGGTAACGAC CCGCTGCCAC GCTTACAGGA TTACGCCGCA CAAGGTGCTG AAGTGCTGCA CCTGGTGGAT CTGACCGGGG CAAAAGATCC GGCTAAACGT CAAATCCCGC TGATTAAAAC CCTGGTCGCG GGCGTTAACG TTCCGGTGCA GGTTGGCGGC GGCGTGCGTA GCGAAGAAGA CGTAGCGGCG TTACTGGAAG CAGGTGTTGC GCGTGTGGTA GTTGGCTCCA CCGCGGTGAA ATCACCAGAA AGGGTGAAAG GCTGGTTTGA ACGCTTCGGT GCGGATGCCT TAGTGCTGGC GCTGGATGTC CGTATTGACG AGCAAGGCAA CAAGCAAGTG GCGGTCAGTG GCTGGCAAGA GAATTCGGGT GTGTCGCTGG AACAACTGGT GGAAACCTAT CTGCCCGTCG GCCTGAAACA TGTGCTGTGT ACCGATATCT CGCGCGACGG CACGCTGGCA GGCTCTAACG TCTCTTTATA TGAAGAAGTG TGCGCCAGAT ATCCGCAAGT AGCATTTCAG TCCTCCGGCG GTATTGGCGA CATTAATGAT GTCGCTGCCC TGCGTGGCAC TGGCGTGCGC GGCGTAATAG TTGGTCGGGC ATTACTGGAA GGTAAATTCA CCGTGAAGGA GGCCATCGCA TGCTGGCAAA ACGCATAA
|
Protein sequence | MIIPALDLID GTVVRLHQGD YGKQRDYGND PLPRLQDYAA QGAEVLHLVD LTGAKDPAKR QIPLIKTLVA GVNVPVQVGG GVRSEEDVAA LLEAGVARVV VGSTAVKSPE RVKGWFERFG ADALVLALDV RIDEQGNKQV AVSGWQENSG VSLEQLVETY LPVGLKHVLC TDISRDGTLA GSNVSLYEEV CARYPQVAFQ SSGGIGDIND VAALRGTGVR GVIVGRALLE GKFTVKEAIA CWQNA
|
| |