Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A2414 |
Symbol | hisA |
ID | 6872718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 2285095 |
End bp | 2285832 |
Gene Length | 738 bp |
Protein Length | 245 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642785504 |
Product | 1-(5-phosphoribosyl)-5-[(5- phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase |
Protein accession | YP_002216162 |
Protein GI | 198243070 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.00286316 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTATTC CGGCATTAGA TTTAATTGAC GGCACCGTGG TGCGTCTCCA CCAGGGCGAC TACGCCCGGC AGCGGGATTA CGGTAACGAT CCCCTGCCCC GTTTGCAGGA TTACGCCGCC CAGGGCGCCG GGGTGCTGCA TCTGGTAGAT CTGACCGGCG CTAAAGATCC GGCTAAGCGA CAGATACCGC TGATTAAAAC CCTGGTCGCG GGCGTGAACG TGCCTGTTCA GGTCGGCGGC GGCGTGCGTA CCGAAGAAGA CGTTGCGGCA TTACTGAAAG CTGGCGTTGC CCGTGTGGTC ATCGGTTCAA CGGCGGTGAA ATCCCCTGAC GTGGTGAAAG GCTGGTTTGA ACGTTTTGGC GCGCAGGCGC TGGTACTGGC GCTGGACGTT CGCATAGACG AACACGGCAA CAAGCAGGTG GCGGTTAGCG GCTGGCAGGA AAATTCCGGC GTCTCGCTGG AACAACTGGT GGAGACCTAT CTCCCCGTCG GCCTGAAACA TGTACTGTGT ACCGATATTT CTCGCGACGG CACGCTGGCG GGCTCTAACG TTTCGCTGTA CGAAGAGGTA TGCGCCAGAT ATCCGCAGAT CGCCTTTCAA TCCTCCGGCG GTATTGGCGA TATCGATGAT ATTGCCGCCC TGCGCGGCAC CGGCGTGCGC GGCGTGATTG TCGGACGCGC TCTGTTAGAA GGGAAATTTA CCGTTAAGGA GGCCATCCAA TGCTGGCAAA ACGTATAA
|
Protein sequence | MIIPALDLID GTVVRLHQGD YARQRDYGND PLPRLQDYAA QGAGVLHLVD LTGAKDPAKR QIPLIKTLVA GVNVPVQVGG GVRTEEDVAA LLKAGVARVV IGSTAVKSPD VVKGWFERFG AQALVLALDV RIDEHGNKQV AVSGWQENSG VSLEQLVETY LPVGLKHVLC TDISRDGTLA GSNVSLYEEV CARYPQIAFQ SSGGIGDIDD IAALRGTGVR GVIVGRALLE GKFTVKEAIQ CWQNV
|
| |