Gene SNSL254_A2255 details


Gene Information

Locus tag: SNSL254_A2255
Symbol: hisA
ID: 6482310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2163353
End bp: 2164090
Gene Length: 738 bp
Protein Length: 245 aa
Translation table: 11
GC content: 58%
IMG OID: 642737602
Product: 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase
Protein accession: YP_002041344
Protein GI: 194444200
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase
TIGRFAM ID: [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase
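
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that start and end are 1-based inclusive positions and that the CDS includes a terminal stop codon; both are assumptions about this record's conventions, not something the record states. The function names are hypothetical helpers introduced for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which codes for nothing.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2163353, 2164090)
print(length)                     # 738
print(protein_length_aa(length))  # 245
```

Under those assumptions the computed values agree with the listed gene length of 738 bp and protein length of 245 aa.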


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.0530045
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTATTC CGGCATTAGA TTTAATTGAC GGCACCGTGG TGCGTCTCCA CCAGGGTGAC 
TACGCCCGGC AGCGGGATTA CGGTAACGAT CCCCTGCCCC GTTTGCAGGA TTACGCCGCC
CAGGGCGCCG GGGTGCTGCA TCTGGTAGAT CTGACCGGCG CTAAAGATCC GGCTAAGCGA
CAGATACCGC TGATTAAAAC CCTGGTCGCG GGCGTGAACG TGCCTGTTCA GGTCGGCGGC
GGCGTGCGTA CCGAAGAAGA CGTTGCGGCA TTACTGAAAG CTGGCGTTGC CCGTGTGGTC
ATCGGTTCAA CGGCGGTGAA ATCCCCTGAC GTGGTGAAAG GCTGGTTTGA ACGTTTTGGC
GCGCAGGCGC TGGTACTGGC GCTGGACGTT CGCATAGACG AACACGGCAA CAAGCAGGTG
GCGGTTAGCG GCTGGCAGGA AAATTCCGGC GTCTCGCTGG AACAACTGGT GGAGACCTAT
CTCCCCGTCG GCCTGAAACA TGTACTGTGT ACCGATATTT CTCGCGACGG CACGCTGGCG
GGCTCTAACG TTTCGCTGTA CGAAGAGGTA TGCGCCAGAT ATCCGCAGAT CGCCTTTCAA
TCCTCCGGCG GTATTGGCGA TATCGATGAT ATTGCCGCCC TGCGCGGCAC CGGCGTGCGC
GGCGTGATTG TCGGACGCGC TCTGTTAGAA GGGAAATTTA CCGTTAAGGA GGCCATCCAA
TGCTGGCAAA ACGTATAA
 
Protein sequence
MIIPALDLID GTVVRLHQGD YARQRDYGND PLPRLQDYAA QGAGVLHLVD LTGAKDPAKR 
QIPLIKTLVA GVNVPVQVGG GVRTEEDVAA LLKAGVARVV IGSTAVKSPD VVKGWFERFG
AQALVLALDV RIDEHGNKQV AVSGWQENSG VSLEQLVETY LPVGLKHVLC TDISRDGTLA
GSNVSLYEEV CARYPQIAFQ SSGGIGDIDD IAALRGTGVR GVIVGRALLE GKFTVKEAIQ
CWQNV
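
To check the gene sequence against the protein sequence and the listed GC content, a small script along the following lines may help. It is a minimal sketch: it assumes the sequence is given on the coding strand in 5' to 3' order, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ in which codons may act as start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, introduced here for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base
# varies fastest). Table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence listed above.
first_row = "ATGATTATTC CGGCATTAGA TTTAATTGAC GGCACCGTGG TGCGTCTCCA CCAGGGTGAC"
last_row = "TGCTGGCAAA ACGTATAA"
print(translate(first_row))  # MIIPALDLIDGTVVRLHQGD
print(translate(last_row))   # CWQNV
```

The first row translates to the first 20 residues of the protein sequence, and the last row yields its final five residues before the TAA stop codon. Passing the full gene sequence to `gc_percent` gives a figure to compare with the listed GC content.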