Gene Information |
Locus tag | SNSL254_A2253 |
Symbol | hisB |
ID | 6484856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 2161696 |
End bp | 2162763 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642737600 |
Product | imidazole glycerol-phosphate dehydratase/histidinol phosphatase |
Protein accession | YP_002041342 |
Protein GI | 194445508 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0131] Imidazoleglycerol-phosphate dehydratase [COG0241] Histidinol phosphatase and related phosphatases |
TIGRFAM ID | [TIGR01261] histidinol-phosphatase [TIGR01656] histidinol-phosphate phosphatase family domain [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA |
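
The length fields above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the CDS is taken to include its stop codon. Both readings are assumptions, although they match the numbers in this record. A minimal Python sketch of that arithmetic follows; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2161696, 2162763)
print(length_bp)                  # 1068
print(protein_length(length_bp))  # 355
```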
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.0799032 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAGA AGTATCTTTT CATCGACCGG GACGGAACCT TGATTTCCGA ACCGCCGAGC GATTTTCAGG TAGACCGCTT TGATAAACTG GCCTTTGAGC CAGAGGTGAT TCCCGTATTG CTGAAGCTGC AAAAAGCCGG TTTTAAGCTG GTGATGATCA CTAACCAGGA TGGACTTGGC ACGCAAAGCT TCCCGCAGGC GGACTTCGAC GGACCGCACA ACCTGATGAT GCAGATTTTC ACTTCTCAGG GCGTATGCTT TGATGAGGTA CTGATCTGCC CTCACCTGCC CGCAGACGAC TGCGACTGCC GCAAGCCCAA AGTGAAGCTG GTGGAGCGTT ATCTTGCGGA ACAAGCGATG GATAGCGCCA ACAGCTATGT GATTGGCGAT CGTGCGACCG ATATCCAGCT CGCTGATAAC ATGGGCATTA CTGGTTTACG CTATCACCGT GAAACGCTGA ACTGGACGAT GATTGGCGAA CAGCTAACGA AACGCGATCG CTATGCGCAT GTGGTCCGCA ACACCAAAGA AACACAGATT GATGTCAGCG TCTGGCTGGA TCGCGAAGGC AACAGCAAGA TTAATACCGG CGTCGGCTTC TTTGACCATA TGCTCGATCA AATCGCCACC CACGGCGGCT TTCGCATGGA GATTACCGTT AAGGGCGATC TCTATATCGA CGATCACCAC ACGGTAGAAG ATACCGGACT GGCGCTCGGT GAGGCATTAA AACTGGCACT CGGCGACAAA CGCGGTATCT GCCGTTTTGG CTTTGTACTG CCGATGGATG AATGTCTGGC GCGCTGCGCG CTGGATATTT CCGGTCGTCC GCATCTGGAA TATAAAGCTG AATTTACCTA CCAGCGTGTG GGCGATTTGA GCACAGAGAT GATTGAACAC TTTTTCCGCT CACTCTCTTA CACGATGGGC GTCACTCTGC ATCTCAAGAC TAAGGGTAAG AACGATCACC ACCGTGTCGA AAGTTTGTTT AAAGCCTTTG GTCGTACGCT ACGCCAGGCT ATTCGCGTGG AGGGCGATAC ATTACCGTCC TCGAAAGGAG TGCTGTGA
|
Protein sequence | MSQKYLFIDR DGTLISEPPS DFQVDRFDKL AFEPEVIPVL LKLQKAGFKL VMITNQDGLG TQSFPQADFD GPHNLMMQIF TSQGVCFDEV LICPHLPADD CDCRKPKVKL VERYLAEQAM DSANSYVIGD RATDIQLADN MGITGLRYHR ETLNWTMIGE QLTKRDRYAH VVRNTKETQI DVSVWLDREG NSKINTGVGF FDHMLDQIAT HGGFRMEITV KGDLYIDDHH TVEDTGLALG EALKLALGDK RGICRFGFVL PMDECLARCA LDISGRPHLE YKAEFTYQRV GDLSTEMIEH FFRSLSYTMG VTLHLKTKGK NDHHRVESLF KAFGRTLRQA IRVEGDTLPS SKGVL
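
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as starts, so a standard codon map is enough for this sketch. The code below is illustrative only: the function names are hypothetical, it assumes the gene sequence is the coding strand written 5' to 3', and it treats the blocks of ten as display grouping. It also includes a helper for computing a GC fraction that can be compared with the GC content field.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop whitespace used for display grouping and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, dropping one trailing stop codon if present."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGAGTCAGA AGTATCTTTT CATCGACCGG"))  # MSQKYLFIDR
print(translate("TCGAAAGGAG TGCTGTGA"))               # SKGVL
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence field is a quick consistency check on a record like this one.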