Gene SNSL254_A2253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2253 
SymbolhisB 
ID6484856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2161696 
End bp2162763 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content52% 
IMG OID642737600 
Productimidazole glycerol-phosphate dehydratase/histidinol phosphatase 
Protein accessionYP_002041342 
Protein GI194445508 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0131] Imidazoleglycerol-phosphate dehydratase
[COG0241] Histidinol phosphatase and related phosphatases 
TIGRFAM ID[TIGR01261] histidinol-phosphatase
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.0799032 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAGA AGTATCTTTT CATCGACCGG GACGGAACCT TGATTTCCGA ACCGCCGAGC 
GATTTTCAGG TAGACCGCTT TGATAAACTG GCCTTTGAGC CAGAGGTGAT TCCCGTATTG
CTGAAGCTGC AAAAAGCCGG TTTTAAGCTG GTGATGATCA CTAACCAGGA TGGACTTGGC
ACGCAAAGCT TCCCGCAGGC GGACTTCGAC GGACCGCACA ACCTGATGAT GCAGATTTTC
ACTTCTCAGG GCGTATGCTT TGATGAGGTA CTGATCTGCC CTCACCTGCC CGCAGACGAC
TGCGACTGCC GCAAGCCCAA AGTGAAGCTG GTGGAGCGTT ATCTTGCGGA ACAAGCGATG
GATAGCGCCA ACAGCTATGT GATTGGCGAT CGTGCGACCG ATATCCAGCT CGCTGATAAC
ATGGGCATTA CTGGTTTACG CTATCACCGT GAAACGCTGA ACTGGACGAT GATTGGCGAA
CAGCTAACGA AACGCGATCG CTATGCGCAT GTGGTCCGCA ACACCAAAGA AACACAGATT
GATGTCAGCG TCTGGCTGGA TCGCGAAGGC AACAGCAAGA TTAATACCGG CGTCGGCTTC
TTTGACCATA TGCTCGATCA AATCGCCACC CACGGCGGCT TTCGCATGGA GATTACCGTT
AAGGGCGATC TCTATATCGA CGATCACCAC ACGGTAGAAG ATACCGGACT GGCGCTCGGT
GAGGCATTAA AACTGGCACT CGGCGACAAA CGCGGTATCT GCCGTTTTGG CTTTGTACTG
CCGATGGATG AATGTCTGGC GCGCTGCGCG CTGGATATTT CCGGTCGTCC GCATCTGGAA
TATAAAGCTG AATTTACCTA CCAGCGTGTG GGCGATTTGA GCACAGAGAT GATTGAACAC
TTTTTCCGCT CACTCTCTTA CACGATGGGC GTCACTCTGC ATCTCAAGAC TAAGGGTAAG
AACGATCACC ACCGTGTCGA AAGTTTGTTT AAAGCCTTTG GTCGTACGCT ACGCCAGGCT
ATTCGCGTGG AGGGCGATAC ATTACCGTCC TCGAAAGGAG TGCTGTGA
 
Protein sequence
MSQKYLFIDR DGTLISEPPS DFQVDRFDKL AFEPEVIPVL LKLQKAGFKL VMITNQDGLG 
TQSFPQADFD GPHNLMMQIF TSQGVCFDEV LICPHLPADD CDCRKPKVKL VERYLAEQAM
DSANSYVIGD RATDIQLADN MGITGLRYHR ETLNWTMIGE QLTKRDRYAH VVRNTKETQI
DVSVWLDREG NSKINTGVGF FDHMLDQIAT HGGFRMEITV KGDLYIDDHH TVEDTGLALG
EALKLALGDK RGICRFGFVL PMDECLARCA LDISGRPHLE YKAEFTYQRV GDLSTEMIEH
FFRSLSYTMG VTLHLKTKGK NDHHRVESLF KAFGRTLRQA IRVEGDTLPS SKGVL