Gene SeHA_C2300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2300 
SymbolhisB 
ID6488852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2200613 
End bp2201680 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content52% 
IMG OID642742489 
Productimidazole glycerol-phosphate dehydratase/histidinol phosphatase 
Protein accessionYP_002046124 
Protein GI194448612 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0131] Imidazoleglycerol-phosphate dehydratase
[COG0241] Histidinol phosphatase and related phosphatases 
TIGRFAM ID[TIGR01261] histidinol-phosphatase
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0245158 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value0.0544886 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAGA AGTATCTTTT CATCGACCGG GACGGAACCT TAATTTCCGA ACCGCCGAGC 
GATTTTCAGG TAGACCGCTT TGATAAACTG GCCTTTGAGC CAGAGGTGAT TCCCGTATTG
CTGAAGCTGC AAAAAGCCGG TTTTAAGCTG GTGATGATCA CTAACCAGGA TGGACTTGGC
ACGCAAAGCT TCCCGCAGGC GGACTTCGAC GGACCGCACA ACCTGATGAT GCAGATTTTC
ACCTCTCAGG GCGTATGCTT TGATGAGGTG CTGATCTGCC CTCACCTGCC CGCAGACGAC
TGCGACTGCC GCAAGCCCAA AGTGAAGCTG GTGGAGCGTT ATCTTGCGGA ACAAGCGATG
GATAGCGCCA ACAGCTATGT GATTGGCGAT CGTGCGACCG ATATCCAGCT CGCTGATAAC
ATGGGCATTA CTGGTTTACG CTATCACCGT GAAACGCTGA ACTGGACGAT GATTGGCGAA
CAGCTAACGA AACGCGATCG TTATGCGCAT GTGGTCCGCA ACACCAAAGA AACACAGATT
GATGTCAGCG TCTGGCTGGA TCGCGAAGGC AACAGCAAGA TTAATACCGG CGTCGGCTTC
TTTGACCATA TGCTCGATCA AATCGCCACC CACGGCGGCT TTCGCATGGA GATTACCGTT
AAGGGCGATC TCTATATCGA CGATCACCAC ACGGTAGAAG ATACCGGACT GGCGCTCGGT
GAGGCATTAA AACTGGCACT CGGCGACAAG CGCGGTATCT GCCGTTTTGG CTTTGTACTA
CCGATGGATG AATGTCTGGC GCGCTGCGCG CTGGATATTT CCGGTCGTCC GCATCTGGAA
TATAAAGCTG AATTTACCTA CCAGCGTGTG GGCGATTTGA GCACAGAGAT GATTGAACAC
TTTTTCCGCT CACTCTCTTA CACGATGGGC GTCACTCTGC ATCTCAAGAC TAAAGGTAAG
AACGATCACC ACCGTGTCGA AAGTTTGTTT AAAGCCTTTG GTCGGACGCT ACGCCAGGCT
ATTCGCGTGG AGGGCGATAC ATTACCGTCC TCGAAAGGAG TGCTGTGA
 
Protein sequence
MSQKYLFIDR DGTLISEPPS DFQVDRFDKL AFEPEVIPVL LKLQKAGFKL VMITNQDGLG 
TQSFPQADFD GPHNLMMQIF TSQGVCFDEV LICPHLPADD CDCRKPKVKL VERYLAEQAM
DSANSYVIGD RATDIQLADN MGITGLRYHR ETLNWTMIGE QLTKRDRYAH VVRNTKETQI
DVSVWLDREG NSKINTGVGF FDHMLDQIAT HGGFRMEITV KGDLYIDDHH TVEDTGLALG
EALKLALGDK RGICRFGFVL PMDECLARCA LDISGRPHLE YKAEFTYQRV GDLSTEMIEH
FFRSLSYTMG VTLHLKTKGK NDHHRVESLF KAFGRTLRQA IRVEGDTLPS SKGVL