Gene SNSL254_A2251 details


Gene Information

Locus tag: SNSL254_A2251
Symbol: hisD
ID: 6486222
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2159319
End bp: 2160623
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 58%
IMG OID: 642737598
Product: histidinol dehydrogenase
Protein accession: YP_002041340
Protein GI: 194442217
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
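
The coordinates and lengths listed above can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length; the record does not state either convention, so both are assumptions. The function and variable names are illustrative only.

```python
# Values copied from the record; names are hypothetical, not part of any database API.
START_BP = 2159319
END_BP = 2160623


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1305, matching "Gene Length"
print(protein_length(length_bp))  # 434, matching "Protein Length"
```

Under these assumptions, 1305 bp corresponds to 435 codons, of which 434 encode residues and one is the stop codon.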


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.849877
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value: 0.226426
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTTCA ATACCCTGAT TGACTGGAAC AGCTGTAGCC CTGAACAGCA GCGTGCGCTG 
CTGACGCGTC CGGCGATTTC CGCCTCTGAC AGTATTACCC GGACGGTCAG CGATATTCTG
GATAATGTAA AAACGCGCGG TGACGATGCC CTGCGTGAAT ACAGCGCTAA ATTTGATAAA
ACAGAAGTGA CAGCGCTACG CGTCACCCCT GAAGAGATCG CCGCCGCCGG CGCGCGTCTG
AGCGACGAAT TAAAACAGGC GATGGCCGCT GCCGTCAAAA ATATTGAAAC GTTCCATTCC
GCGCAGACGC TACCGCCTGT AGATGTGGAA ACCCAGCCAG GCGTACGTTG TCAGCAGGTT
ACGCGTCCCG TCGCGTCTGT CGGTCTTTAT ATTCCCGGCG GCTCGGCTCC GCTCTTCTCA
ACGGTGCTGA TGCTGGCAAC GCCGGCGCGC ATTGCGGGAT GTCAGAACGT GGTTCTGTGC
TCGCCGCCGC CCATCGCTGA TGAAATCCTC TATGCGGCAC AACTGTGTGG CGTGCAGGAA
ATCTTTAACG TCGGCGGCGC GCAGGCGATT GCCGCTCTGG CCTTCGGCAG CGAGTCCGTA
CCGAAAGTGG ATAAAATTTT TGGCCCCGGC AACGCCTTTG TAACCGAAGC CAAACGTCAG
GTCAGCCAGC GTCTCGACGG CGCGGCTATC GATATGCCAG CCGGGCCGTC TGAAGTACTG
GTGATCGCAG ACAGCGGCGC AACACCGGAT TTCGTCGCTT CTGACCTGCT CTCCCAGGCT
GAGCACGGCC CGGATTCCCA GGTGATCCTG CTGACGCCTG ATGCTGACAT TGCCCGCAAG
GTGGCGGAGG CGGTAGAACG TCAACTGGCG GAACTGCCGC GCGCGGACAC CGCCCGGCAG
GCCCTGAGCG CCAGTCGTCT GATTGTGACC AAAGATTTAG CGCAGTGCGT CGCCATCTCT
AATCAGTATG GGCCGGAACA CTTAATCATC CAGACGCGCA ATGCGCGCGA TTTGGTGGAT
GCGATTACCA GCGCAGGCTC GGTATTTCTC GGCGACTGGT CGCCGGAATC CGCCGGTGAT
TACGCTTCCG GAACCAACCA TGTTTTACCG ACCTATGGCT ATACTGCTAC CTGTTCCAGC
CTTGGGTTAG CGGATTTCCA GAAACGGATG ACCGTTCAGG AACTGTCGAA AGCGGGCTTT
TCCGCTCTGG CATCAACCAT TGAAACATTG GCGGCGGCAG AACGTCTGAC CGCCCATAAA
AATGCCGTGA CCCTGCGCGT AAACGCCCTC AAGGAGCAAG CATGA
 
Protein sequence
MSFNTLIDWN SCSPEQQRAL LTRPAISASD SITRTVSDIL DNVKTRGDDA LREYSAKFDK 
TEVTALRVTP EEIAAAGARL SDELKQAMAA AVKNIETFHS AQTLPPVDVE TQPGVRCQQV
TRPVASVGLY IPGGSAPLFS TVLMLATPAR IAGCQNVVLC SPPPIADEIL YAAQLCGVQE
IFNVGGAQAI AALAFGSESV PKVDKIFGPG NAFVTEAKRQ VSQRLDGAAI DMPAGPSEVL
VIADSGATPD FVASDLLSQA EHGPDSQVIL LTPDADIARK VAEAVERQLA ELPRADTARQ
ALSASRLIVT KDLAQCVAIS NQYGPEHLII QTRNARDLVD AITSAGSVFL GDWSPESAGD
YASGTNHVLP TYGYTATCSS LGLADFQKRM TVQELSKAGF SALASTIETL AAAERLTAHK
NAVTLRVNAL KEQA
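
The sequences above are printed in blocks of ten characters separated by spaces. The following is a minimal sketch of how such a listing could be joined, its GC fraction computed, and the gene translated for comparison with the protein sequence. The helper names are hypothetical. The codon table is built by hand; translation table 11 assigns the same amino acids to codons as the standard table and differs in which start codons are permitted, so alternative start codons are not handled here (this gene begins with ATG).

```python
# Compact codon table: bases in TCAG order, one amino-acid letter per codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence listing, copied from the record.
first_line = "ATGAGCTTCA ATACCCTGAT TGACTGGAAC AGCTGTAGCC CTGAACAGCA GCGTGCGCTG"
print(translate(first_line))  # MSFNTLIDWNSCSPEQQRAL
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene listing to `gc_fraction` and `translate` would allow the same comparison against the GC content and the complete protein sequence in the record.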