Gene STER_1204 details

Gene Information

Locus tag: STER_1204
Symbol: hisD
ID: 4438706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus thermophilus LMD-9
Kingdom: Bacteria
Replicon accession: NC_008532
Strand:
Start bp: 1110765
End bp: 1112048
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 43%
IMG OID: 639676837
Product: histidinol dehydrogenase
Protein accession: YP_820590
Protein GI: 116627971
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
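
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1110765
end_bp = 1112048
gene_length_bp = 1284
protein_length_aa = 427

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```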


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGAT TAACTGGAAC TAATAAAGAA ATTGCTGAGC TTCTTTATCA GGAGCAATTG 
GAACTTTCAA AAGAAAATAG AGACGTTGAA AGAACGGTTC AAGCTATTAT CGAAGACGTT
AAAGAACGTG GAGACGAGGC CCTTCGTGAT TATTCAGCGA AGTTCGATAA GGTTGATTTG
ACTGATATTG AAGTTGGACA AGATCTTATT GATAAAGCCT TTAAGGAAAT TGATCCAGAG
GTTTATCAAG CCCTAGTCAA TGCTAAAGAA AACATTGAAT CTTACCACAA ACATCAGTTG
GAGGCTGGTT TTGAGGACCA ACCAAGTGAG GGAGTTATTC GTGGTCAAAT GATTCGTCCC
ATTAATCGTG TAGGTGTTTA TGTTCCTGGG GGAACTGCGG CCTATCCGTC CTCAGTCCTT
ATGAATGTCA TCCCTGCCAA AATTGCTGGG GTTAAAGAGA TTATTATGAT TACACCACCT
CAGGAACACT TTGTTCCTGC AATTCTAGTA GCTGCAAAAC TGGCTGGTGT GGACACCATT
TACCAAGTCG GTGGAGCTCA AGGAATTGCG GCCTTGGCAT TTGGGACAGA AACTCTTCCA
AAGGTGGATA AGATTACAGG TCCTGGTAAT ATTTTCGTAG CTACTGCTAA GAAACAGGTT
TACGGTATTG TCGGTATTGA TATGATTGCA GGGCCATCAG AGATTGGTGT CATTGCTGAT
AGTAGCGCTA ATCCAAGCTA TGTAGCAGCT GATCTTTTGT CTCAAGCGGA GCACGACAAG
CGAGCACGCG CCATTTTGGT AACAGACTCT GAAGCCTTGG CCGATGCGGT TGAAAGCGAG
ATTGAACGTC AACTCAAGCT TTTACCACGA GAAGCGATTG CTCGTCCTTC CATTGAAAAT
AACGGCCGTA TTATTATCAC TAAGGACACA GATGCCATGT TTGAACTCAT GAACTCGGTT
GCGCCAGAGC ATTTGGAAAT TGCTATGGAA AAGGCCTATG ATTATCTAGA AAAAGTGGAA
AATGCTGGTT CCGTCTTTCT TGGTCACTTT ACGAGTGAAC CAATTGGTGA CTACTATGCC
GGTGCTAATC ACATTCTTCC AACGACAGCA ACCAGCCGCT TTTCATCAGC TTTAGGTGTA
CATGATTTTG TCAAACGTAT CCAATATACG CAATACGACA AGGCAGCAGT CAATAAGGCA
AAACACGATA TTACAATCTT GGCTTATGCG GAAGGCTTGC AGGCCCACGC CAAGGCTATT
GAAGTCAGAA ATGACAATAA TTGA
 
Protein sequence
MKRLTGTNKE IAELLYQEQL ELSKENRDVE RTVQAIIEDV KERGDEALRD YSAKFDKVDL 
TDIEVGQDLI DKAFKEIDPE VYQALVNAKE NIESYHKHQL EAGFEDQPSE GVIRGQMIRP
INRVGVYVPG GTAAYPSSVL MNVIPAKIAG VKEIIMITPP QEHFVPAILV AAKLAGVDTI
YQVGGAQGIA ALAFGTETLP KVDKITGPGN IFVATAKKQV YGIVGIDMIA GPSEIGVIAD
SSANPSYVAA DLLSQAEHDK RARAILVTDS EALADAVESE IERQLKLLPR EAIARPSIEN
NGRIIITKDT DAMFELMNSV APEHLEIAME KAYDYLEKVE NAGSVFLGHF TSEPIGDYYA
GANHILPTTA TSRFSSALGV HDFVKRIQYT QYDKAAVNKA KHDITILAYA EGLQAHAKAI
EVRNDNN
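
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last lines of the gene sequence. This is a minimal sketch, not the method used to produce this record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and
# translation table 11, listed in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the gene sequence above. Each full line holds
# 60 bases, so every line starts in frame.
first_line = "ATGAAACGAT TAACTGGAAC TAATAAAGAA ATTGCTGAGC TTCTTTATCA GGAGCAATTG"
last_line = "GAAGTCAGAA ATGACAATAA TTGA"

print(translate(first_line))
print(translate(last_line))
```

The first translation corresponds to the first two blocks of the protein sequence, and the second to its final residues followed by the stop codon, which the protein listing omits.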