Gene B21_01908 details


Gene Information

Locus tag: B21_01908
Symbol: hisD
ID: 8116338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 1986213
End bp: 1987517
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 58%
IMG OID: 644848123
Product: hypothetical protein
Protein accession: YP_002999696
Protein GI: 251785392
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
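
The start and end coordinates, gene length, and protein length listed above are mutually consistent, and a small sketch can check this. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1986213, 1987517)
print(length)                  # 1305
print(protein_length(length))  # 434
```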


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTTTA ACACAATCAT TGACTGGAAT AGCTGTACTG CGGAGCAACA ACGCCAGCTG 
TTAATGCGCC CGGCGATCTC CGCTTCTGAA AGCATTACCC GCACTGTTAA CGATATTCTC
GATAACGTGA AAACGCGTGG CGATGAGGCC CTTCGGGAAT ACAGCGCGAA GTTTGATAAA
ACCACGGTTA CCGCGCTGAA GGTGTCTGCT GAGGAGATCG CCGCCGCCAG CGAACGCCTG
AGCGACGAGC TAAAACAGGC GATGGCGGTG GCAGTAAAGA ATATTGAAAC CTTCCACACT
GCGCAAAAAC TGCCGCCGGT AGATGTAGAA ACGCAGCCAG GCGTACGTTG CCAGCAAGTC
ACGCGTCCGG TAGCTTCAGT TGGGTTGTAT ATTCCTGGCG GCTCCGCCCC GCTCTTCTCA
ACGGTATTAA TGCTGGCAAC TCCGGCGCGT ATTGCGGGCT GTAAAAAAGT GGTGTTGTGC
TCACCGCCGC CGATTGCCGA TGAGATCCTT TATGCGGCGC AGCTGTGCGG TGTGCAGGAC
GTGTTTAACG TCGGCGGCGC ACAGGCCATT GCCGCGCTGG CGTTTGGTAC GGAATCTGTG
CCGAAAGTGG ACAAAATCTT CGGGCCGGGT AACGCCTTTG TCACCGAAGC AAAACGCCAG
GTAAGCCAGC GTCTGGACGG TGCGGCGATC GATATGCCCG CAGGCCCGTC GGAAGTGCTG
GTGATTGCTG ACAGCGGCGC TACGCCGGAT TTCGTGGCTT CTGATTTGCT TTCTCAGGCT
GAACACGGCC CGGACTCACA GGTGATTTTA CTGACGCCCG ACGCCGATAT GGCGCGTCGC
GTTGCCGAGG CTGTCGAACG CCAACTGGCA GAACTGCCGC GAGCTGAAAC CGCCCGCCAG
GCACTGAACG CCAGCCGCCT GATCGTGACT AAAGATTTAG CGCAGTGCGT AGAGATCTCC
AACCAGTACG GCCCGGAGCA CCTGATCATT CAGACCCGCA ACGCCCGCGA ACTGGTCGAT
GGCATCACCA GCGCCGGTTC GGTATTTCTT GGTGACTGGT CACCGGAATC GGCAGGCGAC
TATGCCTCCG GCACCAACCA CGTTCTGCCG ACTTACGGTT ACACCGCCAC CTGTTCCAGC
CTCGGGCTGG CGGATTTCCA GAAGCGCATG ACCGTGCAGG AACTGTCGAA AGTAGGTTTC
TCCGCTCTGG CGTCGACCAT TGAAACACTG GCCGCCGCCG AGCGCCTGAC CGCCCACAAA
AATGCCGTTA CTTTGCGTGT TAACGCCCTT AAGGAGCAAG CATGA
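
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. The following is a minimal sketch, with hypothetical helper names, that strips the layout whitespace and computes the G+C fraction. Only the first line of the sequence is used here as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as sample input.
first_line = "ATGAGCTTTA ACACAATCAT TGACTGGAAT AGCTGTACTG CGGAGCAACA ACGCCAGCTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2f}")
```

Running the same two functions on the full block gives a value that can be compared with the reported GC content.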
 
Protein sequence
MSFNTIIDWN SCTAEQQRQL LMRPAISASE SITRTVNDIL DNVKTRGDEA LREYSAKFDK 
TTVTALKVSA EEIAAASERL SDELKQAMAV AVKNIETFHT AQKLPPVDVE TQPGVRCQQV
TRPVASVGLY IPGGSAPLFS TVLMLATPAR IAGCKKVVLC SPPPIADEIL YAAQLCGVQD
VFNVGGAQAI AALAFGTESV PKVDKIFGPG NAFVTEAKRQ VSQRLDGAAI DMPAGPSEVL
VIADSGATPD FVASDLLSQA EHGPDSQVIL LTPDADMARR VAEAVERQLA ELPRAETARQ
ALNASRLIVT KDLAQCVEIS NQYGPEHLII QTRNARELVD GITSAGSVFL GDWSPESAGD
YASGTNHVLP TYGYTATCSS LGLADFQKRM TVQELSKVGF SALASTIETL AAAERLTAHK
NAVTLRVNAL KEQA
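
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. It uses the standard amino acid assignments, which translation table 11 shares, and it ignores the alternative start codons of table 11 because this gene begins with ATG. All names are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order (codons enumerated with T, C, A, G at each position).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace from the blocked layout is removed first. Alternative start
    codons are not converted to M (a simplification of this sketch).
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence above.
print(translate("ATGAGCTTTA ACACAATCAT TGACTGGAAT AGCTGTACTG CGGAGCAACA ACGCCAGCTG"))
# MSFNTIIDWNSCTAEQQRQL
print(translate("AATGCCGTTA CTTTGCGTGT TAACGCCCTT AAGGAGCAAG CATGA"))
# NAVTLRVNALKEQA
```

Both outputs match the corresponding ends of the protein sequence listed above.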