Gene ECD_01922 details

Gene Information

Locus tag: ECD_01922
Symbol: hisD
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 1986963
End bp: 1988204
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 59%
IMG OID:
Product: histidinol dehydrogenase
Protein accession: ACT43773
Protein GI: 253978103
COG category:
COG ID:
TIGRFAM ID:
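
The coordinates and lengths listed in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not contribute a residue. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record; coordinates are assumed
# to be 1-based and inclusive (an assumption, not stated in the record).
start_bp = 1986963
end_bp = 1988204

# Inclusive span on the replicon.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1242 bp) and Protein Length (413 aa) fields.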


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCCGG CGATCTCCGC TTCTGAAAGC ATTACCCGCA CTGTTAACGA TATTCTCGAT 
AACGTGAAAA CGCGTGGCGA TGAGGCCCTT CGGGAATACA GCGCGAAGTT TGATAAAACC
ACGGTTACCG CGCTGAAGGT GTCTGCTGAG GAGATCGCCG CCGCCAGCGA ACGCCTGAGC
GACGAGCTAA AACAGGCGAT GGCGGTGGCA GTAAAGAATA TTGAAACCTT CCACACTGCG
CAAAAACTGC CGCCGGTAGA TGTAGAAACG CAGCCAGGCG TACGTTGCCA GCAAGTCACG
CGTCCGGTAG CTTCAGTTGG GTTGTATATT CCTGGCGGCT CCGCCCCGCT CTTCTCAACG
GTATTAATGC TGGCAACTCC GGCGCGTATT GCGGGCTGTA AAAAAGTGGT GTTGTGCTCA
CCGCCGCCGA TTGCCGATGA GATCCTTTAT GCGGCGCAGC TGTGCGGTGT GCAGGACGTG
TTTAACGTCG GCGGCGCACA GGCCATTGCC GCGCTGGCGT TTGGTACGGA ATCTGTGCCG
AAAGTGGACA AAATCTTCGG GCCGGGTAAC GCCTTTGTCA CCGAAGCAAA ACGCCAGGTA
AGCCAGCGTC TGGACGGTGC GGCGATCGAT ATGCCCGCAG GCCCGTCGGA AGTGCTGGTG
ATTGCTGACA GCGGCGCTAC GCCGGATTTC GTGGCTTCTG ATTTGCTTTC TCAGGCTGAA
CACGGCCCGG ACTCACAGGT GATTTTACTG ACGCCCGACG CCGATATGGC GCGTCGCGTT
GCCGAGGCTG TCGAACGCCA ACTGGCAGAA CTGCCGCGAG CTGAAACCGC CCGCCAGGCA
CTGAACGCCA GCCGCCTGAT CGTGACTAAA GATTTAGCGC AGTGCGTAGA GATCTCCAAC
CAGTACGGCC CGGAGCACCT GATCATTCAG ACCCGCAACG CCCGCGAACT GGTCGATGGC
ATCACCAGCG CCGGTTCGGT ATTTCTTGGT GACTGGTCAC CGGAATCGGC AGGCGACTAT
GCCTCCGGCA CCAACCACGT TCTGCCGACT TACGGTTACA CCGCCACCTG TTCCAGCCTC
GGGCTGGCGG ATTTCCAGAA GCGCATGACC GTGCAGGAAC TGTCGAAAGT AGGTTTCTCC
GCTCTGGCGT CGACCATTGA AACACTGGCC GCCGCCGAGC GCCTGACCGC CCACAAAAAT
GCCGTTACTT TGCGTGTTAA CGCCCTTAAG GAGCAAGCAT GA
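
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any computation. As a minimal sketch, the hypothetical helpers below (`clean_sequence` and `gc_percent` are illustrative names, not part of the record) join the blocks and compute a GC percentage. How the record's own GC content figure was rounded is not stated, so no exact match is claimed here.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCGCCCGG CGATCTCCGC TTCTGAAAGC ATTACCCGCA CTGTTAACGA TATTCTCGAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence instead of `first_line` would give a value to compare against the GC content field.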
 
Protein sequence
MRPAISASES ITRTVNDILD NVKTRGDEAL REYSAKFDKT TVTALKVSAE EIAAASERLS 
DELKQAMAVA VKNIETFHTA QKLPPVDVET QPGVRCQQVT RPVASVGLYI PGGSAPLFST
VLMLATPARI AGCKKVVLCS PPPIADEILY AAQLCGVQDV FNVGGAQAIA ALAFGTESVP
KVDKIFGPGN AFVTEAKRQV SQRLDGAAID MPAGPSEVLV IADSGATPDF VASDLLSQAE
HGPDSQVILL TPDADMARRV AEAVERQLAE LPRAETARQA LNASRLIVTK DLAQCVEISN
QYGPEHLIIQ TRNARELVDG ITSAGSVFLG DWSPESAGDY ASGTNHVLPT YGYTATCSSL
GLADFQKRMT VQELSKVGFS ALASTIETLA AAERLTAHKN AVTLRVNALK EQA
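
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration, not the method used to produce this record. It builds the codon table from the compact amino-acid string of the standard genetic code, whose codon assignments translation table 11 shares (table 11 differs in its permitted start codons, which is not an issue here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGCGCCCGG CGATCTCCGC TTCTGAAAGC"))
```

The first 30 bases translate to MRPAISASES, which matches the first block of the protein sequence. The last twelve bases (GAGCAAGCAT GA) translate to EQA followed by a stop codon, which matches its final three residues.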