Gene GWCH70_2979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2979 
SymbolhisD 
ID7977278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2999898 
End bp3001166 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content52% 
IMG OID644799779 
Producthistidinol dehydrogenase 
Protein accessionYP_002950918 
Protein GI239828294 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0128196 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAATCG AACGAGTGAC AAACCGTGTA TCATTGCGCC GTACGATTGA ATCAGGGACG 
GAAGAACAGC GCAGTAAGGT TCTGGAGATT ATTGCCGATG TGCGCGCTCG CGGTGATGAA
GCGCTAAAAA GTTATACGGA AAAATTCGAT GGCGTCCGCC TTGATTCGCT GTGCGTGACA
AACGAAGAAA TAGAAAGAGC GTATCAGAAC GTAAGCGCAG AAGTGCTTCG GATCATCCAA
GAGGCGGCGG AAAACATTCG CGATTATCAT GAGCGTCAAA AGAGAGAGTC ATGGATCATG
ACAAAAGAAG ACGGCACGAT GCTTGGTCAG AAAATAACGC CGCTTGATGC GGTTGGATTG
TACGTTCCAG GAGGGACAGC CGCCTACCCG TCATCGGTGC TTATGAATGT CATTCCTGCC
CAAGTGGCAG GGGTCAAACG AATTGTGATC ACCTCTCCGC CAAATAAAGA CGGAACGCTT
CCGGCCGGCG TGTTAGTGGC GGCGAACGAA TTAGGAGTGA AAGAAATCTA TAAAGTCGGC
GGTGCGCAGG CGATTGCCGC GCTTGCATAC GGAACGGAGA CGATTCGTCC GGTCGATAAA
ATTTTCGGGC CAGGCAACAT TTACGTGGCG CTCGCGAAAC GCGAAGTGTT CGGGCAAGTC
GCGATTGATA TGATTGCCGG ACCGAGCGAA ATCGTCGTGT TGGCGGATGA AACGGCAAAG
GCGAACGAAA TTGCCGCTGA TTTGTTGTCG CAAGCCGAGC ACGATGAACG CGCTTCCGCG
ATTCTCGTTA CTCCATCGAT GAAATTGGCG CTTGCAGTCG CACGCGAGGT CGAAAAACAG
CTGGAGACGC TGCCGCGCAA AGCGATTGCC TCTGCGTCGC TCGAGAACTA CGGAGCCATT
TACGTCACAG AAACGCTTGC GGAAGCGGTC GAAGTTGTGA ATGAATTAGC ACCGGAGCAT
TTGGAAGTAA TGACAGCCGA ACCGATGCAA CTCCTTGGTC AAATCCGCCA TGCGGGAGCG
ATTTTTTTAG GGCGCTTCAG CTCGGAGCCG GTTGGCGATT ACTTCGCCGG TCCAAACCAT
GTGCTGCCGA CGAATGGTAC AGCCCGGTTT TCGAGCGGAT TAAGCGTGGA TGAATTTGTG
AAAAAATCGA GCATCATTTT TTACAGCGAG CCGGCGTTAA AGCAAAACGC GGAAAAAATC
GCGGCGTTTG CCAGACTTGA AGGGCTTGAA GCACATGCGC GCGCCGTTGA AGAACGTTTT
AAAAAATAA
 
Protein sequence
MKIERVTNRV SLRRTIESGT EEQRSKVLEI IADVRARGDE ALKSYTEKFD GVRLDSLCVT 
NEEIERAYQN VSAEVLRIIQ EAAENIRDYH ERQKRESWIM TKEDGTMLGQ KITPLDAVGL
YVPGGTAAYP SSVLMNVIPA QVAGVKRIVI TSPPNKDGTL PAGVLVAANE LGVKEIYKVG
GAQAIAALAY GTETIRPVDK IFGPGNIYVA LAKREVFGQV AIDMIAGPSE IVVLADETAK
ANEIAADLLS QAEHDERASA ILVTPSMKLA LAVAREVEKQ LETLPRKAIA SASLENYGAI
YVTETLAEAV EVVNELAPEH LEVMTAEPMQ LLGQIRHAGA IFLGRFSSEP VGDYFAGPNH
VLPTNGTARF SSGLSVDEFV KKSSIIFYSE PALKQNAEKI AAFARLEGLE AHARAVEERF
KK