Gene GYMC61_3156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_3156 
SymbolhisD 
ID8527044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp3204237 
End bp3205511 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content60% 
IMG OID 
Producthistidinol dehydrogenase 
Protein accessionYP_003254195 
Protein GI261420513 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCG AACGGATCCG AGGCGGCGTT TCGCTGCGGC GCACGATTGA AAGCGGAACG 
GAGGAGCAGC GGCGCGTGGT GCTGGATATC ATTTCCAACG TGCGCGCCCG CGGCGATGAA
GCGCTGAAAG AATACACGGA ACGGTTTGAC GGCGTCAAGC TGGATTCGCT CAAGGTGACG
GAAGAGGAAA TGAAGCGCGC GCATGCGGCG ATGGACGCGG AGATGCTGGA GATCATTCGC
CAAGCGGCGG CGAACATTCG CGACTACCAT GAGCGGCAAA AGCGCGAATC ATGGTGGATG
ACGAAAGAAG ACGGCACAAT TCTCGGACAA AAGGTGACGC CGCTCGATGC GGTCGGGTTG
TACGTGCCAG GCGGGACGGC CGCTTATCCG TCGTCTGTGC TGATGAACGT TATTCCCGCA
CAAGTGGCGG GGGTGAAACG GATTGTCATC ACCTCGCCGC CAAACAAAGA CGGCACGCTC
CCGGCTGGGG TGCTGGCCGC CGCCTATGAA CTCGGCGTGA CGGAAATTTA CAAAGTCGGC
GGCGCGCAGG CGATCGCCGC GCTTGCTTAC GGGACGGAAA CGATTCGGCC GGTTGACAAA
ATTTTCGGGC CGGGCAATAT TTATGTGGCA TTGGCGAAGC GGGAAGTGTT CGGGCATGTG
GCGATCGACA TGATCGCGGG GCCGAGCGAA ATTGTCGTGC TGGCGGATGA AACGGCCCGA
CCGGATGAGA TTGCGGCGGA TTTGTTGTCG CAAGCCGAGC ATGACGTGCG GGCGTCGGCC
ATTTTGGTGA CGCCGTCGAT GAAATTGGCG CTGGCGGTGG CGAGCGAAGT CGAACGGCAG
CTTGAAACGC TGCCGCGCCG CGACATTGCC CAAGCGGCGC TTGAGAACTA CGGCGCCATT
TACGTCACCG AGACGCTTGA GGAAGCGGTG GATGTTGTGA ACGAACTGGC GCCGGAGCAT
TTGGAAGTGA TGACGGCAGA ACCGCTCGCG CTTTTCGGCC GGCTCCGCCA TGCGGGAGCG
ATGTTTTTCG GCCGCTTCAG CTCCGAGCCG GTCGGCGACT ATTTCGCCGG GCCGAACCAC
GTGCTGCCGA CGAACGGTAC GGCAAGGTTT TCAAGCGGTC TCGGCGTCGA TGAGTTTGTG
AAAAAATCAA GCGTGATTGT TTACAGTGAA GCCGCATTGA AACAACATGG AGAAAAAATC
GCCGCCTTTG CCCGCCTCGA GGGGCTGGAG GCGCACGCGC GCGCCATTGA GGTGCGGCTC
GAGAAAGGGG AATGA
 
Protein sequence
MKIERIRGGV SLRRTIESGT EEQRRVVLDI ISNVRARGDE ALKEYTERFD GVKLDSLKVT 
EEEMKRAHAA MDAEMLEIIR QAAANIRDYH ERQKRESWWM TKEDGTILGQ KVTPLDAVGL
YVPGGTAAYP SSVLMNVIPA QVAGVKRIVI TSPPNKDGTL PAGVLAAAYE LGVTEIYKVG
GAQAIAALAY GTETIRPVDK IFGPGNIYVA LAKREVFGHV AIDMIAGPSE IVVLADETAR
PDEIAADLLS QAEHDVRASA ILVTPSMKLA LAVASEVERQ LETLPRRDIA QAALENYGAI
YVTETLEEAV DVVNELAPEH LEVMTAEPLA LFGRLRHAGA MFFGRFSSEP VGDYFAGPNH
VLPTNGTARF SSGLGVDEFV KKSSVIVYSE AALKQHGEKI AAFARLEGLE AHARAIEVRL
EKGE