Gene Mboo_2149 details

Gene Information

Locus tag: Mboo_2149
Symbol: hisD
ID: 5410125
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2220451
End bp: 2221731
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 59%
IMG OID: 640869394
Product: histidinol dehydrogenase
Protein accession: YP_001405306
Protein GI: 154151688
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
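
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `cds_span` and `protein_length_from_cds` are hypothetical helpers introduced for illustration.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Protein length in residues, assuming one terminal stop codon is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
span = cds_span(2220451, 2221731)
print(span)                           # 1281, matches Gene Length
print(protein_length_from_cds(span))  # 426, matches Protein Length
```

Under these assumptions the listed Gene Length (1281 bp) and Protein Length (426 aa) are consistent with the listed coordinates.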


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.547932
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGAGCG CGGTAGATGT CGAGACGTGG GTTTCGCGGC GTACATCCCG GCTGGACGAG 
GTTCACGGCC CGGTCCAGGA AATCATCGGG AAGGTAAAAT CGGGTGGAGA TGCCGCACTC
ATCGAGCTTT CCGAAAAATT TGACAAAGTT CACCTGGATT CGGTTGCGGT TGACGAGGAT
GCCCGGGAGG CGGCCTACGA TCAGGTCGAT GCCAAAGTTA CCGAATGCCT TGTGGAGGCT
GAAGCCAGGA TCAGCAGGTT TCACGAACTC CAGCTCCCCC GGGGCCTCTG GCTTTCCGAG
GTAGAGCCCG GGATCACGCT CGGCGTGAAG ACCACTCCCC TTTCACGGGT CGGTGCGTAT
GTGCCCGGGG GGCGTGCCGC GTATCCCTCA ACGGCCCTTA TGTGCACGAT CCCGGCGCGA
ATCGCCGGAG TCCCGGAGAT CTGTTGCTGC TCACCACCCC CGATTCAGCC GCTCACCCTT
GTCGCGCTCG ATATTGCCGG GGTCGAAGAG ATATACCAGT GCGGCGGGGC CCAGGCAATT
GCGGCCATGG CTCTAGGAAC CGAGAGTATC GAGCCGGTGG AAAAGATCGT GGGACCGGGG
AATGTGTATG TCACCGCGGC AAAGATGCTG CTGCGGGAAT ATGCCGAGAT CGATTTCCCG
GCCGGGCCAA GCGAAATTGC GGTCATTGCT GATGAAAATG CCGTGCCTTC CTATGTTGCG
GCAGATATCC TTGCACAGGC CGAACACGAT CCGCATGCTG CATGTGTGCT CATCACTACC
TCTGCCGCAT TTGCCGACGA GGTTGGCGCC GAGATCAAAA AACAGGCCGA AAACGCCCCG
CGCAAGGAAA TAATGGCACA GGCGCTTAAG AACTCGGGAT ATATCCTGGC CGGGGATCTT
GATGAGGCCG TTGCGATCTC AAACGCGGTT GCCCCGGAAC ACCTCTCGAT TCAGGTTGCC
GACCCGCTCC CGGTGCTTGG CGGGATCCGG AATGCGGGCT CCATCTTTGT CGGTCCCTAT
ACGCCGGTTG CATGCGGTGA CTATGCATCA GGGACCAACC ACGTCCTCCC GACTGCTGGA
TATGCACGTC AGTACTCGGG TCTCAATGTA CATCATTTCT GTAAAACATC ATCAGTCCAG
ATGCTGTCCC GGGAGGGGCT TGAAAGCATT GGGGATATTA TCGAAACCCT TGCAACCGCT
GAGGGGCTTG CCGCTCACGC ACAGTCTGTG GATGTGCGGC TCAGGAGTGC CAAACCCGAC
TGTAAGGCAC CCCTTACCTG A
 
Protein sequence
MWSAVDVETW VSRRTSRLDE VHGPVQEIIG KVKSGGDAAL IELSEKFDKV HLDSVAVDED 
AREAAYDQVD AKVTECLVEA EARISRFHEL QLPRGLWLSE VEPGITLGVK TTPLSRVGAY
VPGGRAAYPS TALMCTIPAR IAGVPEICCC SPPPIQPLTL VALDIAGVEE IYQCGGAQAI
AAMALGTESI EPVEKIVGPG NVYVTAAKML LREYAEIDFP AGPSEIAVIA DENAVPSYVA
ADILAQAEHD PHAACVLITT SAAFADEVGA EIKKQAENAP RKEIMAQALK NSGYILAGDL
DEAVAISNAV APEHLSIQVA DPLPVLGGIR NAGSIFVGPY TPVACGDYAS GTNHVLPTAG
YARQYSGLNV HHFCKTSSVQ MLSREGLESI GDIIETLATA EGLAAHAQSV DVRLRSAKPD
CKAPLT
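
The sequences above are printed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any computation. The following is a minimal Python sketch of that cleanup and of a GC-fraction calculation, not part of the original record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not state how its own GC content figure was computed, so the sketch may not match that method exactly.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases among all bases in the cleaned sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence listed above, used as a small demo.
first_line = "ATGTGGAGCG CGGTAGATGT CGAGACGTGG GTTTCGCGGC GTACATCCCG GCTGGACGAG"
last_line = "TGTAAGGCAC CCCTTACCTG A"

print(len(clean_sequence(first_line)))  # 60
print(clean_sequence(last_line)[-3:])   # TGA
```

Passing the full gene sequence listing to `gc_fraction` returns a value between 0 and 1 that can be compared with the GC content field.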