Gene Mpal_2206 details


Gene Information

Locus tag: Mpal_2206
Symbol: hisD
ID: 7270291
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2348675
End bp: 2349925
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 63%
IMG OID: 643570820
Product: histidinol dehydrogenase
Protein accession: YP_002467225
Protein GI: 219852793
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
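
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2348675, 2349925)
print(length, protein_length(length))  # 1251 416
```

Under those assumptions the computed values agree with the 1251 bp and 416 aa reported above.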


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.209889
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGAAGG CGGTGGAGAT CAATGACTGG ACTATGAAGC GGAGGGCGAC CCTCGCGCAG 
GTTCAGGATG CCGTGCAGGG GATCATCGGC GAGGTCCGGG AGTCTGGAGA CCGTGCCCTG
ATCGATCTCG CCGCCCGGTT CGATCGGGTC AAACTGGACG GAATTCGGGT CAGTGAGGAG
GAGCAGCAGG AGGCCTACGA CCTCGTCGAC GAGCAGGTGG TCGAGAGCCT TGTCGAGGCG
GCTGCCAGGA TCACGGTCTT CCATGAACTG CAGCGCCCGA AAGACCTCTG GCTCTCTGAG
GTGGAGCCGG GCATCACCCT TGGGGTCAAG ACCACGCCGC TCTCACGGGT TGGAGCCTAT
GTCCCCGGCG GGCGGGCCTC GTACCCGTCG ACGGCGTTGA TGTGCACTAT TCCCGCGAAG
GTCGCCGGGG TCGGTGAGAT CTGCTGCTGC TCGCCGCCGC CGATCCACCC GTTAACCCTG
GTTGCCCTTG ATATCGCCGG TGTCTCAGAG ATCTATCGGG CAGGCGGGGC ACAGGCGATC
GCCGCGATGG CACTCGGCAC CGAAACGATA AAACCGGTTC AGAAGATCGT CGGGCCGGGA
AACGTCTATG TGACGGCGGC TAAGATGCTC CTCCGGGAGT ACGCCGAGAT TGACTTCCCG
GCCGGTCCGA GCGAGGTCGC GATCCTGGCC GATGAGACTG CGACCCCCTC CTTCGTCGCT
GCCGATATCC TCGCCCAGGC CGAGCATGAC CCAAACGCTG CCTGTCTTCT GATCACCACA
GACCCGACCC TGGCCCGGGA GGTCGGTGAG GAGGTCGGTC GGCAACTACT GATGGCCCCA
CGGAAGGCGA TCATCGAGCA GTCCCTCAAC AACGCTGGTT ACCTGATCGC CAGCGATCTG
GATGTAGCAA TCGAGGCCGT CAACACGGTC GCCCCCGAGC ACCTCTCGAT CCAGGTCGCC
GACCCCCTCT CCGCCCTTGG CTCGATTCGA AATGCGGGCT CGATCTTCAT CGGCCCGTAC
GCCCCGGTGG CCTGTGGGGA TTACGCGTCC GGTACCAACC ATGTACTGCC GACGGCAGGT
TATGCAGCCC GTTTTTCAGG GCTCGATGTG AATCACTTCT GCAAGACATC GACCGTACAG
ATGATCAGCA GGCGCGGGCT TGAGACGATT GGAGACGTGG TCGAGACGAT CGCCGAGGCT
GAAGGACTCT CTGCCCACGC CGAGTCAGTT CGGGTGCGGC GCAGATCCTG A
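
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs cleaning before any analysis. A minimal sketch of that step, together with a GC fraction calculation of the kind summarized by the GC content field; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first block of the sequence.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(blocks: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(blocks)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First ten-base block of the gene sequence: five G, no C.
print(gc_fraction("ATGTGGAAGG"))  # 0.5
```

Passing the full gene sequence text to `gc_fraction` would give a value to compare against the reported GC content.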
 
Protein sequence
MWKAVEINDW TMKRRATLAQ VQDAVQGIIG EVRESGDRAL IDLAARFDRV KLDGIRVSEE 
EQQEAYDLVD EQVVESLVEA AARITVFHEL QRPKDLWLSE VEPGITLGVK TTPLSRVGAY
VPGGRASYPS TALMCTIPAK VAGVGEICCC SPPPIHPLTL VALDIAGVSE IYRAGGAQAI
AAMALGTETI KPVQKIVGPG NVYVTAAKML LREYAEIDFP AGPSEVAILA DETATPSFVA
ADILAQAEHD PNAACLLITT DPTLAREVGE EVGRQLLMAP RKAIIEQSLN NAGYLIASDL
DVAIEAVNTV APEHLSIQVA DPLSALGSIR NAGSIFIGPY APVACGDYAS GTNHVLPTAG
YAARFSGLDV NHFCKTSTVQ MISRRGLETI GDVVETIAEA EGLSAHAESV RVRRRS
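
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon to its amino acid using the NCBI TCAG ordering.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(blocks: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    dna = "".join(blocks.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, first three blocks.
print(translate("ATGTGGAAGG CGGTGGAGAT CAATGACTGG"))  # MWKAVEINDW
```

The result matches the first block of the protein sequence, and the final codons of the gene (`AGA TCC TGA`) translate to `RS` followed by a stop, consistent with the protein ending in `RRRS`.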