Gene Mbur_1103 details


Gene Information

Locus tag: Mbur_1103
Symbol: hisD
ID: 3997710
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanococcoides burtonii DSM 6242
Kingdom: Archaea
Replicon accession: NC_007955
Strand:
Start bp: 1184434
End bp: 1185714
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 47%
IMG OID: 637958873
Product: histidinol dehydrogenase
Protein accession: YP_565781
Protein GI: 91773089
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
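
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(1184434, 1185714)
print(length_bp, protein_length(length_bp))  # 1281 426
```

Both results agree with the Gene Length and Protein Length fields of the record.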


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000289644
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGTATA AACACCTATC GGAAGTCACT GACGAAGAGA TGGAGAAGCT CCTCTGCCGC 
ACTGGAGATA TGATGGACGT TGCTGGTACT GTTTCATCAA TTCTAAAGGA TGTGCAGGAG
AAAGGCGATG AAGGCCTGCG TGAATATACT TTGAAGTTTG ACAAGGCCGA TATTCATACC
ATTGAGGTAA GTTCTGAGGA GATCGAGCAG GCAATGAAAG AGGTCGATGC AGAACTTCTC
AGACATCTTA AGATCGCAGC GGAGAACATA AGGAACTTCC ACACAGCACA ACTTCCACAG
AAGACCTGGT ACATTGAACC GTCTCCCGGG ATCAAACTTG GTCAGATGGC CACTCCACTA
GCATCGGTAG GTGCCTATGT TCCGGGTGGT CGTGCTTCTT ATCCTTCAAC TGCACTTATG
ACCATAATCC CTGCAAAGGT CGCAGGGGTA AAGAATGTTG TGATGTGCAC TCCTCCCGGT
TCTGATGGGA AGGTAAATCC ACTTACTCTT GTAGCAGGCA AAGTGGCTGG TGCAGATCAT
ATCTATAAGG TCGGTGGAGT TCAGGCGATC GCAGCAATGG CCTACGGTAC TGAATCAGTA
TTAAGCGTCG ACAAGATCGT TGGACCTGGG AATGTCTTTG TGACCGTTGC CAAGATGATG
GTTCGGGATA AAGCGGAGAT CGATTTCCCC GCAGGTCCAA GTGAAGTGCT TATTATTGCA
GATGATTCTG CGGATGCTGC TATGATCGCA TCTGATATTC TTGCACAGGC AGAGCACGAT
CCAAAATCGG TGTCAGTTCT TGTGACAACA TCCTCTGAAC TGGCAGAACA GACCAACAAC
GAGGTCAAAA AGCAGGCTGC TATTGCTGTG CGCAGGGAGA TCATTGAGCA ATCTCTTGAG
AATGCAGCTA TTCTCGTGAC AGATTCCATG GATGAATGTA TCAGGATCTC CAATGATTTT
GCACCGGAAC ACCTCGAGAT CATGGTTGAA GATGATGATG CGGTTCTCGA TATGATCGAG
AATGCAGGTT CTATCTTTGT CGGTAATTAT GCTCCGGTTG CAGCAGGTGA TTATGCATCG
GGTACAAACC ATGTACTTCC AACAGCCGGT TATCCTAAGA TATATTCAGG TTTGAATATT
CACCATTTCC TGAAATATTC CACCATACAG AAGATCACGA AGGAAGGCCT AAGCTCCATC
GGAGATACGA TAATAGCCCT TGCTGAAAAA GAGGGTTTAC AGGCACATGC TGATTCTGTT
AAGTCTCGTC TTATGTCTTG A
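
The gene sequence is printed in blocks of ten bases, so it needs to be cleaned before its length or GC content can be checked against the fields in Gene Information. The sketch below shows one way to do that, assuming the sequence text has been pasted into a Python string. The helper names `clean_sequence` and `gc_content` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short example.
first_line = "ATGCTGTATA AACACCTATC GGAAGTCACT GACGAAGAGA TGGAGAAGCT CCTCTGCCGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 2))
```

Running the same two functions on the full sequence gives values to compare with the reported Gene Length and GC content.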
 
Protein sequence
MLYKHLSEVT DEEMEKLLCR TGDMMDVAGT VSSILKDVQE KGDEGLREYT LKFDKADIHT 
IEVSSEEIEQ AMKEVDAELL RHLKIAAENI RNFHTAQLPQ KTWYIEPSPG IKLGQMATPL
ASVGAYVPGG RASYPSTALM TIIPAKVAGV KNVVMCTPPG SDGKVNPLTL VAGKVAGADH
IYKVGGVQAI AAMAYGTESV LSVDKIVGPG NVFVTVAKMM VRDKAEIDFP AGPSEVLIIA
DDSADAAMIA SDILAQAEHD PKSVSVLVTT SSELAEQTNN EVKKQAAIAV RREIIEQSLE
NAAILVTDSM DECIRISNDF APEHLEIMVE DDDAVLDMIE NAGSIFVGNY APVAAGDYAS
GTNHVLPTAG YPKIYSGLNI HHFLKYSTIQ KITKEGLSSI GDTIIALAEK EGLQAHADSV
KSRLMS
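
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in which codons may act as start codons, which this sketch does not model). The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGCTGTATA AACACCTATC GGAAGTCACT"))  # MLYKHLSEVT
print(translate("AAGTCTCGTC TTATGTCTTG A"))           # KSRLMS
```

The two outputs match the first ten and last six residues of the protein sequence in this record.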