Gene Mvan_2804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_2804 
SymbolhisD 
ID4646536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp2972542 
End bp2973897 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content71% 
IMG OID639806285 
Producthistidinol dehydrogenase 
Protein accessionYP_953617 
Protein GI120403788 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAGCG TGAGTGTATC GCCAGGGCTT CTGCGGCGCA TCGATCTGCG CGGCACCACG 
CTGTCGGCGG CCCGCCTGCG CTCGGCACTG CCGCGCGGCG GTGTCGACGT CGACACCGTC
GTGCCCAAAG TCAGGCCTAT CGTCGATGCG GTCGCCGAGC GCGGAGCCGC CGCGGCGCTG
GAGTACGGTG CCGCGTTCGA CGGTATCCGG CCCGACCAGG TGCGGGTTCC CGCCGAAAGG
CTGAAGGCGG CGCTCGCCGA GCTCGATCCC GATGTCCGCA CCGCGCTGGA GGTCGCGATC
GAGCGGGCCC GCGCCGTGCA CGCCGATCAG CGCCGCACCG ACACCACGAC CACACTCGCG
CCGGGCGCCA CCGTGACCGA GCGCTGGGTG CCCGTCGAGC GGGTCGGACT CTACGTGCCC
GGCGGAAACG CCGTCTACCC GTCCAGCGTC GTGATGAACG TGGTGCCCGC CCAGACCGCG
GGCGTGGACT CGCTGGTGAT CGCCAGCCCG CCGCAGGCGG GGAATGCTGA GCCCTTCAAG
GGCCTTCCGC ATCCGACGAT CCTGGCCGCG GCGGCGCTGC TCGGTGTCGA CGAGGTCTGG
GCGGTCGGCG GGGCCCAGGC CGTCGCGTTG CTGGCCTACG GCGGCGTCGA CACCGACGGA
GCCGAACTCG CTCCGGTCGA CATGATCACC GGGCCCGGCA ACATCTACGT CACCGCCGCC
AAACGCATCT GCCGGTCGGC GGTCGGCATC GACGCCGAGG CCGGCCCCAC CGAGATCGCG
ATCCTGGCCG ACCACACCGC CGACCCGGCG CACGTCGTCG CGGACCTGAT CAGTCAGGCC
GAGCACGACG AGATGGCCGC CAGCGTCCTG GTCACCGACA GCGCCGAGTT GGCCGACGCC
ACCGACCGCG AGCTGGCCGT ACAGCTGGAG ACCACCGTGC ACCGCGAGCG GGTGACGGCC
GCGCTGGGTG GGCAGCAGTC GGCGATCGTC CTCGTCGACG ACATCGAGGC CGGGATCCGC
ACCGTGAACG CCTACGCCGC CGAGCACCTG GAGATCCAGA CCGTCGACGC GGCCGCTGTC
GCGGGCAGGA TCCGTTCTGC CGGAGCGATT TTCGTCGGTC CGTGGTCACC GGTGAGCCTC
GGTGACTACT GCGCAGGCTC GAACCATGTG CTCCCCACCG CGGGCTGCGC CCGGCATTCC
AGTGGATTGT CGGTGCAGAC CTTCCTGCGC GGCATCCACG TCGTGGACTA CACCGAGGCG
GCGCTGAAGG ACGTGTCGGG CTACGTCATC ACGCTGGCCA AGGCCGAGAA CCTGCCCAGC
CACGGCGAAG CCGTGCGCCG GAGGTTCGAG CGGTGA
 
Protein sequence
MVSVSVSPGL LRRIDLRGTT LSAARLRSAL PRGGVDVDTV VPKVRPIVDA VAERGAAAAL 
EYGAAFDGIR PDQVRVPAER LKAALAELDP DVRTALEVAI ERARAVHADQ RRTDTTTTLA
PGATVTERWV PVERVGLYVP GGNAVYPSSV VMNVVPAQTA GVDSLVIASP PQAGNAEPFK
GLPHPTILAA AALLGVDEVW AVGGAQAVAL LAYGGVDTDG AELAPVDMIT GPGNIYVTAA
KRICRSAVGI DAEAGPTEIA ILADHTADPA HVVADLISQA EHDEMAASVL VTDSAELADA
TDRELAVQLE TTVHRERVTA ALGGQQSAIV LVDDIEAGIR TVNAYAAEHL EIQTVDAAAV
AGRIRSAGAI FVGPWSPVSL GDYCAGSNHV LPTAGCARHS SGLSVQTFLR GIHVVDYTEA
ALKDVSGYVI TLAKAENLPS HGEAVRRRFE R