Gene Mjls_3079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_3079 
SymbolhisD 
ID4878792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp3218116 
End bp3219540 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content72% 
IMG OID640140379 
Producthistidinol dehydrogenase 
Protein accessionYP_001071349 
Protein GI126435658 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.141018 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATGTAT CGCCGGTGAC GATGGCCCGC ATCGACTTGC GCGGGAGGAC TCTGTCCACT 
GCGCAGTTGC GGTCGGCGCT GCCGCGTGGC GGTGTCGACG TCGACGCCGT CGTGCCGAAG
GTCCGCCCGA TCGTCGAGGC CGTCGCCGAG CGCGGCGCCG AGGCCGCACT CGAATACGGC
GAGGCGTTCG ACGGGGTCCG TCCCGCGTCG GTGCGCGTGC CTGCCGAGCG GTTGGCGTCG
GCGCTGGCCG AACTCGATCC CGACGTCCGC GCGGCGCTGC AGGTCGCGAT CGACCGGGCC
CGCGCTGTAC ACGCCGACCA GCGACGCACC GACACCACGA CCACCCTGGC CCCCGGAGCG
ACCGTCACCG AACGCTGGGT GCCGCTCGAG CGGGTCGGGC TCTACGTACC GGGCGGCAAC
GCGGTCTACC CCTCGAGTGT GGTGATGAAC GTGGTGCCCG CCCAGACCGC GGGCGTCGAC
TCGCTGGTCA TCGCCAGCCC GCCGCAGGCC GACCACGGGG GACTGCCACA CCCGACCATC
CTGGCGGCCG CCGCGCTCCT CGGCGTCGAC GAGGTGTGGG CGGTCGGCGG CGCCCAGGCC
GTCGCGCTGC TGGCCTACGG CGGCACCGAT ATCGATGGCA CTGGGGCAAG CGGAGCGCCG
GGAGATGGCA CTGGGGCAAG CGGAGCGCCG GGAGATGGCA CTGGGGCAAG CGGAGCGCCG
GGAGATGTCA CCGAGCTCGC GCCGGTGGAC ATGATCACCG GCCCCGGCAA CATCTACGTC
ACCGCCGCCA AGCGGATCTG CCGCTCACAG GTCGGCATCG ACGCCGAGGC CGGGCCGACC
GAGATCGCGG TGCTGGCCGA CCACACCGCC GATCCGGTGC ACGTCGCGGC CGACCTGATC
AGCCAGGCCG AACACGACGA GATGGCCGCC AGCGTGCTGG TGACCACGAG CCCCGACCTG
GCCGACGCCA TCGATCGCGA ACTGCGGCGC CAGCTCGAGA CGACGGTGCA CCGCGAACGC
GTCAGCACGG CACTGACCGG TGAGCAGTCC GCGATCGTGC TCGTCGACGA CCTCGACGCC
GGGGTGCGGG TCGTGAACGC CTACGCCGCA GAACATCTCG AGATCCAGAC TGTCGACGCC
GCCGAGGTGG CCGGCCGGAT CCGTTCTGCC GGTGCGATCT TCGTCGGTCC GTACGCGCCG
GTCAGCCTCG GGGACTACTG CGCCGGGTCC AACCACGTGC TGCCGACGGC GGGGTGCGCC
CGCCACTCCA GCGGTCTGTC GGTGCAGACG TTCCTGCGCG GCATCCACGT CGTCGACTAC
ACCGAGGCGG CGCTCAAGGA CGTGTCGGGA TACGTGATCA CCCTGGCGCA GGCCGAGAAC
CTGCCCGCCC ACGGCGAAGC GGTGCGACGG AGGTTCGAGT CATGA
 
Protein sequence
MNVSPVTMAR IDLRGRTLST AQLRSALPRG GVDVDAVVPK VRPIVEAVAE RGAEAALEYG 
EAFDGVRPAS VRVPAERLAS ALAELDPDVR AALQVAIDRA RAVHADQRRT DTTTTLAPGA
TVTERWVPLE RVGLYVPGGN AVYPSSVVMN VVPAQTAGVD SLVIASPPQA DHGGLPHPTI
LAAAALLGVD EVWAVGGAQA VALLAYGGTD IDGTGASGAP GDGTGASGAP GDGTGASGAP
GDVTELAPVD MITGPGNIYV TAAKRICRSQ VGIDAEAGPT EIAVLADHTA DPVHVAADLI
SQAEHDEMAA SVLVTTSPDL ADAIDRELRR QLETTVHRER VSTALTGEQS AIVLVDDLDA
GVRVVNAYAA EHLEIQTVDA AEVAGRIRSA GAIFVGPYAP VSLGDYCAGS NHVLPTAGCA
RHSSGLSVQT FLRGIHVVDY TEAALKDVSG YVITLAQAEN LPAHGEAVRR RFES