Gene Mpe_A0831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0831 
SymbolhisD 
ID4786966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp871088 
End bp872431 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content71% 
IMG OID640089392 
Producthistidinol dehydrogenase 
Protein accessionYP_001020028 
Protein GI124266024 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.685549 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCC CCGCCCGCGT GAACCTGCGC CGCCTGGCCA CCACGCAGCC CGATTTCGAG 
GCGGCATTCC GGGCCGTGCA GCACTGGTCG GCGGAGACCG ACGACGCGAT CGAGGGCCGT
GTGGCGGAGA TCCTGGCCGA CGTGCGGCAG CGCGGCGACG CCGCGGTCCT GGAGTACACC
GCGCGCTTCG ACGCACTGCC GGTCGGCTCG ATGGGCGAGC TCGAGCTGAC GCAGGCCGAT
TTCGCCGCCG CCTTCGCGGC CATCACGCCG GCGCAGCGCG ACGCGCTGCA GCAGGCCACG
CGCCGCATCC GCAGCTACCA CGAGCGCCAG CTCGACGCCT GCGGCCGCAG CTGGAGCTAC
CGCGACGAGG ACGGCACGCT GCTGGGCCAG AAGGTCACGC CGTTGGACCG CGTGGGCATC
TACGTGCCGG GTGGCAAGGC GGCCTATCCG TCCAGCCTGC TGATGAATGC GGTGCCGGCC
CATGTGGCCG GCGTGCCCGA GATCGTGATG GTGGTGCCCA CGCCGAAGGG CGAGCGCAAC
ACCCTGGTGC TGGCCGCCGC GCACGTGGCG GGCGTGACGC GCGCCTTCAC GATCGGCGGC
GCGCAGGCGG TGGCCGCACT CGCCTACGGC ACGGCCACGG TGCCGCAGGT CGACAAGATC
ACCGGCCCCG GCAACGCCTA TGTGGCGAGC GCGAAGAAGC GCGTGTTCGG CACGGTGGGC
ATCGACATGA TCGCCGGACC CAGCGAAATT CTGGTGCTGG CCGACGGCAG CACGCCGGCC
GACTGGGTGG CGATGGACCT GTTCAGCCAG GCCGAGCACG ACGAGCTGGC GCAGAGCATC
CTGCTGTGCC CCGATGCGGC CTATCTCGAC GCAGTACAGG CAGAGATCAA CCGGCTGCTG
CCCGGCATGC CGCGCGCTGC CATCATCCGC GCTTCGCTGG AAGGGCGCGG CGCGTTGATC
CACACCCGCT CGATGGAAGA GGCCTGCGAG ATCAGCAACC GCATCGCGCC CGAGCACCTG
GAAGTGAGCT CGAACGAGCC GCACCGCTGG GAGCCGCTGC TGCGCCACGC CGGCGCGATC
TTCCTCGGTG CCTACACCAG CGAATCGCTG GGCGACTACT GTGCCGGGCC GAACCACGTG
CTGCCGACCT CGGGCACCGC GCGCTTCAGC TCGCCGCTGG GCGTCTACGA CTTCCAGAAG
CGCAGCAGCC TGATCGAGGT CAGCGAGCAG GGCGCGCAGA CGCTGGGCGT GATCGCCGCC
GAGCTGGCCT ACGGCGAGGG CCTGCAGGCG CACGCGCAGG CGGCCGAGAT GCGGCTGGCG
AAGAAGCCGA GAGCGGCACG ATGA
 
Protein sequence
MSAPARVNLR RLATTQPDFE AAFRAVQHWS AETDDAIEGR VAEILADVRQ RGDAAVLEYT 
ARFDALPVGS MGELELTQAD FAAAFAAITP AQRDALQQAT RRIRSYHERQ LDACGRSWSY
RDEDGTLLGQ KVTPLDRVGI YVPGGKAAYP SSLLMNAVPA HVAGVPEIVM VVPTPKGERN
TLVLAAAHVA GVTRAFTIGG AQAVAALAYG TATVPQVDKI TGPGNAYVAS AKKRVFGTVG
IDMIAGPSEI LVLADGSTPA DWVAMDLFSQ AEHDELAQSI LLCPDAAYLD AVQAEINRLL
PGMPRAAIIR ASLEGRGALI HTRSMEEACE ISNRIAPEHL EVSSNEPHRW EPLLRHAGAI
FLGAYTSESL GDYCAGPNHV LPTSGTARFS SPLGVYDFQK RSSLIEVSEQ GAQTLGVIAA
ELAYGEGLQA HAQAAEMRLA KKPRAAR