Gene TM1040_3563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3563 
SymbolhisD 
ID4075484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp604396 
End bp605700 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content61% 
IMG OID638005076 
Producthistidinol dehydrogenase 
Protein accessionYP_611794 
Protein GI99078536 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0867737 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTAT TTCTGACTTC TGCTGAGGCA GACTTCGAAC AATCCTTCAC CACCCTCTTG 
AACGCAAAAC GAGAGGACAG CCCAGATGTG GACGCCGTCG TCGCGGACAT TATTGCGGAT
GTGCGGGGAC GCGGAGATGC GGCTCTGCTG GAGCTGACGC AGAAATTTGA CCGCCTAGAC
CTGCCCGATT CGGCAGCGCT GAGAATCACT GCAGAGGAAG TCGACGACGC CATCAAATCC
GTGTCAGAGG CCGAGCGCGC AGCACTTGAA CTCGCAGCGG ACCGTATTCG TGCCTATCAC
GCTGAACAAA TGCCAGAAAA CAAGAGCTGG ACGGATGCCG GCGGCGCAAC CCTCGGGTGG
CGCTGGTCGG CTGTTTCCGC GGCAGGACTC TATGTGCCCG GCGGGCTTGC CAGCTATCCG
TCCTCTGTGC TGATGAATGC CATTCCCGCC AAGGTTGCAG GAGTCGAACG TCTTGCGGTG
ACGGTTCCGA CGCCGGACGG GCAGATCAAC CCTCTGGTGC TCCTGGCGTG TCGGGTTTCT
GGCGTCGACG AAATTTACCG CGTTGGCGGC GCGCAAGCGA TCGCCGCGCT TGCGTATGGG
ACCGAAACCA TCGCCCCTGT GGATAAGATC ACAGGCCCCG GCAACGCCTT TGTGGCAGCC
GCCAAGCGGC GCGTATTTGG TAAAGTCGGC ATCGACATGA TCGCTGGGCC CTCCGAGATC
CTTGTGATCG CGGACAAGGA CAACAACCCA GATTGGATCG CATTGGACCT GCTCAGTCAG
GCGGAGCATG ACGAAAGCGC GCAATCCATC CTGATCACCG ACGATGCGGA ATTCGGATCT
GCGGTGGCGG TGGCGGTCGA TAAACGACTG GAAACGCTGG AACGCCGCGC CATCGCCGGC
GCCAGCTGGC GTGATTTCGG CGCTGTAATC GTAGTGCGTG ACATGGACGA GGCGGCGGCG
CTTTCCAACC GGATTGCACC CGAGCACCTT GAACTCTGTG TCGCCGATCC CGAAGCGCTG
AGCAAGAAAA CGATCCACGC GGGCGCAATT TTCATGGGCC AATATACGCC TGAGGCCATT
GGAGACTACA TCGGCGGGCC AAATCACGTC CTGCCCACGG CGAGGTCTGC GCGCTTCTCC
TCCGGTCTGT CGGTGATGGA TTTCATCAAG CGCACAACTC TGAGTCAGAT GACCCCCGAC
GCCTTGCGCA GCATTGGACC AGCGGCGGCG ACGCTGGCCG AGAGCGAGAG CCTCGAAGCG
CACGGGCTGT CCGTTCTCGC TCGCCTTGAG GCACTCAACC GCTGA
 
Protein sequence
MPVFLTSAEA DFEQSFTTLL NAKREDSPDV DAVVADIIAD VRGRGDAALL ELTQKFDRLD 
LPDSAALRIT AEEVDDAIKS VSEAERAALE LAADRIRAYH AEQMPENKSW TDAGGATLGW
RWSAVSAAGL YVPGGLASYP SSVLMNAIPA KVAGVERLAV TVPTPDGQIN PLVLLACRVS
GVDEIYRVGG AQAIAALAYG TETIAPVDKI TGPGNAFVAA AKRRVFGKVG IDMIAGPSEI
LVIADKDNNP DWIALDLLSQ AEHDESAQSI LITDDAEFGS AVAVAVDKRL ETLERRAIAG
ASWRDFGAVI VVRDMDEAAA LSNRIAPEHL ELCVADPEAL SKKTIHAGAI FMGQYTPEAI
GDYIGGPNHV LPTARSARFS SGLSVMDFIK RTTLSQMTPD ALRSIGPAAA TLAESESLEA
HGLSVLARLE ALNR