Gene Nmul_A0819 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0819 
SymbolhisD 
ID3786688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp933606 
End bp934928 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content55% 
IMG OID637810905 
Producthistidinol dehydrogenase 
Protein accessionYP_411518 
Protein GI82701952 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTCGA TAAAGAGATT GTCTTCTGCC GATACCGAGT TCGATAAGGC GCTGTCGGAA 
CTGCTGGCGT TTGAAAACAC CCAGGATGCA AAGCTGGAGG CTGCTGTCGC AGATATTCTC
GCGAAGATCA GGACGGAGGG AGATAAAGCC TTGCTGGAAT ACACGCTCCG TTTCGATCGC
GTGGATGCAA AATCGGCGGC GGATCTGGAA TTGCCCCGAA ATCGCCTGCA ACAAGCTCTT
CATAACCTGC CTGGCGAGCA ACGTAACGCC CTGGAGCAGG CTGCGGAACG GGTCCGCGTC
TATCATGAGA AACAGTTGAC GCAATCCTGG AGCTACGTGG AGCCTGACGG AACACACCTT
GGGCAGAAAA TCACCCCTCT CGATCGTGCC GGTCTGTATG TTCCCGGCGG CAAGGCAGCC
TACCCTTCAT CAGTCCTGAT GAACGCAATT CCCGCCAAGG TGGCAGGAGT GGGCGAACTT
GTCATGGTGA CACCCACTCC ACAGGGCGAA GTAAATGACC TGGTGCTCGC TGCTGCGGCC
ATTTGCGAGG TTGACCGGGT TTTCACCATA GGGGGCGCTC AAGCCGTGGG CGCACTGGCA
TATGGCACCC CTACCGTGCC GCGAGTGGAC AAGATCGTCG GCCCGGGAAA CGCTTATGTG
GCAACAGCCA AGCGGCATGT TTTCGGTGTG GTAGGGATCG ATATGCTCGC GGGACCTTCC
GAAATTCTGA TCATCTGCGA TGGCAAAACC AATCCGGACT GGATTGCGAT GGACATGTTT
TCCCAGGCGG AGCACGACGA GTTGGCGCAG GCAATCCTGC TGTCACCCGA TCTTCATTTT
ATCGAAACAG TCGCAGCGAG TATCGTCCGG CAGCTGGAAA CGATGCCACG CAAGGAGATA
ATCCGGACTT CGCTCGAAAA CAGGAGCGCC TTGATTCAGG TGCATGATCT GGAAGAAGCC
TGTGAGATCG CCAACAGCAT CGCGCCCGAA CATCTGGAGT TATCAGTAGA ACAGCCGGAA
AAGTGGGTGG AAAAAATAAG ACACGCGGGT GCGATTTTTC TGGGTCGCCA TACATCGGAA
GCGCTGGGAG ATTATTGCGC GGGTCCCAAC CACGTCCTCC CCACTTCCCG TACTGCACGC
TTTTCGTCGC CACTCGGAGT ATACGATTTT CAGAAACGCA GCAGCATTAT TCAGGTATCG
GGGCAGGGAT CAGCGAAATT GGGCGCCATT GCCTCTATCC TGGCCCAAGG TGAAGGGCTG
CAGGCACACG CAATGTCAGC GGAATATCGT TACACAAAAA AAATAGCCCT TGGAAAAAAT
TGA
 
Protein sequence
MISIKRLSSA DTEFDKALSE LLAFENTQDA KLEAAVADIL AKIRTEGDKA LLEYTLRFDR 
VDAKSAADLE LPRNRLQQAL HNLPGEQRNA LEQAAERVRV YHEKQLTQSW SYVEPDGTHL
GQKITPLDRA GLYVPGGKAA YPSSVLMNAI PAKVAGVGEL VMVTPTPQGE VNDLVLAAAA
ICEVDRVFTI GGAQAVGALA YGTPTVPRVD KIVGPGNAYV ATAKRHVFGV VGIDMLAGPS
EILIICDGKT NPDWIAMDMF SQAEHDELAQ AILLSPDLHF IETVAASIVR QLETMPRKEI
IRTSLENRSA LIQVHDLEEA CEIANSIAPE HLELSVEQPE KWVEKIRHAG AIFLGRHTSE
ALGDYCAGPN HVLPTSRTAR FSSPLGVYDF QKRSSIIQVS GQGSAKLGAI ASILAQGEGL
QAHAMSAEYR YTKKIALGKN