Gene Gdia_2140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2140 
SymbolhisD 
ID6975568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2371408 
End bp2372721 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content74% 
IMG OID643391669 
Producthistidinol dehydrogenase 
Protein accessionYP_002276513 
Protein GI209544284 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.480226 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGGC TGGATACCGC GCAACCCGAT TTCCGTGCCG CCTTCGCCCG CCTGCTGGAC 
GACCGCGAGG GCGACACCGC CCGCGTCGAT GCCCCGGTCG CCGAGATCCT GGCCGCCGTG
CGTGCGCGGG GCGACGAGGC GCTGTGCGCC TATACCGCCC GGTTCGACCG CATGCCGGTC
ACGCCGGACC GCCTGCGCAT CACCGAGGCC GAGATCGAGG CCGCCTGCGC CCGGGTCCCG
CCCGACCTGC TGGCGGCCCT GGATGTCGCG GCCACCCGGA TCGAGGCCTT CCACCGCGCC
CAGATGCCGG CCGACCTGCG CTATACCGAC GCGGACGGGG TGGATCTGGG CATGCGCTGG
ACGGCGCTGG ACGCGGTCGG GCTGTACGTG CCCGGCGGCA CGGCGGCCTA TCCGTCCTCG
GTCCTGATGA ACGCCATGCC CGCGCGGGTG GCGGGGGCGG CGCGGCTGGC GATGTGCGTG
CCGACCCCGG ACGGCGTGCT GAATCCGCTG GTGCTGGCCG CCGCCCGACG CGCCGGCGTG
ACCGAGATCT ATCGCGTCGG CGGGGCGCAG GCCGTGGCGG CCATGGCCTA CGGCACCGCG
ACCATCCGCC CGGTGGACCG CGTGGTCGGA CCCGGCAACG CCTATGTGGC CGAGGCCAAG
CGTCAGGTGT TCGGCCGGGT GGGCATCGAC AGCATCGCCG GCCCGTCCGA GGTCGTGGTG
GTGGCCGACA GCGGCACCGA TCCGCGCATC GTCGCGCTGG ACCTGCTGGC GCAGGCCGAG
CATGACGCCC TGGCGCAGTC GATCCTGATC ACCCAGGACG CCACCCTGGC CGACCGGGTG
GCGGAGGCGG TCGAGGCCGA ACTGCGCACC CTGCCCCGCG CCGCCATCGC GGGGGCGAGC
TGGGGCGCCC ACGGCGCCAT CATCACCGTG CGCGACCTGG ACGAGGCCGC GTCGCTGATC
GACGCGATCG CGCCCGAACA TCTGGAACTG CTGCTGGCCG ATCCGGAACC GCTGTTCGCC
CGGGTCCGCC ATGCCGGGGC GATCTTCCTG GGCCGGCAAT GCGCCGAGGC GATCGGCGAT
TATGTCGGCG GTCCGAACCA TGTCCTGCCC ACCAGCCGGA CCGCGCGCTT CGCCTCGGGC
CTGTCGGTGT TCGACTTCCT GAAGCGCACG ACCTTCATCG GCGCGGGGCC GGACGCGCTG
CGCCGGATCG GGCCGGCGGC GGTGGCCCTG GCGCGGGCCG AGGGGCTGGA CGCACACGCG
CTGAGCGTGT CGGCGCGGCT GGACGCCGTG GCGCGCGAGT CCGACAAAGC TTGA
 
Protein sequence
MKRLDTAQPD FRAAFARLLD DREGDTARVD APVAEILAAV RARGDEALCA YTARFDRMPV 
TPDRLRITEA EIEAACARVP PDLLAALDVA ATRIEAFHRA QMPADLRYTD ADGVDLGMRW
TALDAVGLYV PGGTAAYPSS VLMNAMPARV AGAARLAMCV PTPDGVLNPL VLAAARRAGV
TEIYRVGGAQ AVAAMAYGTA TIRPVDRVVG PGNAYVAEAK RQVFGRVGID SIAGPSEVVV
VADSGTDPRI VALDLLAQAE HDALAQSILI TQDATLADRV AEAVEAELRT LPRAAIAGAS
WGAHGAIITV RDLDEAASLI DAIAPEHLEL LLADPEPLFA RVRHAGAIFL GRQCAEAIGD
YVGGPNHVLP TSRTARFASG LSVFDFLKRT TFIGAGPDAL RRIGPAAVAL ARAEGLDAHA
LSVSARLDAV ARESDKA