Gene GM21_3799 details


Gene Information

Locus tag: GM21_3799
Symbol: hisD
ID: 8139173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4371058
End bp: 4372347
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 66%
IMG OID: 644871418
Product: histidinol dehydrogenase
Protein accession: YP_003023576
Protein GI: 253702387
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
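
The start, end, gene length, and protein length fields in this record agree with one another, and a short sketch can confirm it. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4371058, 4372347)
print(length, protein_length(length))  # 1290 429
```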


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 114
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTTCC TTGACATTAG GGAAGCAGAT TTTCAGGCGA AATTCGACGC CATAGTGGAG 
CGCGGCGAGG AGTCCGGGCG CGAGGTCGAG GAGGTTGTCC TCGGCATCAT CGCCGACGTC
AGGAGCCGCG GCGACGAGGC GCTTCTGGAA TACACCAGGC GCTTTGACCG GCTGGAGTGC
GACGCCGCCG GGCTGGAGGT GACGCCTGAG GAATTCGAGA AGGCGTTGGC CAAGGTGGAC
CAGAAGGACC TGGCGGCGCT GAAACTCGCG GTGGAGAGGG TGGCCCGCTT TCACGAGAAA
CAGAAGCAGC AGAGCTGGAT CTCCACCGAG GAGAGCGACA TCATGGTGGG GCAGAAGGTG
ACTCCGCTCG CCAAGGTCGG CATCTACGTA CCCGGCGGCA AGGCCTGCTA CCCTTCCAGC
GTCGTCATGA ACGCCGTCCC CGCCAAGGTC GCCGGCGTGG GCGAGATCGT CATGGTAGTG
CCGGCCCCCG GCGGCGAGAT GAACCCGCAC GTGCTGACCG CGGCGAAACT CTCCGGCGTG
GACCGCGTCT TTCGCATCGG CGGCGCTCAG GCGGTAGCGG CGCTAGCCTA CGGCACGGCC
ACCGTCCCCA AGGTCGACAA GATCACCGGC CCCGGCAACA TCTACGTGGC CACGGCCAAG
AAGCTCGTCT TCGGCCAGGT CGGCATCGAC ATGATCGCGG GGCCGAGCGA GATCCTGGTC
ATCAACGACG GCAGCGGCAA CCCGGTCCAC GTGGCGGCGG ATCTCCTCTC CCAGGCGGAG
CACGACGAGC TTGCCTCCTC CGTGCTGATC ACCACCGACC GCAGCTTCGG CGAGCAGGTG
GCGGCCGAGG TGGAGCGCCA GTTGAAAGAG CTCTCCCGTG AGGTGATCGC GCGCAAGTCC
TGGGAATCCT TCGGCGTCAT CATCGTGGCC GGGAACCTGG AGGAGGCCAT CGCGTTTTCC
AACCGGATCG CTCCCGAGCA CCTGGAGCTT GCCGTCGAGA ACCCCTTCGA GATCCTGCCG
CTCATCACCA ACGCCGGCGC CATCTTCATG GGGCACTACA CCCCCGAAGC GGCCGGCGAC
TACCTGGCCG GACCCAACCA TACCCTCCCC ACCGGCGGGA CGGCGCGCTT TTTCTCGCCG
CTCTCCGTGG ACGATTTCGT CAAGAAAAGC TCCATCATCC ACTTCACCCG CGGCGGCCTT
GAGCGCGTCG GCGAGGACAT CGTCAGGATC TCCCGCCTGG AAGGGCTCGA CGCCCACGGC
AGGTCGGTGA GCCTCAGGTT GGAGAAGTAG
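
A GC content figure like the one in the record can be recomputed from the block-formatted sequence once the spacing is removed. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene length. The helper names are hypothetical, and only the first sequence line is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "ATGCAGTTCC TTGACATTAG GGAAGCAGAT TTTCAGGCGA AATTCGACGC CATAGTGGAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the whole gene sequence to `clean_sequence` and then `gc_percent` gives the gene-wide value to compare against the record.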
 
Protein sequence
MQFLDIREAD FQAKFDAIVE RGEESGREVE EVVLGIIADV RSRGDEALLE YTRRFDRLEC 
DAAGLEVTPE EFEKALAKVD QKDLAALKLA VERVARFHEK QKQQSWISTE ESDIMVGQKV
TPLAKVGIYV PGGKACYPSS VVMNAVPAKV AGVGEIVMVV PAPGGEMNPH VLTAAKLSGV
DRVFRIGGAQ AVAALAYGTA TVPKVDKITG PGNIYVATAK KLVFGQVGID MIAGPSEILV
INDGSGNPVH VAADLLSQAE HDELASSVLI TTDRSFGEQV AAEVERQLKE LSREVIARKS
WESFGVIIVA GNLEEAIAFS NRIAPEHLEL AVENPFEILP LITNAGAIFM GHYTPEAAGD
YLAGPNHTLP TGGTARFFSP LSVDDFVKKS SIIHFTRGGL ERVGEDIVRI SRLEGLDAHG
RSVSLRLEK
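
The protein sequence follows from the gene sequence by codon-by-codon translation. Below is a minimal sketch, assuming the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in which start codons they permit). It does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGCAGTTCC TTGACATTAG GGAAGCAGAT"))  # MQFLDIREAD
print(translate("AGGTCGGTGA GCCTCAGGTT GGAGAAGTAG"))  # RSVSLRLEK
```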