Gene GSU3100 details

Gene Information

Locus tag: GSU3100
Symbol: hisD
ID: 2688475
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 3403592
End bp: 3404881
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 65%
IMG OID: 637127793
Product: histidinol dehydrogenase
Protein accession: NP_954141
Protein GI: 39998190
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
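
The start, end, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(3403592, 3404881)
print(length, protein_length(length))  # prints: 1290 429
```

Both results agree with the Gene Length (1290 bp) and Protein Length (429 aa) fields.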


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATTCC TCGACATCAG GGATACGAAC TTTGACGCGG AATTCGCTGC CATCCTCGCC 
CGGGGCGAGG AGACCGGCCG CGAGGTGGAG CAGGTGGTTC TCGACATCAT CGCCGATGTC
CGTGCACGGG GAGACGAGGC GCTCCTGGAG TACACCCGGC GCTTCGACCG GCTTGAGGCC
GACTCCGTCG CCGCCCTCCA GGTGACCGAG GACGAGATCG AGTACGCCTT TGCCAAGGTG
AAGGACGAGG AGATTGCCGC CCTCAAGCTG GCGGTGGAGC GGGTGGCCCG CTTCCACGAG
AAGCAGAAGC AGGAGACCTG GCTCTCCACC ACCGAGCCAG ACATCCTTCT CGGTCAGATG
GTGACGCCCC TGGAGCGGGT CGGGATCTAC GTTCCCGGCG GCAAGGCGAG CTACCCTTCC
AGTGTCATCA TGAATGCAGT TCCGGCCCGA GTGGCCGGCG TCGGCGAGAT CGTCATGGTG
GCCCCTACCC CCGGCGGCGA GATCAACCCG CACGTTCTGG TGGCGGCGCG GCTTTCCGGT
GTTGACCGGA TTTTCCGGAT GGGAGGCGCC CAGGCGGTGG CGGCCCTGGC CTATGGGACC
GCGACGGTGC CCCGGGTGGA CAAGATCACC GGCCCGGGGA ACATCTACGT GGCCACCGCC
AAAAAGCTCG TCTTCGGCCA GGTGGGGATC GACATGATCG CCGGACCCAG CGAGATTCTC
GTCATCAACG ACGGGAGCGG CACCCCGGCC CACATCGCCG CCGACCTCCT TTCCCAGGCG
GAGCACGACG AACTTGCTTC ATCCATCCTC ATCACCACCG ACCGCGGTTT CGGCGAGCAG
GTGGCGACGG AGGTGGAGCG GCAACTGGCG CAACTCTCCC GGGAGACCAT CGCCCGCACG
TCGTGGGAGA CCTACGGCGC GGTCATCGTG GCCGGTAGCC TGGACGAGGC CATCGCTTTC
TCGAACCGGA TCGCCCCGGA GCACCTGGAG CTTGCTGTGG CAAATCCCTT CGAGATACTG
CCGCGGATCA AAAACGCCGG TGCTATCTTC CTCGGCCACT TCACCCCCGA GGCGGCCGGC
GACTACCTGG CCGGCCCGAA CCACACCCTT CCCACCGGCG GTACGGCCCG TTTCTTCTCC
CCACTGTCGG TGGACGATTT CGTGAAGAAA TCCTCTATCG TCTACTTCAG TGCGGCGGGG
TTGAACCGTC TGGGCCGCGA CATCGTCAGT ATTGCCGAGA TGGAGGGGCT GGAGGCCCAC
GGCAGGTCGG TAAGCATCCG CCTGAAATAA
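
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here, with spaces between 10-base groups; `gc_content` is a hypothetical helper name. For brevity the example runs on the first row only, so its result is not the whole-gene figure of 65%.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between groups is ignored."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above (60 bases).
first_row = "ATGAGATTCC TCGACATCAG GGATACGAAC TTTGACGCGG AATTCGCTGC CATCCTCGCC"
print(round(gc_content(first_row), 1))  # prints: 53.3
```

Passing the full 1290-base sequence to the same function gives the value to compare against the GC content field.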
 
Protein sequence
MRFLDIRDTN FDAEFAAILA RGEETGREVE QVVLDIIADV RARGDEALLE YTRRFDRLEA 
DSVAALQVTE DEIEYAFAKV KDEEIAALKL AVERVARFHE KQKQETWLST TEPDILLGQM
VTPLERVGIY VPGGKASYPS SVIMNAVPAR VAGVGEIVMV APTPGGEINP HVLVAARLSG
VDRIFRMGGA QAVAALAYGT ATVPRVDKIT GPGNIYVATA KKLVFGQVGI DMIAGPSEIL
VINDGSGTPA HIAADLLSQA EHDELASSIL ITTDRGFGEQ VATEVERQLA QLSRETIART
SWETYGAVIV AGSLDEAIAF SNRIAPEHLE LAVANPFEIL PRIKNAGAIF LGHFTPEAAG
DYLAGPNHTL PTGGTARFFS PLSVDDFVKK SSIVYFSAAG LNRLGRDIVS IAEMEGLEAH
GRSVSIRLK
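
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table by hand rather than relying on a library; it assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, which this sketch does not model (an alternative start codon such as GTG would be read as V, not M). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above.
print(translate("ATGAGATTCC TCGACATCAG GGATACGAAC TTTGACGCGG AATTCGCTGC CATCCTCGCC"))
print(translate("GGCAGGTCGG TAAGCATCCG CCTGAAATAA"))
```

The two calls print `MRFLDIRDTNFDAEFAAILA` and `GRSVSIRLK`, matching the first 20 and last 9 residues of the protein sequence listed above.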