Gene AFE_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAFE_3041 
SymbolhisD 
ID7137196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 23270 
KingdomBacteria 
Replicon accessionNC_011761 
Strand
Start bp2731557 
End bp2732858 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content65% 
IMG OID643531392 
Producthistidinol dehydrogenase 
Protein accessionYP_002427408 
Protein GI218665150 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.2292 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCGTC TGGACACCAG CGACCCTGAT TTCGCCCAGC AGTTTCACGC GCTGCATGAT 
TGGGATGCCA ACCTCGATCC GCAGATCGAA GTGCGGGTAC GAGAGATCGT CACCACCGTC
CGTGATCGGG GCGATGCGGC GCTACGCGAG TACACGGAGC GCTTTGATGG GGTGACGACG
GCTTCCGCCG CTGAGCTGGA GATCCCCCGC AGTGCCTGGG ATGCGGCGCT CCACGGTCTG
GAGCCCACCC AGCGAGTTGC CCTGGAAGAG GCGGCGCAAC GTATCCGCAG TTACCACGAG
CACCAGCGCA GTGTAGGCTG GACCTTTACC GAGGCCGACG GCACGATGCT CGGACAGCGC
ATCCTGCCCT TGGCCCGGGT GGGGATTTAC GTACCCGGCG GCAAGGCGGC TTATCCCAGC
TCCGTGCTGA TGAATGCCAT TCCCGCGCAC GTGGCGGGCG TGAAGGAAAT CATCATGACC
GTACCCACCC CGCAGGGGCA GGTGAATCCC TGGGTGCTGG CCGCAGCCGC CATTGCCGGG
GTGGACCGGG TGTTCTGTAT CGGCGGTGCG CAGGCAGTGG CGGCGCTCGC CTACGGTACG
GAGAGCGTCC CCGCGGTAGA CAAGATTGTC GGCCCCGGCA ATATCTATGT GGCTACCGCC
AAGCGCATGG TCTTTGGCCG GGTAGGCATC GATATGATTG CCGGACCCAG CGAAATCCTC
GTGATCAGCG ATGGCTCGGC ACCGGCGGAA TGGTTGGCCT GGGACCTGCT CTCACAGGCG
GAGCATGATG AGATTGCCCA GAGTATTTTC ATCAGTTGGG ACGATGCCCA CATCGAGTCG
GTGGTGAACG CGGTGGATGC CGCCCTCGAT GTGCTCGATC GCGCACCCAT CGCCCGCAAG
AGCTGGGCAG ACCGGGGGGC GGTGATTCGT GTGCGGGACC GTGCCGAGGC CTGCGCCATT
GCCGACCGTA TCGCGCCGGA ACATCTGGAA CTAGCGGTGC AGAATCCCGA AGACTGGCTG
GCGGACATTC ACAATGCCGG GGCCATCTTC ATGGGCATCC ATAGTTGTGA GGCCCTCGGC
GACTATGTGG CCGGCCCCAA CCATGTGCTG CCCACGGGGG GCAGCGCACG TTTTTCCTCG
CCCCTCGGCG TCTATGATTT CGTCAAGCGG AGCAGCCTCA TTCACAGCAG CCCCGCCGGC
GCCGCGCGAC TGGGACAGAT CGCCGAACGT CTCGCCCTGG GCGAGGGCCT GACCGCCCAT
GCCCGTTCAG CAGCCTGCCG CATCCCCGAA GCCGGATCAT GA
 
Protein sequence
MNRLDTSDPD FAQQFHALHD WDANLDPQIE VRVREIVTTV RDRGDAALRE YTERFDGVTT 
ASAAELEIPR SAWDAALHGL EPTQRVALEE AAQRIRSYHE HQRSVGWTFT EADGTMLGQR
ILPLARVGIY VPGGKAAYPS SVLMNAIPAH VAGVKEIIMT VPTPQGQVNP WVLAAAAIAG
VDRVFCIGGA QAVAALAYGT ESVPAVDKIV GPGNIYVATA KRMVFGRVGI DMIAGPSEIL
VISDGSAPAE WLAWDLLSQA EHDEIAQSIF ISWDDAHIES VVNAVDAALD VLDRAPIARK
SWADRGAVIR VRDRAEACAI ADRIAPEHLE LAVQNPEDWL ADIHNAGAIF MGIHSCEALG
DYVAGPNHVL PTGGSARFSS PLGVYDFVKR SSLIHSSPAG AARLGQIAER LALGEGLTAH
ARSAACRIPE AGS