Gene Haur_2013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2013 
Symbol 
ID5733902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2502613 
End bp2503947 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content53% 
IMG OID641279157 
Producthistidinol dehydrogenase 
Protein accessionYP_001544784 
Protein GI159898537 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAATTC AGCTTTATAC CGATCTTGCG CAGGCTCAAC AAGGCCCATT AGCACGAGTC 
GCCTTCGATA CGGTCGAAGT TCCAGAGCGT TTACAACAAA GCCTCGACCA CATGTTTGGG
GTTGGCACAA CGCCTGCCGC TGCGGTTGAT CAGATCTTAG CCAGTGTGCG GCGCGATGGC
GATGCGGCCT TAGAGCACTG GAGCCGCACG ATCGAAGGCG TTGAATTAAG CCAATTTGAG
GTCGATCGCT CGGCGATTGA AGCCGCCTAT AGCCAACTTG ATCCATTATT GGTTGAAGCG
TTACGGATCT CTGCTGCCGA GATCGAGCGT TTTCATCGTA AGCAAACCCG CCAAAGTTGG
GTTGATTGGT CGGATGAAGG AGCACTGGGT CAGATTGTTC TACCACTTGA GCGGATTGGG
GCGTATGCGC CAGGTGGCAC AGCTCCCCTT CCATCGTCAT TATTGATGGG GGTAATTCCA
GCTAAGGTAG CTGGAGTACG CGAGATTATT GTGTGCTCGC CGCCGCAACG TGATACTGGC
GAGATCTCGC CGTTGGTCTT GGTAGCTGCC GATATTGCTG GAGTCCACCG AATTTTTCGT
TTGGGCGGGG CACAGGCCAT TGCTGCGATG GCCTATGGCA CGAATAGTGT GCCACATGTC
GATAAAATTA TCGGTCCAGG CAATCTGTTT GTGGTGTTGG CCAAAAAGGC GGTGTATGGC
ACGGTTGATA TTGAAGCCTT GCCCGGCCCT ACCGAAACCA TGGTGATTGC CGATGCTGAT
GCTAACCCTG AGCTAGTGGC TGCCGATTTA CTCGCCCAAG CCGAACATGA TTTGCTGGCT
TCGGCGATTT TGCTTACGCC TTCGTTGGAA TTGGCCGAAA AAGTCCAGGT CGCGGTTGCT
CGTCAACTCG AAGAGCTTGA ACGAGCTGAA ATCGCGGCCC AAGCGCTCAC CAATCGCTCA
GGGATTGTGC TTGTCCCTTC ATTAGAGGTT GCATTCGATT TAAGTAATGC CTATGGCCCT
GAGCACCTCT GTTTATTAGT CAACGATCCT TGGCAATATG TGGGTAAAGT ACGCAATGCT
GGGGGCATTT TCCTTGGTGA ACGTTCGTTT GAAGTGTTGG GTGATTATGT GGCTGGGCCA
TCGCACATTA TGCCCACTGG TGGTACGGCT CGCTATGCCT CGCCAGTCAA TGTTGACCAC
TTCCGAAAAG TTATTTCGTT GGTTGGCTTG AACGAAAAAG CCTTGCAACG ATTAGGGCCA
GTCGCTCAGC GTTTGGCTGA GGCCGAAGGA CTGACCGCCC ATGCGGCGGC TGTACGCCGC
CGTTTAGAGC AATAA
 
Protein sequence
MPIQLYTDLA QAQQGPLARV AFDTVEVPER LQQSLDHMFG VGTTPAAAVD QILASVRRDG 
DAALEHWSRT IEGVELSQFE VDRSAIEAAY SQLDPLLVEA LRISAAEIER FHRKQTRQSW
VDWSDEGALG QIVLPLERIG AYAPGGTAPL PSSLLMGVIP AKVAGVREII VCSPPQRDTG
EISPLVLVAA DIAGVHRIFR LGGAQAIAAM AYGTNSVPHV DKIIGPGNLF VVLAKKAVYG
TVDIEALPGP TETMVIADAD ANPELVAADL LAQAEHDLLA SAILLTPSLE LAEKVQVAVA
RQLEELERAE IAAQALTNRS GIVLVPSLEV AFDLSNAYGP EHLCLLVNDP WQYVGKVRNA
GGIFLGERSF EVLGDYVAGP SHIMPTGGTA RYASPVNVDH FRKVISLVGL NEKALQRLGP
VAQRLAEAEG LTAHAAAVRR RLEQ