Gene Haur_3794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3794 
Symbol 
ID5735658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4763758 
End bp4766109 
Gene Length2352 bp 
Protein Length783 aa 
Translation table11 
GC content54% 
IMG OID641280946 
Productaldehyde dehydrogenase 
Protein accessionYP_001546558 
Protein GI159900311 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTATG GCCCAGCACC CGAAGCTGCT GCTCCAGCCC ACGAGTGGCT AGATGCCCAT 
AGCCGTCGTT TTGGGCTCTA TATTAATGGT ACATGGACAG AAGTCGCCAA CGAGCGCTTG
TTCGATTCGA TCAATCCAGC CAATCGCAGC GTGTTGGCCC AAGTGACCCA AGCGAGCAGC
GATGAAGTAA ACGCTGCGGT GGCTGCTGCC AAGGCGGCCT TTCCAGCTTG GTCGCAAACC
AGCGGCCATG TGCGTGCTCG CTATTTGTAT GCCTTGGCAC GCCAAATTCA AAAACATTCG
CGCCGTTTCG CGGTGCTCGA AACCCTCGAT AATGGCAAGC CCATCCGCGA AACCCGCGAC
ATTGATATTC CATTAGTCGC GCGGCATTTC TACTATCACG CTGGTTGGGC ACAATTGCAA
GAAAGCGATT TGGCAGGCTA CGAGCCGCTG GGCGTGGTCG GCCAAATTAT TCCGTGGAAC
TTTCCGCTGT TGATGTTGGC TTGGAAGATT GCCCCAGCCT TGGCGATGGG CAACACGGTG
GTGCTGAAGC CTGCCGAATG GACTTCATTA ACCGCCTTGG CATTTGCCGA AATTTGCCAC
GAAATTGGTT TGCCCAAGGG CGTGGTCAAC ATCGTAACTG GCGATGGTAA AGTTGGCGAG
CAAATCGTCA AGCACCCCGA TATTGCCAAA ATTGCCTTTA CTGGTTCAAC CGAAGTTGGC
AAAATTATTC GGAGCGCCAC CGCTGGCAGC GGCAAAAAAC TCTCCTTGGA GCTTGGCGGC
AAATCGCCCT TTATCGTGTT TGATAACGCT GATCTCGATA GCGTGGTCGA AGGCGTGGTT
GATGCGATTT GGTTTAATCA AGGCCAAGTT TGTTGCGCTG GCTCACGTTT GTTGGTGCAA
GAAAACATTG CCGATAAGCT GATTGGCAAG TTGCGCACTC GTATGGAGCA ATTGCGCATC
GGCGATCCTT TAGATAAAGC GATCGATATT GGCGCGATTG TTGCTCCAGC CCAATTACAA
AAAATCGAGC AACTGGTGGC CGAAGGCGAA AACGAAGGTT CAATCAAATG GCAACCATCG
TGGGCTTGCC CAACTGATGG CTACTTCTAT CCGCCAACCT TGTTTACCAA CGTGGCTCCC
GCTTCAACCT TGGCCCAAGT TGAAATTTTC GGGCCAGTCT TGGTCACGAT GACCTTCCGC
ACCCCTGATG AAGCGATTGC GATTGCCAAC AATACCCGTT TTGGTTTGGC CGCCAGCATT
TGGAGCGAAG ATATTAACGT GGCGCTGCAT GCCGCAGCCC GCGTCAAAGC AGGCGTAGTT
TGGATCAACA GCACCAACTT GTTTGATGCA GCAGCTGGCT TCGGCGGCTA TCGTGAAAGT
GGCTACGGTC GCGAAGGTGG CAAAGAAGGC TTATACGAAT ATCTCAAAAA ATCTGAGGTT
AAAAAGCTTA AAACCAAGGC CAGCCCAGCG CCCGCGCCTG TGGCAACCAC AGCCAGCAAC
GGCCTACCAG CGCTTGATCG CACGCCCAAA ATCTATATTG GCGGCAAGCA AGCCCGCCCC
GATTCGGGCT ACAGCCGAAT TGTGGTTGGC AGCAATGGCG AGCAGCTTGG CGAAGTTGGC
GATGGCAGCC GCAAAGATAT TCGCAACGCG GTCGAAGTTG CACGCAGTGC TGCTAACAGT
TGGTCGGCAG CAACCGCCTA TAATCGGGCG CAAGTGCTCT ATTTCTTGGC TGAAAATTTA
GGCGCACGTG CCGCCGAATT TGCCCAACGC ATTCGCCAAC AAACAGGCCG CAACGATGCC
GACCTCGAAG TTGAAACATC AATCGAGCGC TTATTTACCT ACGCCGCGTG GGCCGATAAA
TATGATGGCT CGGTGCATGC CACGCCTGTA CGCAACGCCA CCCTCGCCAT GGTCGAATCG
CTCGGTGTGC TTGGTTTGGT TTGTCCCAGC GAATATCCCT TGTTGGGCAC GATTTCATTG
CTAGCACCAG CCATCGCCTT GGGCAATAGT GCGATTATCA TTCCATCGCC AGAGCATCCA
CTTTCAGCCA CCGATTTGTA TCAAGTGCTC GATACCAGCG ATGTGCCGGC AGGCGTGGTC
AACATTATCA CCGGCGACCG CGATAGCCTA GCCAAAGTGC TGGCCGAGCA CAACGACGTT
GATGGTTTGT GGTATTGGGG CAGTGCTGAG GGTAGCGCCA TGGTCGAGCG CAGTTCAATC
GGCAACCTCA AACAAACCTG GGTCAACTAC GGCGAAACTC GCGATTGGCT TGATCGACGA
GTTGGCGAGG GCGAAGAATT TCTACGCCAC GCTAGCCAAA TCAAAAATAT TTGGGTTCCC
TACGGCGCAT AA
 
Protein sequence
MSYGPAPEAA APAHEWLDAH SRRFGLYING TWTEVANERL FDSINPANRS VLAQVTQASS 
DEVNAAVAAA KAAFPAWSQT SGHVRARYLY ALARQIQKHS RRFAVLETLD NGKPIRETRD
IDIPLVARHF YYHAGWAQLQ ESDLAGYEPL GVVGQIIPWN FPLLMLAWKI APALAMGNTV
VLKPAEWTSL TALAFAEICH EIGLPKGVVN IVTGDGKVGE QIVKHPDIAK IAFTGSTEVG
KIIRSATAGS GKKLSLELGG KSPFIVFDNA DLDSVVEGVV DAIWFNQGQV CCAGSRLLVQ
ENIADKLIGK LRTRMEQLRI GDPLDKAIDI GAIVAPAQLQ KIEQLVAEGE NEGSIKWQPS
WACPTDGYFY PPTLFTNVAP ASTLAQVEIF GPVLVTMTFR TPDEAIAIAN NTRFGLAASI
WSEDINVALH AAARVKAGVV WINSTNLFDA AAGFGGYRES GYGREGGKEG LYEYLKKSEV
KKLKTKASPA PAPVATTASN GLPALDRTPK IYIGGKQARP DSGYSRIVVG SNGEQLGEVG
DGSRKDIRNA VEVARSAANS WSAATAYNRA QVLYFLAENL GARAAEFAQR IRQQTGRNDA
DLEVETSIER LFTYAAWADK YDGSVHATPV RNATLAMVES LGVLGLVCPS EYPLLGTISL
LAPAIALGNS AIIIPSPEHP LSATDLYQVL DTSDVPAGVV NIITGDRDSL AKVLAEHNDV
DGLWYWGSAE GSAMVERSSI GNLKQTWVNY GETRDWLDRR VGEGEEFLRH ASQIKNIWVP
YGA