Gene Haur_2524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2524 
Symbol 
ID5734402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3229358 
End bp3230971 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content50% 
IMG OID641279664 
Productaldehyde dehydrogenase 
Protein accessionYP_001545290 
Protein GI159899043 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00255525 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAATTT TGGGAATTTT GCTCGTTAGG TTGATCGTAA GCTTCAGGCC TGATACTACG 
ATCAAATTCT TCCTCCTCTA CACTCTTCTC TCCTGCAAGC GGCCCGACTG GGCCGTTTGT
GGTTTTAAGC GATTTTTATT TTGCCACCAA AGCTCGCTTA TAATAGCATG TGGAAATGTT
GTGTTGGAGG AATCAATGAA CCTCATGCAA ATTCATGGGC AATTTGCGGC GTTGCAGGCC
CAGCGCGGGC GAATCGCTGC CAGCCGTGCA CCTGAGCGCA TTGGCTATTT ACAGCAATTT
AAACGGGCGA TCGAACAAGC CCGACCAGAT ATTCATGCGG CGTTGCAGGC CGATTTTGCC
AAACCAGCGG CTGAAATTGA GGCCACCGAA ATTCAGCAGG TGATCGAGCA AATCAATTTT
GCTATCAAAC GGCTTGAAAC ATGGATGCAG CCCAAACGAG TTAAAACCCC AACCATGCTA
ACGGGCAGCA AAAGTTGGAT TCAATATGAG CCACGCGGCG TTGTGCTGAT TCTTGCGCCG
TGGAATTACC CGCTCTCACT AGCACTTATG CCTTTGATTG GGGCAGTGGC GGCTGGGAAT
TGTGCGATTG TGCGGCCATC GGAGCGCATG CCACATACCG CTCAAGTTGT AGCAAACATT
ATTGCTACCG CCTTCAAACC TGAGCATGTT ACCAGCGTTG TGGGCGATGT TGATACGGCT
GAAGCATTGC TCGACTTGCC ATTCGACCAT ATTTTCTTTA CGGGCAGCCC ACGAATCGGC
CAATATGTGA TGCAACGCGC CGCCGAGCAT TTTAGCTCGG TTACTCTAGA ACTGGGTGGC
AAATCGCCAG CGATTGTTGA TCGTTCAGCC GATTTGAAAC GTGCTGCACA GGCGATTGTG
TGGGGCAAAT TTGTCAATGC CGGCCAAACC TGTGTTGCGC CTGATCATGT CTGGGTTCAG
CGTGAGCAAG CTCAAGCCTT GACCCAACTG ATTATTAAAC AAATTGAGCG TAACTACGGC
AAAGGCGATT ATACCCGCCT GCAATCGCCC GATTTGGCCA ATGTGATCGA TGCTAATGCC
ACTGCTCGCC TGCGTGGTTT GGTCAATAAT TCGGTAGCCC AAGGGGCGTT GGTCGCGCTG
GGTGGCCAAT CGACCGATCA TCCTGCACGC TTTGCGCCAA CCGTGCTGAC CAACGTTAAA
CCTAGCATGG CAATTATGCA GGAAGAAATT TTTGGGCCGA TTCTGCCAAT TTTGGTGTAC
GACCAAATTG ATGAAGTGAT TATGGCGACG CGAGCTAGCG GCAAGCCCTT GACCATGGCG
ATTTTTGCCG AGAATCAAGC GATTATCAAC TGGCTGCTAC GCGAAATTCC GGCTGGCAGC
AGTATGATAA ACGGGGTTTT ACTGAATGTG GTTAATCCGA ATTTGCCATT TGGTGGGGTT
GGCCAGAGCG GCATTGGCAA TTATCATGGC TTTTACAGCT TCAAAACATT TTCGCATGAA
CGAGCGGTAT TTCAACTTGG CGGCTTAAAT TTAGTAAATT TATTTCAACC GCCCTATCGC
TCGGTGTCTA AGCGCTTGGC AGCGTGGTCG CGGCGGATCA TGAGCAAACG CTAA
 
Protein sequence
MGILGILLVR LIVSFRPDTT IKFFLLYTLL SCKRPDWAVC GFKRFLFCHQ SSLIIACGNV 
VLEESMNLMQ IHGQFAALQA QRGRIAASRA PERIGYLQQF KRAIEQARPD IHAALQADFA
KPAAEIEATE IQQVIEQINF AIKRLETWMQ PKRVKTPTML TGSKSWIQYE PRGVVLILAP
WNYPLSLALM PLIGAVAAGN CAIVRPSERM PHTAQVVANI IATAFKPEHV TSVVGDVDTA
EALLDLPFDH IFFTGSPRIG QYVMQRAAEH FSSVTLELGG KSPAIVDRSA DLKRAAQAIV
WGKFVNAGQT CVAPDHVWVQ REQAQALTQL IIKQIERNYG KGDYTRLQSP DLANVIDANA
TARLRGLVNN SVAQGALVAL GGQSTDHPAR FAPTVLTNVK PSMAIMQEEI FGPILPILVY
DQIDEVIMAT RASGKPLTMA IFAENQAIIN WLLREIPAGS SMINGVLLNV VNPNLPFGGV
GQSGIGNYHG FYSFKTFSHE RAVFQLGGLN LVNLFQPPYR SVSKRLAAWS RRIMSKR