Gene Information |
Locus tag | Haur_2524 |
Symbol | |
ID | 5734402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3229358 |
End bp | 3230971 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279664 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001545290 |
Protein GI | 159899043 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
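The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record itself. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of the product, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Haur_2524.
length = gene_length(3229358, 3230971)
print(length)                  # 1614
print(protein_length(length))  # 537
```

Both results match the Gene Length (1614 bp) and Protein Length (537 aa) rows.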
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00255525 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAATTT TGGGAATTTT GCTCGTTAGG TTGATCGTAA GCTTCAGGCC TGATACTACG ATCAAATTCT TCCTCCTCTA CACTCTTCTC TCCTGCAAGC GGCCCGACTG GGCCGTTTGT GGTTTTAAGC GATTTTTATT TTGCCACCAA AGCTCGCTTA TAATAGCATG TGGAAATGTT GTGTTGGAGG AATCAATGAA CCTCATGCAA ATTCATGGGC AATTTGCGGC GTTGCAGGCC CAGCGCGGGC GAATCGCTGC CAGCCGTGCA CCTGAGCGCA TTGGCTATTT ACAGCAATTT AAACGGGCGA TCGAACAAGC CCGACCAGAT ATTCATGCGG CGTTGCAGGC CGATTTTGCC AAACCAGCGG CTGAAATTGA GGCCACCGAA ATTCAGCAGG TGATCGAGCA AATCAATTTT GCTATCAAAC GGCTTGAAAC ATGGATGCAG CCCAAACGAG TTAAAACCCC AACCATGCTA ACGGGCAGCA AAAGTTGGAT TCAATATGAG CCACGCGGCG TTGTGCTGAT TCTTGCGCCG TGGAATTACC CGCTCTCACT AGCACTTATG CCTTTGATTG GGGCAGTGGC GGCTGGGAAT TGTGCGATTG TGCGGCCATC GGAGCGCATG CCACATACCG CTCAAGTTGT AGCAAACATT ATTGCTACCG CCTTCAAACC TGAGCATGTT ACCAGCGTTG TGGGCGATGT TGATACGGCT GAAGCATTGC TCGACTTGCC ATTCGACCAT ATTTTCTTTA CGGGCAGCCC ACGAATCGGC CAATATGTGA TGCAACGCGC CGCCGAGCAT TTTAGCTCGG TTACTCTAGA ACTGGGTGGC AAATCGCCAG CGATTGTTGA TCGTTCAGCC GATTTGAAAC GTGCTGCACA GGCGATTGTG TGGGGCAAAT TTGTCAATGC CGGCCAAACC TGTGTTGCGC CTGATCATGT CTGGGTTCAG CGTGAGCAAG CTCAAGCCTT GACCCAACTG ATTATTAAAC AAATTGAGCG TAACTACGGC AAAGGCGATT ATACCCGCCT GCAATCGCCC GATTTGGCCA ATGTGATCGA TGCTAATGCC ACTGCTCGCC TGCGTGGTTT GGTCAATAAT TCGGTAGCCC AAGGGGCGTT GGTCGCGCTG GGTGGCCAAT CGACCGATCA TCCTGCACGC TTTGCGCCAA CCGTGCTGAC CAACGTTAAA CCTAGCATGG CAATTATGCA GGAAGAAATT TTTGGGCCGA TTCTGCCAAT TTTGGTGTAC GACCAAATTG ATGAAGTGAT TATGGCGACG CGAGCTAGCG GCAAGCCCTT GACCATGGCG ATTTTTGCCG AGAATCAAGC GATTATCAAC TGGCTGCTAC GCGAAATTCC GGCTGGCAGC AGTATGATAA ACGGGGTTTT ACTGAATGTG GTTAATCCGA ATTTGCCATT TGGTGGGGTT GGCCAGAGCG GCATTGGCAA TTATCATGGC TTTTACAGCT TCAAAACATT TTCGCATGAA CGAGCGGTAT TTCAACTTGG CGGCTTAAAT TTAGTAAATT TATTTCAACC GCCCTATCGC TCGGTGTCTA AGCGCTTGGC AGCGTGGTCG CGGCGGATCA TGAGCAAACG CTAA
|
Protein sequence | MGILGILLVR LIVSFRPDTT IKFFLLYTLL SCKRPDWAVC GFKRFLFCHQ SSLIIACGNV VLEESMNLMQ IHGQFAALQA QRGRIAASRA PERIGYLQQF KRAIEQARPD IHAALQADFA KPAAEIEATE IQQVIEQINF AIKRLETWMQ PKRVKTPTML TGSKSWIQYE PRGVVLILAP WNYPLSLALM PLIGAVAAGN CAIVRPSERM PHTAQVVANI IATAFKPEHV TSVVGDVDTA EALLDLPFDH IFFTGSPRIG QYVMQRAAEH FSSVTLELGG KSPAIVDRSA DLKRAAQAIV WGKFVNAGQT CVAPDHVWVQ REQAQALTQL IIKQIERNYG KGDYTRLQSP DLANVIDANA TARLRGLVNN SVAQGALVAL GGQSTDHPAR FAPTVLTNVK PSMAIMQEEI FGPILPILVY DQIDEVIMAT RASGKPLTMA IFAENQAIIN WLLREIPAGS SMINGVLLNV VNPNLPFGGV GQSGIGNYHG FYSFKTFSHE RAVFQLGGLN LVNLFQPPYR SVSKRLAAWS RRIMSKR
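The relationship between the two sequence rows can be illustrated with a short translation sketch. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TAA even though the Strand field is "-"), so no reverse complement is applied. The amino-acid assignments are those of the standard code, which translation table 11 shares; table 11 differs only in which codons may act as starts, and that does not matter here because the start codon is ATG. The name `translate` is hypothetical, and input containing characters other than A, C, G, T raises a `KeyError`.

```python
# Codon table in the conventional T, C, A, G ordering of each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGGGAATTT TGGGAATTTT GCTCGTTAGG"))  # MGILGILLVR
```

Passing the full gene sequence to the same function allows the result to be compared against the Protein sequence row.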