Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3867 |
Symbol | |
ID | 5735716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4858777 |
End bp | 4860273 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281018 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001546629 |
Protein GI | 159900382 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00214537 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGACC ATAAGGTGTA CTATAACTAC ATTGGCGGCG AGTGGGTGCC AGCGCGTTCG GGCAAGACCT ACGAGAACCG CAACCCCGCC GATACCCGCG ATGTGATTGG CATCTTCCCC GACTCTGGTG CTGAGGATAT TGCCGATGCG GTTGCTGCCG CAAAGGAAGC CTACAATGCA TGGCGACTTG TGCCAGCTCC CAAGCGTGGC GAGTTGCTGT ATCGGGCTTC GCAAATTCTT CAAGAACGCA AAGAGCAATA TGCCAATGAT ATGACCCGTG AAATGGGCAA GGTTTTGGCC GAAACCCGTG GCGATGTCCA AGAAGCGATC GACATGGGCT ATTTTATGGC TGGCGAAGGT CGGCGTTTGT ATGGCGTAAC CACGCCCTCG GAATTGCCCA ATAAATTCCA AATGAGCGTG CGCCAACCGT TGGGCGTGTG TGGCTTGATT ACGCCATGGA ACTTCCCAAT GGCCATCCCA TCGTGGAAGA TTTTTCCAGC CTTGATTTGC GGCAACACCG TGGTGATCAA GCCTGCCGAA GATACGCCAC TTTCAACCTA CAACTTTGTG CAAGCTTTGG TTGATGCTGG CCTACCCAAA GGCGTGGTCA ATATTGTTTC AGGCCATGGC CCAACTGCTG GCGAGCCATT GGTGCAACAT CCCGATGTTA AAGTGATTTC GTTCACTGGC TCGACCGAAG TTGGTCGCCA TGTTTCAACG CTTTGTGCCC AACAAGGCAA GCATGTATCG CTCGAAATGG GCGGCAAAAA CCCCATGATC ATCATGGACG ATGCTGATCT TGATTTGGTG TTGGCGGGCG CGGTTTGGGG CGCGTTTGGT ACAACTGGCC AACGCTGTAC CGCGACCTCA CGGATTATTG CTCATCGCTC AATTGTTGAT GAACTGACCA GCCGCATCCA AGCTGAAGCC AAAACCATTC AAGTGGGCAA TGGCTTGCTC GATGGCATGC AAATGGGACC ATCAATTAAC GAAGGCCAAT TGAGCGTGGT TGAAAAATAT GTCAAGATTG GCCGTGAAGA AGGCGCTGAA TTGGTGCTGG GCGGCGAACG CTTGACCGAT GGCGATTTAG GCCATGGCTT CTTCCACCAG CCAACCATTT TTGGCAATGT CAAACGCAAT ATGCGCATTG CTCAAGAGGA AATTTTCGGG CCAGTGGTCT CGATCATTCC GGTTGATAGC CTTGAAGAAG CGATTGATGT TGCCAACGAT GTGCCATACG GCTTATCATC ATCGATTTAT ACTCGCAATG TCAACAATGC CTTCATTGCC ATGCGCGATC TCTACACTGG GATTGTCTAT GTCAATGCGC CAACCATCGG GGCCGAAATT CACTTGCCCT TCGGTGGTAC CAAAGGCACT GGCAACGGCA AGCGTGAAGG TGGCACGCAG GTACTCGATA CCTACAGCGA ATGGAAATCG TTGTATGTCG ATTACAGTGG TGGCTTGCAA CGCGCCCAAA TCGATAACGC CGAATAA
|
Protein sequence | MSDHKVYYNY IGGEWVPARS GKTYENRNPA DTRDVIGIFP DSGAEDIADA VAAAKEAYNA WRLVPAPKRG ELLYRASQIL QERKEQYAND MTREMGKVLA ETRGDVQEAI DMGYFMAGEG RRLYGVTTPS ELPNKFQMSV RQPLGVCGLI TPWNFPMAIP SWKIFPALIC GNTVVIKPAE DTPLSTYNFV QALVDAGLPK GVVNIVSGHG PTAGEPLVQH PDVKVISFTG STEVGRHVST LCAQQGKHVS LEMGGKNPMI IMDDADLDLV LAGAVWGAFG TTGQRCTATS RIIAHRSIVD ELTSRIQAEA KTIQVGNGLL DGMQMGPSIN EGQLSVVEKY VKIGREEGAE LVLGGERLTD GDLGHGFFHQ PTIFGNVKRN MRIAQEEIFG PVVSIIPVDS LEEAIDVAND VPYGLSSSIY TRNVNNAFIA MRDLYTGIVY VNAPTIGAEI HLPFGGTKGT GNGKREGGTQ VLDTYSEWKS LYVDYSGGLQ RAQIDNAE
|
| |