Gene Information |
Locus tag | Haur_3794 |
Symbol | |
ID | 5735658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4763758 |
End bp | 4766109 |
Gene Length | 2352 bp |
Protein Length | 783 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280946 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001546558 |
Protein GI | 159900311 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
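The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes a single stop codon (both assumptions are consistent with the listed values, but neither is stated by the record). The function names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 4763758
end_bp = 4766109
gene_length_bp = 2352
protein_length_aa = 783


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the assumptions above are right.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```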
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTATG GCCCAGCACC CGAAGCTGCT GCTCCAGCCC ACGAGTGGCT AGATGCCCAT AGCCGTCGTT TTGGGCTCTA TATTAATGGT ACATGGACAG AAGTCGCCAA CGAGCGCTTG TTCGATTCGA TCAATCCAGC CAATCGCAGC GTGTTGGCCC AAGTGACCCA AGCGAGCAGC GATGAAGTAA ACGCTGCGGT GGCTGCTGCC AAGGCGGCCT TTCCAGCTTG GTCGCAAACC AGCGGCCATG TGCGTGCTCG CTATTTGTAT GCCTTGGCAC GCCAAATTCA AAAACATTCG CGCCGTTTCG CGGTGCTCGA AACCCTCGAT AATGGCAAGC CCATCCGCGA AACCCGCGAC ATTGATATTC CATTAGTCGC GCGGCATTTC TACTATCACG CTGGTTGGGC ACAATTGCAA GAAAGCGATT TGGCAGGCTA CGAGCCGCTG GGCGTGGTCG GCCAAATTAT TCCGTGGAAC TTTCCGCTGT TGATGTTGGC TTGGAAGATT GCCCCAGCCT TGGCGATGGG CAACACGGTG GTGCTGAAGC CTGCCGAATG GACTTCATTA ACCGCCTTGG CATTTGCCGA AATTTGCCAC GAAATTGGTT TGCCCAAGGG CGTGGTCAAC ATCGTAACTG GCGATGGTAA AGTTGGCGAG CAAATCGTCA AGCACCCCGA TATTGCCAAA ATTGCCTTTA CTGGTTCAAC CGAAGTTGGC AAAATTATTC GGAGCGCCAC CGCTGGCAGC GGCAAAAAAC TCTCCTTGGA GCTTGGCGGC AAATCGCCCT TTATCGTGTT TGATAACGCT GATCTCGATA GCGTGGTCGA AGGCGTGGTT GATGCGATTT GGTTTAATCA AGGCCAAGTT TGTTGCGCTG GCTCACGTTT GTTGGTGCAA GAAAACATTG CCGATAAGCT GATTGGCAAG TTGCGCACTC GTATGGAGCA ATTGCGCATC GGCGATCCTT TAGATAAAGC GATCGATATT GGCGCGATTG TTGCTCCAGC CCAATTACAA AAAATCGAGC AACTGGTGGC CGAAGGCGAA AACGAAGGTT CAATCAAATG GCAACCATCG TGGGCTTGCC CAACTGATGG CTACTTCTAT CCGCCAACCT TGTTTACCAA CGTGGCTCCC GCTTCAACCT TGGCCCAAGT TGAAATTTTC GGGCCAGTCT TGGTCACGAT GACCTTCCGC ACCCCTGATG AAGCGATTGC GATTGCCAAC AATACCCGTT TTGGTTTGGC CGCCAGCATT TGGAGCGAAG ATATTAACGT GGCGCTGCAT GCCGCAGCCC GCGTCAAAGC AGGCGTAGTT TGGATCAACA GCACCAACTT GTTTGATGCA GCAGCTGGCT TCGGCGGCTA TCGTGAAAGT GGCTACGGTC GCGAAGGTGG CAAAGAAGGC TTATACGAAT ATCTCAAAAA ATCTGAGGTT AAAAAGCTTA AAACCAAGGC CAGCCCAGCG CCCGCGCCTG TGGCAACCAC AGCCAGCAAC GGCCTACCAG CGCTTGATCG CACGCCCAAA ATCTATATTG GCGGCAAGCA AGCCCGCCCC GATTCGGGCT ACAGCCGAAT TGTGGTTGGC AGCAATGGCG AGCAGCTTGG CGAAGTTGGC GATGGCAGCC GCAAAGATAT TCGCAACGCG GTCGAAGTTG CACGCAGTGC TGCTAACAGT TGGTCGGCAG CAACCGCCTA TAATCGGGCG CAAGTGCTCT ATTTCTTGGC TGAAAATTTA GGCGCACGTG CCGCCGAATT TGCCCAACGC ATTCGCCAAC AAACAGGCCG CAACGATGCC 
GACCTCGAAG TTGAAACATC AATCGAGCGC TTATTTACCT ACGCCGCGTG GGCCGATAAA TATGATGGCT CGGTGCATGC CACGCCTGTA CGCAACGCCA CCCTCGCCAT GGTCGAATCG CTCGGTGTGC TTGGTTTGGT TTGTCCCAGC GAATATCCCT TGTTGGGCAC GATTTCATTG CTAGCACCAG CCATCGCCTT GGGCAATAGT GCGATTATCA TTCCATCGCC AGAGCATCCA CTTTCAGCCA CCGATTTGTA TCAAGTGCTC GATACCAGCG ATGTGCCGGC AGGCGTGGTC AACATTATCA CCGGCGACCG CGATAGCCTA GCCAAAGTGC TGGCCGAGCA CAACGACGTT GATGGTTTGT GGTATTGGGG CAGTGCTGAG GGTAGCGCCA TGGTCGAGCG CAGTTCAATC GGCAACCTCA AACAAACCTG GGTCAACTAC GGCGAAACTC GCGATTGGCT TGATCGACGA GTTGGCGAGG GCGAAGAATT TCTACGCCAC GCTAGCCAAA TCAAAAATAT TTGGGTTCCC TACGGCGCAT AA
|
Protein sequence | MSYGPAPEAA APAHEWLDAH SRRFGLYING TWTEVANERL FDSINPANRS VLAQVTQASS DEVNAAVAAA KAAFPAWSQT SGHVRARYLY ALARQIQKHS RRFAVLETLD NGKPIRETRD IDIPLVARHF YYHAGWAQLQ ESDLAGYEPL GVVGQIIPWN FPLLMLAWKI APALAMGNTV VLKPAEWTSL TALAFAEICH EIGLPKGVVN IVTGDGKVGE QIVKHPDIAK IAFTGSTEVG KIIRSATAGS GKKLSLELGG KSPFIVFDNA DLDSVVEGVV DAIWFNQGQV CCAGSRLLVQ ENIADKLIGK LRTRMEQLRI GDPLDKAIDI GAIVAPAQLQ KIEQLVAEGE NEGSIKWQPS WACPTDGYFY PPTLFTNVAP ASTLAQVEIF GPVLVTMTFR TPDEAIAIAN NTRFGLAASI WSEDINVALH AAARVKAGVV WINSTNLFDA AAGFGGYRES GYGREGGKEG LYEYLKKSEV KKLKTKASPA PAPVATTASN GLPALDRTPK IYIGGKQARP DSGYSRIVVG SNGEQLGEVG DGSRKDIRNA VEVARSAANS WSAATAYNRA QVLYFLAENL GARAAEFAQR IRQQTGRNDA DLEVETSIER LFTYAAWADK YDGSVHATPV RNATLAMVES LGVLGLVCPS EYPLLGTISL LAPAIALGNS AIIIPSPEHP LSATDLYQVL DTSDVPAGVV NIITGDRDSL AKVLAEHNDV DGLWYWGSAE GSAMVERSSI GNLKQTWVNY GETRDWLDRR VGEGEEFLRH ASQIKNIWVP YGA
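The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is already given in coding orientation (it starts with ATG and ends with TAA, although the gene lies on the minus strand), and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code. It does not handle alternative start codons, which table 11 permits but which do not occur here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order; translation
# table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence and first 10 residues of the protein.
gene_prefix = "ATGAGCTATG GCCCAGCACC CGAAGCTGCT"
protein_prefix = "MSYGPAPEAA"
print(translate(gene_prefix) == protein_prefix)
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing the spaces) extends the same check to the whole record.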