Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2504 |
Symbol | |
ID | 5734385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3198184 |
End bp | 3199755 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641279644 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001545270 |
Protein GI | 159899023 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0589644 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCAA TTTTAATCAA CGGCGAATGG CAAATTGGCG ATTACGTTGG TAGTTTTCAT GCGACCAACC CAACCACAAC CCAAGCATTC ACCACCGAAT ATCCGATTTC GGGACGCGGC GATTTAGAAT TAGCCTTGAG CGCTGGGGTT GCCGCCTCGC GTGAGCTAGC CCAAAGTAGC CCTGAGCAAC GCGCTAAATT TCTCGAAGGC TATGCCGATT TGATCGAGGC CAAGCGCCAA GAACTGAGTG CAATCGCCCA TAGCGAAACT GGCCTACCGA TCGAGCCACG CTTGAATAGC GTTGAACTAC CACGCACGGT TAATCAATTG CGCCAAGCGG CTCAAGCAGT GCGCAACCAC ACCTGGGAGC TGGCAACAAT CGACAGCAAA GCCAATATTC GCTCGCTCTA TGGCAGTTTG GCCGCACCAG TCGCCATTTT TGGGCCGAAT AACTTCCCAT TTGCCTATAA TGCGATTGCT GGCAGCGATT TTGCCTCGGC GATTGCTGCG GGAAATGCTG TAATCGCCAA AGGACACCCA AGCCACCCCG CCACCACCGA GTTACTGGCT GAATTGGCGC ATCAAGCAGT CTTGGCGGCT GGTTTGCCTG CGGCAAGTGT GCAATTGCTG TATGGTTTGC CCGATGAATT GGGCTTGGCA TTGGTCAGTC ATCCTTTAAT TGGGGCAATT GGCTTTACTG GTTCGCGCCG AGCAGGATTG ACGCTCAAGG CTGCCGCCGA TCAAGCAGGC AAGGCAATTT ACCTTGAAAT GTCGAGCATC AACCCGGTTG TGATGTTGGC TGGAGCGGTG ACCGAGCGGG CCGAAGCGCT TGCCGCCGAA TTTGCTGGCT CGTGCACTTT GGGCGCTGGC CAATTTTGCA CCAACCCTGG CTTATTAATT TTGGAGCATT CAGCGGCTAG TGCCGATTTT ATTGAAGCAA CCAAAGCCCA CTTTAACCAA CACCCTAGCC TGACCCTGCT CAACCATGGC GTTCTGGCTG GGCTTGAACA GGGCATTGAG CATTTGCAAG ACGCTGGGGC ACAGGTCTTG GTTGGTGGGC ATGTGTTGGA TGATGGGTAT CGCTATGCCA ATAGTTTGCT CTACGTTGCG GGCAATAGCT TTCTGGCCAA TCCCCAAGCG CTCAGCCATG AAGTCTTTGG GCCAGTCAGC TTGATTGTTG AATGTGCCGA CCAAGCCGAA GTATTGCGAG TGCTCGACTG CCTTGAGGGC AATTTGACGG GCAGCATCTA CAGCAGCAGC AACGGGGCAG ATGAAACCTT TTATCAAATT GTTGCCGAGC GCTTACGCTC CAAAGTTGGG CGTTTGCTCA ACGATAAAAT GCCCACAGGT GTGGCCGTCA GCTCAGCCAT GAATCATGGC GGGCCGTATC CCGCGACAGG CCATGCAGGC TGGACGGCAG TGGGCTTTCC CGCAACGATT CGGCGTTTTG CGGCGCTCCA CTGCTACGAC AACGTGCGCG AATCACGCTT GCCAAGCATT CTGCAAAACG CCAATCCGCA AGCGATTTGG CGCTTGGTTG ATGGCCAGTG GAGCAATGCT GGCATTGATT AG
|
Protein sequence | MKPILINGEW QIGDYVGSFH ATNPTTTQAF TTEYPISGRG DLELALSAGV AASRELAQSS PEQRAKFLEG YADLIEAKRQ ELSAIAHSET GLPIEPRLNS VELPRTVNQL RQAAQAVRNH TWELATIDSK ANIRSLYGSL AAPVAIFGPN NFPFAYNAIA GSDFASAIAA GNAVIAKGHP SHPATTELLA ELAHQAVLAA GLPAASVQLL YGLPDELGLA LVSHPLIGAI GFTGSRRAGL TLKAAADQAG KAIYLEMSSI NPVVMLAGAV TERAEALAAE FAGSCTLGAG QFCTNPGLLI LEHSAASADF IEATKAHFNQ HPSLTLLNHG VLAGLEQGIE HLQDAGAQVL VGGHVLDDGY RYANSLLYVA GNSFLANPQA LSHEVFGPVS LIVECADQAE VLRVLDCLEG NLTGSIYSSS NGADETFYQI VAERLRSKVG RLLNDKMPTG VAVSSAMNHG GPYPATGHAG WTAVGFPATI RRFAALHCYD NVRESRLPSI LQNANPQAIW RLVDGQWSNA GID
|
| |