Gene Information |
Locus tag | Haur_3788 |
Symbol | |
ID | 5735652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4756620 |
End bp | 4758152 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280940 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001546552 |
Protein GI | 159900305 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
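The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes 1-based, inclusive coordinates and an unspliced CDS whose last codon is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the final codon is a
    stop codon, so it does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the stop codon
    return gene_len, protein_len


# Start bp and End bp as listed for Haur_3788
print(cds_lengths(4756620, 4758152))
```

For this record the result agrees with the listed Gene Length (1533 bp) and Protein Length (510 aa).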
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCAAT CAGATAGCCT TCGCTCGTTT GGTCTGTATA TTGACGGAGC TTGGGTCGCG GCCAGCGATG CTGCTAGCGA AACCCTGTAC AACCCCGCGA CGGGCGAGCC AGTTGCCCAA GTAGCGCGGG CCACCATTCA CGACATTGAT CGAGCGGTTG GAGCTGCGCG GAAGAGTTTC GATATTGGTT CGTGGGCGCA AATGCGGCCT GTTGATCGCG CTAAAACGAT CGAGGCGATT GCCGATTTGC TCGAAGAAAA CACCGACGAA TTGGCCGAGC TTGAGACGCT CAATGGTGGC GCAACTCTGC GTAAAAGTTC ATGGCTCGAT ATTCCGGTAG GTATCGAGCA TTTGCGCTAT TTTGCCGATT TGGCGCGGCA ACACCCCATG CAAACCTTGC CCTATATCGA TTTTCCGTCG CCGAGTGCTA ATGCGGTTTG GCGCGAGCCG ATTGGGGTTT GTGGCCAAAT TATCCCTTGG AACTACCCAT TTTTGATGGC AATTTGGAAG ATTGGCCCAG CCTTGGCCGC TGGCAATAGC CTGGTGCTCA AGCCAGCCTC GTTGACTCCA GTTACCGCTT TGCGTATGGC CGAATTGATT CATGAAGCCG ATTTGTTGCC GCATGGCGTT TTCAATGTGG TAACTGGGCC TGGCGGTTTG GTGGGCGAAC GCCTGACCAG CCATCCTGCG GTCGATAAAA TTGCTTTTAC TGGCTCGACC GAGGTTGGCC GCCGCATTGC CGAAGTCGCT GGGCGTAATC TCAAGCGCGT TACCTTGGAG CTTGGTGGCA AATCGCCAGT GGTTGTTTTG CCCAATGCTG ATCTTGATTT GGCGGTTGAT GGGGCGATTT GGGCGGCCTT TATGCATTCA GGCCAAAGCT GCGAGGCTGG CACACGCTTG CTCTTGCCCG ATTCGCTGCA CGATCAATTT GTTGAGCGGA TGGTCGCACG AGTTGAACAA TTAGTGTTGG GTGATCCGCT TGATCTGACA ACTGATTTAG GGCCGTTGGT TTCAGCTGCT CAAAAACGTG CAGTCGAGGC CTATATCGAG CTGGGGATTC AAGAAGGCGC TACCTTGCGC TGCGGTGGAG TGGGCATCGA TGATCCCAAT CTGGCCAATG GCCATTTTGT GCGACCCACG ATCTTTACCA ATGTGCATAA CCAGATGCGG ATTGCTCAAG AAGAAATTTT TGGGCCAGTC CTTTCGGTAA TTCGCTATCA TACAGTTGGT GAGGCGATTA CGCTTGCCAA TGATACCAAC TATGGCTTGG CAGCCAGCGT GTGGAGCCGC GATTTACAAG ATGCCCAAGA GGTGGCGAGG GCAATTCGGG CTGGCACGGT TTGGATCAAT GATCATCACC TGATCAATGC CAAAGCGCCA TTTGGGGGCT ACAAAGATAG CGGAATTGGC CGCGAGTTGG GGCCGAATGC GCTTGATGCC TATAGCGAAA TCAAGCATAT TCATACCGAC TTGACCCAAG AACGCACCCG CCGCATTTGG GTCGATATCG TTACGCCACG GCTTGATGAT TAA
|
Protein sequence | MTQSDSLRSF GLYIDGAWVA ASDAASETLY NPATGEPVAQ VARATIHDID RAVGAARKSF DIGSWAQMRP VDRAKTIEAI ADLLEENTDE LAELETLNGG ATLRKSSWLD IPVGIEHLRY FADLARQHPM QTLPYIDFPS PSANAVWREP IGVCGQIIPW NYPFLMAIWK IGPALAAGNS LVLKPASLTP VTALRMAELI HEADLLPHGV FNVVTGPGGL VGERLTSHPA VDKIAFTGST EVGRRIAEVA GRNLKRVTLE LGGKSPVVVL PNADLDLAVD GAIWAAFMHS GQSCEAGTRL LLPDSLHDQF VERMVARVEQ LVLGDPLDLT TDLGPLVSAA QKRAVEAYIE LGIQEGATLR CGGVGIDDPN LANGHFVRPT IFTNVHNQMR IAQEEIFGPV LSVIRYHTVG EAITLANDTN YGLAASVWSR DLQDAQEVAR AIRAGTVWIN DHHLINAKAP FGGYKDSGIG RELGPNALDA YSEIKHIHTD LTQERTRRIW VDIVTPRLDD
|
| |
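The protein sequence can be derived from the gene sequence by codon translation. The following is a minimal sketch, not the database's own pipeline: it uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, third base varying fastest,
# paired with the standard amino acid assignments ('*' = stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above
print(translate("ATGACGCAAT CAGATAGCCT TCGCTCGTTT"))  # MTQSDSLRSF
```

Whitespace is stripped before translation, so the sequence can be pasted in the 10-base blocks used in this record.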