Gene Haur_3788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3788 
Symbol 
ID5735652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4756620 
End bp4758152 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content54% 
IMG OID641280940 
Productaldehyde dehydrogenase 
Protein accessionYP_001546552 
Protein GI159900305 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAAT CAGATAGCCT TCGCTCGTTT GGTCTGTATA TTGACGGAGC TTGGGTCGCG 
GCCAGCGATG CTGCTAGCGA AACCCTGTAC AACCCCGCGA CGGGCGAGCC AGTTGCCCAA
GTAGCGCGGG CCACCATTCA CGACATTGAT CGAGCGGTTG GAGCTGCGCG GAAGAGTTTC
GATATTGGTT CGTGGGCGCA AATGCGGCCT GTTGATCGCG CTAAAACGAT CGAGGCGATT
GCCGATTTGC TCGAAGAAAA CACCGACGAA TTGGCCGAGC TTGAGACGCT CAATGGTGGC
GCAACTCTGC GTAAAAGTTC ATGGCTCGAT ATTCCGGTAG GTATCGAGCA TTTGCGCTAT
TTTGCCGATT TGGCGCGGCA ACACCCCATG CAAACCTTGC CCTATATCGA TTTTCCGTCG
CCGAGTGCTA ATGCGGTTTG GCGCGAGCCG ATTGGGGTTT GTGGCCAAAT TATCCCTTGG
AACTACCCAT TTTTGATGGC AATTTGGAAG ATTGGCCCAG CCTTGGCCGC TGGCAATAGC
CTGGTGCTCA AGCCAGCCTC GTTGACTCCA GTTACCGCTT TGCGTATGGC CGAATTGATT
CATGAAGCCG ATTTGTTGCC GCATGGCGTT TTCAATGTGG TAACTGGGCC TGGCGGTTTG
GTGGGCGAAC GCCTGACCAG CCATCCTGCG GTCGATAAAA TTGCTTTTAC TGGCTCGACC
GAGGTTGGCC GCCGCATTGC CGAAGTCGCT GGGCGTAATC TCAAGCGCGT TACCTTGGAG
CTTGGTGGCA AATCGCCAGT GGTTGTTTTG CCCAATGCTG ATCTTGATTT GGCGGTTGAT
GGGGCGATTT GGGCGGCCTT TATGCATTCA GGCCAAAGCT GCGAGGCTGG CACACGCTTG
CTCTTGCCCG ATTCGCTGCA CGATCAATTT GTTGAGCGGA TGGTCGCACG AGTTGAACAA
TTAGTGTTGG GTGATCCGCT TGATCTGACA ACTGATTTAG GGCCGTTGGT TTCAGCTGCT
CAAAAACGTG CAGTCGAGGC CTATATCGAG CTGGGGATTC AAGAAGGCGC TACCTTGCGC
TGCGGTGGAG TGGGCATCGA TGATCCCAAT CTGGCCAATG GCCATTTTGT GCGACCCACG
ATCTTTACCA ATGTGCATAA CCAGATGCGG ATTGCTCAAG AAGAAATTTT TGGGCCAGTC
CTTTCGGTAA TTCGCTATCA TACAGTTGGT GAGGCGATTA CGCTTGCCAA TGATACCAAC
TATGGCTTGG CAGCCAGCGT GTGGAGCCGC GATTTACAAG ATGCCCAAGA GGTGGCGAGG
GCAATTCGGG CTGGCACGGT TTGGATCAAT GATCATCACC TGATCAATGC CAAAGCGCCA
TTTGGGGGCT ACAAAGATAG CGGAATTGGC CGCGAGTTGG GGCCGAATGC GCTTGATGCC
TATAGCGAAA TCAAGCATAT TCATACCGAC TTGACCCAAG AACGCACCCG CCGCATTTGG
GTCGATATCG TTACGCCACG GCTTGATGAT TAA
 
Protein sequence
MTQSDSLRSF GLYIDGAWVA ASDAASETLY NPATGEPVAQ VARATIHDID RAVGAARKSF 
DIGSWAQMRP VDRAKTIEAI ADLLEENTDE LAELETLNGG ATLRKSSWLD IPVGIEHLRY
FADLARQHPM QTLPYIDFPS PSANAVWREP IGVCGQIIPW NYPFLMAIWK IGPALAAGNS
LVLKPASLTP VTALRMAELI HEADLLPHGV FNVVTGPGGL VGERLTSHPA VDKIAFTGST
EVGRRIAEVA GRNLKRVTLE LGGKSPVVVL PNADLDLAVD GAIWAAFMHS GQSCEAGTRL
LLPDSLHDQF VERMVARVEQ LVLGDPLDLT TDLGPLVSAA QKRAVEAYIE LGIQEGATLR
CGGVGIDDPN LANGHFVRPT IFTNVHNQMR IAQEEIFGPV LSVIRYHTVG EAITLANDTN
YGLAASVWSR DLQDAQEVAR AIRAGTVWIN DHHLINAKAP FGGYKDSGIG RELGPNALDA
YSEIKHIHTD LTQERTRRIW VDIVTPRLDD