Gene Haur_2504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2504 
Symbol 
ID5734385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3198184 
End bp3199755 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content54% 
IMG OID641279644 
Productaldehyde dehydrogenase 
Protein accessionYP_001545270 
Protein GI159899023 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0589644 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCAA TTTTAATCAA CGGCGAATGG CAAATTGGCG ATTACGTTGG TAGTTTTCAT 
GCGACCAACC CAACCACAAC CCAAGCATTC ACCACCGAAT ATCCGATTTC GGGACGCGGC
GATTTAGAAT TAGCCTTGAG CGCTGGGGTT GCCGCCTCGC GTGAGCTAGC CCAAAGTAGC
CCTGAGCAAC GCGCTAAATT TCTCGAAGGC TATGCCGATT TGATCGAGGC CAAGCGCCAA
GAACTGAGTG CAATCGCCCA TAGCGAAACT GGCCTACCGA TCGAGCCACG CTTGAATAGC
GTTGAACTAC CACGCACGGT TAATCAATTG CGCCAAGCGG CTCAAGCAGT GCGCAACCAC
ACCTGGGAGC TGGCAACAAT CGACAGCAAA GCCAATATTC GCTCGCTCTA TGGCAGTTTG
GCCGCACCAG TCGCCATTTT TGGGCCGAAT AACTTCCCAT TTGCCTATAA TGCGATTGCT
GGCAGCGATT TTGCCTCGGC GATTGCTGCG GGAAATGCTG TAATCGCCAA AGGACACCCA
AGCCACCCCG CCACCACCGA GTTACTGGCT GAATTGGCGC ATCAAGCAGT CTTGGCGGCT
GGTTTGCCTG CGGCAAGTGT GCAATTGCTG TATGGTTTGC CCGATGAATT GGGCTTGGCA
TTGGTCAGTC ATCCTTTAAT TGGGGCAATT GGCTTTACTG GTTCGCGCCG AGCAGGATTG
ACGCTCAAGG CTGCCGCCGA TCAAGCAGGC AAGGCAATTT ACCTTGAAAT GTCGAGCATC
AACCCGGTTG TGATGTTGGC TGGAGCGGTG ACCGAGCGGG CCGAAGCGCT TGCCGCCGAA
TTTGCTGGCT CGTGCACTTT GGGCGCTGGC CAATTTTGCA CCAACCCTGG CTTATTAATT
TTGGAGCATT CAGCGGCTAG TGCCGATTTT ATTGAAGCAA CCAAAGCCCA CTTTAACCAA
CACCCTAGCC TGACCCTGCT CAACCATGGC GTTCTGGCTG GGCTTGAACA GGGCATTGAG
CATTTGCAAG ACGCTGGGGC ACAGGTCTTG GTTGGTGGGC ATGTGTTGGA TGATGGGTAT
CGCTATGCCA ATAGTTTGCT CTACGTTGCG GGCAATAGCT TTCTGGCCAA TCCCCAAGCG
CTCAGCCATG AAGTCTTTGG GCCAGTCAGC TTGATTGTTG AATGTGCCGA CCAAGCCGAA
GTATTGCGAG TGCTCGACTG CCTTGAGGGC AATTTGACGG GCAGCATCTA CAGCAGCAGC
AACGGGGCAG ATGAAACCTT TTATCAAATT GTTGCCGAGC GCTTACGCTC CAAAGTTGGG
CGTTTGCTCA ACGATAAAAT GCCCACAGGT GTGGCCGTCA GCTCAGCCAT GAATCATGGC
GGGCCGTATC CCGCGACAGG CCATGCAGGC TGGACGGCAG TGGGCTTTCC CGCAACGATT
CGGCGTTTTG CGGCGCTCCA CTGCTACGAC AACGTGCGCG AATCACGCTT GCCAAGCATT
CTGCAAAACG CCAATCCGCA AGCGATTTGG CGCTTGGTTG ATGGCCAGTG GAGCAATGCT
GGCATTGATT AG
 
Protein sequence
MKPILINGEW QIGDYVGSFH ATNPTTTQAF TTEYPISGRG DLELALSAGV AASRELAQSS 
PEQRAKFLEG YADLIEAKRQ ELSAIAHSET GLPIEPRLNS VELPRTVNQL RQAAQAVRNH
TWELATIDSK ANIRSLYGSL AAPVAIFGPN NFPFAYNAIA GSDFASAIAA GNAVIAKGHP
SHPATTELLA ELAHQAVLAA GLPAASVQLL YGLPDELGLA LVSHPLIGAI GFTGSRRAGL
TLKAAADQAG KAIYLEMSSI NPVVMLAGAV TERAEALAAE FAGSCTLGAG QFCTNPGLLI
LEHSAASADF IEATKAHFNQ HPSLTLLNHG VLAGLEQGIE HLQDAGAQVL VGGHVLDDGY
RYANSLLYVA GNSFLANPQA LSHEVFGPVS LIVECADQAE VLRVLDCLEG NLTGSIYSSS
NGADETFYQI VAERLRSKVG RLLNDKMPTG VAVSSAMNHG GPYPATGHAG WTAVGFPATI
RRFAALHCYD NVRESRLPSI LQNANPQAIW RLVDGQWSNA GID