Gene Haur_1504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1504 
Symbol 
ID5733389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1753209 
End bp1754210 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content52% 
IMG OID641278642 
Productalcohol dehydrogenase 
Protein accessionYP_001544276 
Protein GI159898029 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.139822 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGCAA TTATCTATCG TGAATATGGC TCAGCCGATG TGTTGCGGCT CAGCGAAGTA 
GCTCGCCCAA CCCCTGCTGA AAATCAAGTT TTAATTAAAG TACATATGAC AGCCCTTAAC
GCCGCCGATT GGCGCTTGAT GTCTGGTAAA CCCTTTCCTG TACGCTTTAT GACGGGCTTA
TTCAAACCCA AAAAAGGTAT TCCTGGCACC GATGTGGCTG GAGTTATCGA AGCCGTCGGG
CGTAACGTCA CTCAATTTAA AGTCGGCGAT GCGGTGTTTG GCGATCTTTC GGGCTGCGGA
GCTGGTGGCT TAGGCCAATA TGTTTGTGCC CCCGAACATG TGCTGGTGCT CAAGCCCGAG
CAGCTAAGTT TTGAACAAGC TGCTGCCGCG CCCATGGCCG CGGTCACAGC CTTGCAAGGC
CTGCGCCAAG GCGGCATCGC CGCAGGCCAA AAGGTCTTGA TTTATGGAGC TTCAGGTGGA
ATTGGCACAT TTGCGGTGCA GCTTGCTAAA CATTTTGGCG CAATCGTTAC CGCTGTTTCC
AGTGCCGCCA AGCACGATTT GCTACGTTCG CTCGGCGCTG ATCAGGTGCT GGATTATGCT
AAGGATGATT TTGCTCGCAA TGGTCAGCTG TATGATCTGA TTTTGGGGGT CAATGGTCAT
CGCTCAATTT TCGACTATAA ACGCAGTTTA GCGCCTCAAG GTCGCTATGT GATGGTTGGC
GGCGAAATGA GCCAGATTTT TCAGGCGATC GCCTTGGGCA AATTGCTCTC AATTGGCAGC
CAAAAACAGC TGAGTAACCT GTTCGCCAAG CCCAACCAAA CCGATCTCGC CAAAATTGGC
TTTTTGCTGG CCAACGGCGA TATCAAAGCG GTGATCGATC AGCGCTACCC GCTGGAAGAA
GCTCCTGCCG CAATGCGTTA TCTCCAAGCT GGCCATGCCA AAGGCAAAAT TATGATCGAA
TTGCAACCCA CCACAGCTCA AAGCTTGGAG CAAACCGTAT GA
 
Protein sequence
MQAIIYREYG SADVLRLSEV ARPTPAENQV LIKVHMTALN AADWRLMSGK PFPVRFMTGL 
FKPKKGIPGT DVAGVIEAVG RNVTQFKVGD AVFGDLSGCG AGGLGQYVCA PEHVLVLKPE
QLSFEQAAAA PMAAVTALQG LRQGGIAAGQ KVLIYGASGG IGTFAVQLAK HFGAIVTAVS
SAAKHDLLRS LGADQVLDYA KDDFARNGQL YDLILGVNGH RSIFDYKRSL APQGRYVMVG
GEMSQIFQAI ALGKLLSIGS QKQLSNLFAK PNQTDLAKIG FLLANGDIKA VIDQRYPLEE
APAAMRYLQA GHAKGKIMIE LQPTTAQSLE QTV