Gene Haur_3894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3894 
Symbol 
ID5735755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4886837 
End bp4887775 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content52% 
IMG OID641281045 
Productalcohol dehydrogenase 
Protein accessionYP_001546656 
Protein GI159900409 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00052515 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACAA TGCAGGCAAT TCAAATTAAT CGGTATGGCG GCGTTGACGA ATTAGTGCAG 
GCAGAAATTG CGCGGCCTGA GCCTACCGCC GATCAAGTGT TAATTAAAGT TCATGCAATT
GGGATTAATC CGGTTGATTA TAAAGTGCGC TCAGGTATGA TCGCCTTTTT CGATGAGCAA
GCGTTTCCGG TGGCGCTTGG TTGGGATGTG GCGGGCACAA TTGCTGCGGT TGGCTCAAAT
GTTAGCCAAT GGAAGCTTGG CGATGAAGTC TATGGCATGG TCAATTTTCC CACTCCTAGC
GGTGCCTATG CTGAATATGT GGTAGCTCCA GCGCTTGAAG TTGCTGCCAA ACCCAAAAGC
CTGAGCTTTG CAGAAGCCGC AGCTGTGCCG TTGGTAGCCT TGACTGCTTG GCAGGCCTTT
GATCTGGTTG GTTTGCAGGC TGGCGATCGC GTGCTGGTGC ATGCAGCGGC TGGCGGCGTT
GGTCATGTGG CGGTGCAATT GGCTAAATTA CGCGGCGCTC ATGTGATTGC AACGGCCTCG
GCGCGGAATG AAGGCTTTGT GCGCGAATTG GGCGTTGATC AATTTGTTGA TTACACTGCT
GCGCCGTTTG AGCAACAAAT CGAACCAGTT GATGTTGTGT TTGATACCGT TGGCGGCGAA
GTCCAAGCGC GTTCGTATGC GGTGTTAAAA CCTCAAGCTG GATTGGTAAC GATTGTTGGT
TCACCGCCCG CCGATTTGGC CGCAGCCCAT GCTGGCAAAA GCTTGAACCA TCTTGTTCAG
GCCAATCAAG CCCAATTAAC CGAGATTGCC AACTTGATCG ATAGCCAAAA ATTGCGGGTG
GAAGTCGAAC AAGTCTACGA TTTCACGGCC ATGGCAGCGG CCCACGAACG CATGCAAAGT
AGTCGGGTAC GCGGCAAAAT TGTGGTCAAA GTGAGCTAA
 
Protein sequence
MTTMQAIQIN RYGGVDELVQ AEIARPEPTA DQVLIKVHAI GINPVDYKVR SGMIAFFDEQ 
AFPVALGWDV AGTIAAVGSN VSQWKLGDEV YGMVNFPTPS GAYAEYVVAP ALEVAAKPKS
LSFAEAAAVP LVALTAWQAF DLVGLQAGDR VLVHAAAGGV GHVAVQLAKL RGAHVIATAS
ARNEGFVREL GVDQFVDYTA APFEQQIEPV DVVFDTVGGE VQARSYAVLK PQAGLVTIVG
SPPADLAAAH AGKSLNHLVQ ANQAQLTEIA NLIDSQKLRV EVEQVYDFTA MAAAHERMQS
SRVRGKIVVK VS