Gene Haur_3226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3226 
Symbol 
ID5735094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4083664 
End bp4084875 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content51% 
IMG OID641280372 
Product2-alkenal reductase 
Protein accessionYP_001545991 
Protein GI159899744 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAAC GCAGCATGCG TATTTTGATT CCAGCATTTC TTATTCCACT AGGCCTTTGT 
ATTGTGGCAA GCCTGTTGAT TTGGATCTTC GCCCCAGCTA ACGATGCTGC TGGAGAAAGC
TCAAATAACG ATTCAACTCT TTCAGTTGCG CCAACCACAG CTCCAGTTAG CGCTTTGAGC
GAAGCACCGG GCAATGGCTT GAGCGACGAA CAAGCGCGGA TCGATTTGTA TAAGCGAGTT
GGCCCAGCGG TTGTCAGCAT CGATACCGAA GTTACCGGTG AAGGTAGCGA GGCGGCAACT
GGCGAGGCGC TTGGCTCAGG CTTTTTGGTC GATGATCAAG GCCATATTGC TACCAACAAC
CATGTAATCG AAGGTGCAAC CCGTATTTTC GTCACTTTTG CTGATGGCCG CCAAGTGCCA
GCCACTCTGC GCGGAACCGA TGAAGATAAC GATATTGCGG TGATTAAGGT TGATGCCGCA
GCCGTGAGCA AGATTGCGCC GATGGTATTT GGCAACTCAC GCGAAGTTCA AGTTGGTCAA
GATACGATTG CAATTGGCAA TCCGTTTGGC TTGCAAAATA CCATGACTTT GGGGATTGTG
AGCGCGGTCG AAGGGCGTTC GTTGCCAGGC CGTACCCTTG CTAATGGTGG TCAATTTCGC
ATCAGCCGAA TTATTCAGAC CGATGCGGCG ATCAACCCAG GCAATAGCGG TGGGCCATTG
CTCAATAGTA AAGGCGAAGT GATTGGGATT AACACCGCGA TTCGGGTGAG TGACCCAACT
GCTGCACCAG CCTTTGCCGG AGTTGGTTAT GCCGTGCCCG CCAACACGGT CAAAGTGATC
GTCGAAGATT TGATTAAAAC TGGCAAGCAC GATTCAGCCT ATCTTGGCGT GAGCATGCTA
ACGATCAGCG CTCAATTGGC GCAAGAATTG AAATTACCAG TCAGCCAAGG CGCGTTGGTG
ACCAATGTAG TCGTTGATGG TCCCGCCGAT CAAGCTGGAA TTCGCTTGGG TACAACCAGT
ATCGAAGTTG ATGGTGCTGC ATTAATTATT GATAGCGACA TCGTGACTGC CTTCAACGGC
GAAACGATTC GCTCCAGTGA CGATTTGATT GCCCATATCA ACGATTCGCG AGTTGGCGAT
AAAGTCATTT TGACGATCAT ACGGGGTGAT GAAGAGCAAA CGATCGAGGT AACCTTGGGG
GCACGCCCTT AA
 
Protein sequence
MKQRSMRILI PAFLIPLGLC IVASLLIWIF APANDAAGES SNNDSTLSVA PTTAPVSALS 
EAPGNGLSDE QARIDLYKRV GPAVVSIDTE VTGEGSEAAT GEALGSGFLV DDQGHIATNN
HVIEGATRIF VTFADGRQVP ATLRGTDEDN DIAVIKVDAA AVSKIAPMVF GNSREVQVGQ
DTIAIGNPFG LQNTMTLGIV SAVEGRSLPG RTLANGGQFR ISRIIQTDAA INPGNSGGPL
LNSKGEVIGI NTAIRVSDPT AAPAFAGVGY AVPANTVKVI VEDLIKTGKH DSAYLGVSML
TISAQLAQEL KLPVSQGALV TNVVVDGPAD QAGIRLGTTS IEVDGAALII DSDIVTAFNG
ETIRSSDDLI AHINDSRVGD KVILTIIRGD EEQTIEVTLG ARP