Gene Haur_0347 details

Gene Information

Locus tag: Haur_0347
Symbol:
ID: 5732257
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 418341
End bp: 419447
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 50%
IMG OID: 641277470
Product: 2-alkenal reductase
Protein accession: YP_001543126
Protein GI: 159896879
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
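
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names gene_length and protein_length are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected residues for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length(418341, 419447)
print(length)                  # → 1107
print(protein_length(length))  # → 368
```

Both results agree with the Gene Length and Protein Length fields of this record.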


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000181204
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAATC CCAAACGTAC CGTCGGCATC GGCGGATTGA TTATTGCCGC CCTCGCACTT 
TCAGGGGTTG GTGGTGGTTT AGTCGGTGGT GTCGCAGGCT ATCGCTTAGC CCAAGTTGAT
CAAACGGCAA CGGCAGTGGC GGTCACGCCA ACCAGCGCCA CCCAACAAAC GACCGATAAT
CTGACTGCGC AACCAGTTGT TGCCAGCACG AATGATGCAG TTGTCCAAAC CGTTGCCAAG
GCCTCACCAG CAGTCGTAAA AATTATTAGT ACGGTGGTTG GCGGTCAGGC CAGCGGTTCA
GGCGCAATTA TCAATGAAGA TGGCTATATC ATCACCAATA ATCATGTGGT CGAAGGTCAA
AGTCGCTTGC AAGTAATTTA TTCTGATGGA ACCAGCCACA ATGCCGAGTT GATCGGCACT
GATGCTTTTT CGGATATCGC CGTGATTCGG GTGCTCGATG CCGTGCCAGC CACAATTTCC
TTAGGCGATT CGGATAGTTT GCAGCCTGGC GAAACGGTCG TGGCCATTGG CAGTCCGCTT
GGTAAATTCC AAAATAGTGT GACGGTTGGA GTGGTCAGCG CTCTTGACCG CACAATTGAT
AGCATGGAAG GCCTAATTCA AACTGACGCA GCGATTAACC ATGGCAATAG TGGCGGCCCA
CTGATCAACC TCAAAGGCGA AATTGTTGGG ATCAATACGT TGGTTGTGCG CGGTGATATT
GGTAGTATCG ATGAAGCTCA AGGCCTAGGT TTTGCTGTCC CATCAAATAT TGTGCGCGAA
GTCAGCGATG CTTTGATTGC CAATGGCCAA GTTATTCGAC CATATATAGG CATTCGCTAC
GAATTGCTTT CGCCCGAAAC CGCCGAACTG GGGATTGCCA ATGATAAAGG TGCTTTCGTG
ACCAATGTTG ATGAAGGTAC ACCTGCTCGG CGAGCTGGCA TTAGCCGTGG CGATATTATT
CTCGCAGTCA ACGGCGAAGA AATTACTCAA CGCCACTCAT TGCAGCGCCT GCTGTTGCAA
TATAAACCAG GCGATACCGT AACGGTCACA ATCGAGCGTA ATGACCAACA ACAGGAAGTG
CAAATTACCC TCGCTGAACG CCCATAA
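
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be joined before it can be analysed. The following is a minimal sketch of how the blocks could be merged and a GC fraction computed; clean_sequence and gc_fraction are hypothetical helper names, and only the first displayed line is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, used here only as sample input.
first_line = "ATGAGTAATC CCAAACGTAC CGTCGGCATC GGCGGATTGA TTATTGCCGC CCTCGCACTT"
seq = clean_sequence(first_line)
print(len(seq))                          # number of bases in the sample line
print(round(gc_fraction(seq) * 100, 1))  # GC percentage of the sample line
```

Applying the same two functions to the full sequence would give a value comparable to the GC content field of this record.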
 
Protein sequence
MSNPKRTVGI GGLIIAALAL SGVGGGLVGG VAGYRLAQVD QTATAVAVTP TSATQQTTDN 
LTAQPVVAST NDAVVQTVAK ASPAVVKIIS TVVGGQASGS GAIINEDGYI ITNNHVVEGQ
SRLQVIYSDG TSHNAELIGT DAFSDIAVIR VLDAVPATIS LGDSDSLQPG ETVVAIGSPL
GKFQNSVTVG VVSALDRTID SMEGLIQTDA AINHGNSGGP LINLKGEIVG INTLVVRGDI
GSIDEAQGLG FAVPSNIVRE VSDALIANGQ VIRPYIGIRY ELLSPETAEL GIANDKGAFV
TNVDEGTPAR RAGISRGDII LAVNGEEITQ RHSLQRLLLQ YKPGDTVTVT IERNDQQQEV
QITLAERP
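
The protein sequence can be reproduced from the gene sequence using the translation table named in the record (table 11). The following is a minimal sketch, assuming the sequence is given on the coding strand and in frame; translate is a hypothetical helper. Table 11 assigns the same amino acids to internal codons as the standard code and differs in the permitted start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codons in TCAG order, paired with the amino acids of NCBI translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 27 bases of the gene sequence in this record.
print(translate("ATGAGTAATC CCAAACGTAC CGTCGGCATC"))  # → MSNPKRTVGI
print(translate("CAAATTACCC TCGCTGAACG CCCATAA"))     # → QITLAERP
```

The two outputs match the first block and the final block of the protein sequence shown above.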