Gene Haur_2608 details

Gene Information

Locus tag: Haur_2608
Symbol:
ID: 5734486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3348248
End bp: 3349243
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 51%
IMG OID: 641279748
Product: aldo/keto reductase
Protein accession: YP_001545374
Protein GI: 159899127
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000730826
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAAGC GCCAATTAGG CCGTGATGGT TTGGTGGTTT CGGCAGCAGG CTTAGGCTGT 
ATGGCAATGT CGGGGATGTA CGGGCCTTCG GATCGTGCCG AGAGTATTGC GACAATTCAT
TCAGCGCTTG ATGCTGGAGT CAATTTGCTG GATACTGGCG ATTTCTATGG GATGGGCCAT
AACGAACTGC TGATTAGTGA AGCCTTGCGT GAGCGTTCAC GCTCCGATGT TGTTTTGAGT
GTCAAATTTG GGGCAATGCG AAGCCCTGAT GGTTCGTGGC TCGGCTACGA TGCTCGGCCA
GCTGCGGTTA AAAATTTCTT ATACCACAGC CTTACACGCT TGAACACCGA TTACATTGAT
ATTTATCGAC CTTCGCGGCT TGACCCGAAT GTACCGATTG AAGAAACAAT TGGGGCGATT
GCCGAGATGG TCGAAAAAGG CTATGTGCGG CATATCGGTT TGTCGGAAGT TGGGGTTGAA
ACGATTCGAC GAGCAGCAGC GGTGCATCCA ATTGTCGATT TGCAAATTGA ATATTCCTTG
ATGTCGCGTG GTATCGAGGC CGAAATTTTG CCTGCTTGTC GCGAATTGGG CATCGGCATC
ACCGCCTATG GAGTGCTTTC GCGTGGCTTG CTAAGCGGTG CTTGGTCGAA AGAACGGGTT
TTGGCTGGCT CAGATTTTCG TTCCCATGGC CCGCGCTTTA CTGGCGAGAA TCTTGATCAT
AATTTAGAAC TGGTTGCAGC CTTGCAAACA ATTGCCGAAG CCAAGGGTGC AAGTATTGCC
CAAATTGCCA GCGCATGGGT AGTGGCGCAA GGAGCCGATA TTATTCCATT GTTCGGGGCA
CGCCGCCTAC ATCAATTGCA CGATTCATTG ACCAGCCTTG ACATTAACTT GAATGCTGAT
GAATTAGGCA TAATTGAGCG GGCGATTCCC AAAGGTGCGG CGGCGGGCGA ACGCTACAAT
GCCTATCTGA TGCAACATTT GGATAGCGAA CGCTAA
 
Protein sequence
MQKRQLGRDG LVVSAAGLGC MAMSGMYGPS DRAESIATIH SALDAGVNLL DTGDFYGMGH 
NELLISEALR ERSRSDVVLS VKFGAMRSPD GSWLGYDARP AAVKNFLYHS LTRLNTDYID
IYRPSRLDPN VPIEETIGAI AEMVEKGYVR HIGLSEVGVE TIRRAAAVHP IVDLQIEYSL
MSRGIEAEIL PACRELGIGI TAYGVLSRGL LSGAWSKERV LAGSDFRSHG PRFTGENLDH
NLELVAALQT IAEAKGASIA QIASAWVVAQ GADIIPLFGA RRLHQLHDSL TSLDINLNAD
ELGIIERAIP KGAAAGERYN AYLMQHLDSE R
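
Consistency check (illustrative)

The coordinates, gene length, and protein length listed in this record can be cross-checked with a little arithmetic. The sketch below is not part of the database record. The function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene ends in a single stop codon that is not counted in the protein length.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases for display."""
    return "".join(raw.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Values copied from the Gene Information fields of this record.
print(span_length(3348248, 3349243))   # 996, matches Gene Length
print(expected_protein_length(996))    # 331, matches Protein Length
```

Passing the full gene sequence from this record to gc_percent gives a value that can be compared with the listed GC content; clean_sequence handles the 10-base grouping and line breaks used in the display.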