Gene Haur_3830 details


Gene Information

Locus tag: Haur_3830
Symbol:
ID: 5735694
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4809220
End bp: 4810077
Gene Length: 858 bp
Protein Length: 285 aa
Translation table: 11
GC content: 49%
IMG OID: 641280982
Product: short-chain dehydrogenase/reductase SDR
Protein accession: YP_001546594
Protein GI: 159900347
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID:
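
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the terminal stop codon


# Values copied from the record for Haur_3830.
length = gene_length_bp(4809220, 4810077)
print(length, protein_length_aa(length))
```

With the values from this record, the result agrees with the listed Gene Length (858 bp) and Protein Length (285 aa).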


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000468983
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGGTA AAGTTTGTTT GGTCACCGGG GCTACTGGTG GTATCGGCAA GGTAACAGCC 
TTAGAACTAG CGCGTCAAGG CGCGACGGTC GTGATTATTG GGCGGCATCC GTTGCGCACC
GAAGGCGTTA CCGAGATGAT CAAACGCGAA ACTGGCAATC CCAATGTTAG CTATATTTTG
GCCGATTTAT CCAAGCAAGT TGATGTACGC AGGGCTGCCG CCGAATTTTT GAGCAAACAT
CATCAATTGC ATATTTTGGT CAATAATGCT GGAGCATTTT ATAACTCACG CCAAGAAAGT
GCCGATGGCA TCGAATTAAC CATGGCGCTC AATCATTTAG CCTACTTTCT ATTAACCGAT
TTGCTGTTGG ATACAATCAA AGCTAGCGCT CCTGCCCGAA TTATCAATGT TTCTTCTGAG
GCTCATCGGA TGGGCGCGAT GGATTTCAAC GATCTTGAAG GCAAGCGCAA ATGGGGTGGC
TGGCGCATGT ATGGTCGTTC CAAATTGGCC AATATTCTAT TTACCAAAGA GCTTGCCCGC
CGTTTGGCAG GCACCGATGT CACGGTCAAC TGCTTGCATC CAGGCGTGGT TTCAACCGGC
TTTGCTGCCA ACAACGGATT CTTTGGCATT GCCATGCGCA AATTAATGGA TCTTGGCTCG
ATCAGCGCCG AAAAAGGGGC CGAAACCACC TTGTATCTTG CCACATCGCA CGAAGTTGAG
CATCTCACGG GCTTGTATTT CGATAAAAAG AAACCACGTG AATCCAGCCC AGCCTCGCAC
GATCAAACTG CCGCCGAAAA GCTCTGGCAG TGGAGCGAAA CCAAAGTTAA GCAAATCAAC
GAGCAACAAG CAGCGTAA
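
The gene sequence is displayed in blocks of 10 bases, so the whitespace has to be removed before computing any statistic. Below is a minimal sketch of that step and of a GC fraction calculation. The helper names are hypothetical, and the example uses only the first line of the listing to stay short. The record's 49% GC value refers to the full 858 bp sequence, not to this excerpt.

```python
def normalize_sequence(blocked: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence listing for Haur_3830.
first_line = "ATGAAAGGTA AAGTTTGTTT GGTCACCGGG GCTACTGGTG GTATCGGCAA GGTAACAGCC"
seq = normalize_sequence(first_line)
print(len(seq), gc_fraction(seq))
```

Passing the complete listing to `normalize_sequence` should give a string of 858 bases if the record is internally consistent.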
 
Protein sequence
MKGKVCLVTG ATGGIGKVTA LELARQGATV VIIGRHPLRT EGVTEMIKRE TGNPNVSYIL 
ADLSKQVDVR RAAAEFLSKH HQLHILVNNA GAFYNSRQES ADGIELTMAL NHLAYFLLTD
LLLDTIKASA PARIINVSSE AHRMGAMDFN DLEGKRKWGG WRMYGRSKLA NILFTKELAR
RLAGTDVTVN CLHPGVVSTG FAANNGFFGI AMRKLMDLGS ISAEKGAETT LYLATSHEVE
HLTGLYFDKK KPRESSPASH DQTAAEKLWQ WSETKVKQIN EQQAA
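
The protein sequence can be related to the gene sequence by translating codon by codon. The sketch below is a pure-Python illustration, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in their permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listing for Haur_3830.
print(translate("ATGAAAGGTA AAGTTTGTTT GGTCACCGGG GCTACTGGTG GTATCGGCAA GGTAACAGCC"))
```

For the first 60 bases of the gene this yields the first 20 residues of the listed protein, MKGKVCLVTG ATGGIGKVTA.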