Gene Gdia_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1049 
Symbol 
ID6974446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1175814 
End bp1176794 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content62% 
IMG OID643390571 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_002275447 
Protein GI209543218 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAGA TCTTTGACGC GACGTCGAGG ACCGACGATG TTCTCTCCGG CGTATCCCTG 
AAGGGCAAGC GCGTTCTCGT GACCGGCGTT TCCGCCGGAC TGGGTATTGA AACGGCCCGG
ACACTGGCAG GTCATGGCGC GCATGTCGTG GGCGCGGCAC GCGATCTTGC AAAAGCGGAA
CGCGCAACCG ATCAGGTTCG CGTGGCCGCG TCGCAAGGAG GCGGAGCGTT CGAACTCATC
GCGCTCGACC TTGCGGATCT AGCAAGTGTG CGCGCCTGCG CCGACCGTCT GAATGCGCAA
GGCACGCCCT TCGACCTGAT CATCGCCAAT GCGGGCGTGA TGGCGACTCC ATTCGGGCAT
ACCAAGGATG GGTTCGAGAC GCAGTTCGGC ACCAACCATC TGGGACATTT CGTTCTGGTC
AACCGAATTG CCGGACTACT GCGCGACGGC GCGCGACTGG TCAATGTGTC CTCGGCTGGA
CATCGCTTCG CCGATGTCGA TCTCGACGAT CCGAATTTCG AGCAGACGCC TTACGTGCCG
TTCGTGGCTT ATGGACGTTC CAAGACTGCC AATATTCTCT TCGCCGTGGC CTTCGATGCG
CGGCATCGTG CAAGGGGCAT ACGCGCTACG GCGGTTCACC CGGGTGGGAT CAAGACGGAA
CTGGCGCGGC ACATGGCACC CGGGGAGATC GAAGCCATGG TGAAGCAGGT CAACGAACAG
GCTGCTGCCG AGGGCCAGAA GCCGTTCCAG TTCAAGAGCA TTCCGCAGGG GGCTGCAACC
TCGGTCTGGG CCGGCGTCGT GGCCGAAGCC GACATGGTAG GCGCTCATTA CTGCGAGGAT
TGCCACGTCA GCGATGTTGT ACCGAACGAC CTGCCGATCA GTCTGGTCAA CGCAGGGGTG
CGCGCCTACG CTCTCGATCC GGCACACGCC GAAGCCCTGT GGACAAAAAG CGAGGAGATG
GTCGGCGAAC GCTTCGCCTG A
 
Protein sequence
MTQIFDATSR TDDVLSGVSL KGKRVLVTGV SAGLGIETAR TLAGHGAHVV GAARDLAKAE 
RATDQVRVAA SQGGGAFELI ALDLADLASV RACADRLNAQ GTPFDLIIAN AGVMATPFGH
TKDGFETQFG TNHLGHFVLV NRIAGLLRDG ARLVNVSSAG HRFADVDLDD PNFEQTPYVP
FVAYGRSKTA NILFAVAFDA RHRARGIRAT AVHPGGIKTE LARHMAPGEI EAMVKQVNEQ
AAAEGQKPFQ FKSIPQGAAT SVWAGVVAEA DMVGAHYCED CHVSDVVPND LPISLVNAGV
RAYALDPAHA EALWTKSEEM VGERFA