Gene Gdia_1764 details


Gene Information

Locus tag: Gdia_1764
Symbol:
ID: 6975185
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1952345
End bp: 1953409
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 67%
IMG OID: 643391292
Product: aldo/keto reductase
Protein accession: YP_002276143
Protein GI: 209543914
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
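
The coordinates and lengths listed above agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the protein length excludes the terminal stop codon. The short Python sketch below works through that arithmetic. The inclusive-coordinate convention is an assumption inferred from the numbers, not something the record states, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1952345
end_bp = 1953409

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1065
print(protein_length)   # 354
```

Both results match the Gene Length (1065 bp) and Protein Length (354 aa) fields.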


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACTATC GCCAGCTCGG CCGTTCCGGC TTGCGCGTAT CCGTCTTCAC TCTCGGCACG 
ATGACGTTCG GCGGCAAGGG TGCCTTCGCC AAGACGGGCA GCACCGATGT CGCCGGCGCG
AAGCGGCAGA TCGACATGTG CATCGAAGCC GGCATCAACA TGTTCGATAC CGCCGACGTC
TATTCCTCCG GCGTGTCCGA GGAAATCCTG GGCGAGGCGC TGAAGGGCCG GGCGGGGGAA
ATCCTGGTGG GCACCAAGGC GCGCTTTCCC ATGGGCAAGG GACAGAACGA CGCCGGCTCG
TCCCGCTATC ACCTGATATC GGCGGCGGAA GCCAGCCTGC GCCGCCTCGG CCGCGACCAT
ATCGACCTGT TCTACCTGCA TGAATGGGAC GGCCAGACCC CGCTGGACGA AACGCTGGAG
GCGCTGGACA CCCTGACCCG CGCGGGCAAG ATCCGCTATG CCGGCGTGTC CAATTTCTCC
GCCTGGCACA TCATGAAGGC GCTCAGCACG GCGGAACGCC ATCGCCTGAT CGCCCCGGTG
TCGCAGCAGA TCTATTATTC GCTGCAGGCG CGCGAGGCCG AATACGAACT CCTGCCGCTC
GCGCTCGACC AGGGGATCGG CGTGCAGGTC TGGAGCCCGA TGGCCGGAGG CCTGCTGTCC
GGCAAGCACC GGCGCGGCAA GCCCGAACCC GAGGGCACGC GCCAGCTCGC GCAATGGAAC
GAACCGCCGG TCTATGACGT GGAAAAGCTC TACGATGTCG TCGAAGTCCT GGTGGCGATC
GGCGCGGAAC GCGGCGTGTC CGCCGCCCAG GTCGCGCTGG CCTGGGTCGC GCATCGTCCT
GCGATCACCT CGGTGGTGAT CGGGGCGCGG ACCGACGCGC AACTGGCCGA TAACCTCAAG
GCGGCGGAAC TCAGCCTGTC GGCCGAGGAA ATGGCCCGGC TGGACGAGGC CAGCGCGCCG
CCGCTGCTCT ATCCGTACTG GCATCAGGCC AGCACGGCGT CGGACCGCCT GTCGCCGGCC
GACCTGCTGC TGCTCGGCCC CGCGATCGAC AGGAAGAAGG GCTGA
 
Protein sequence
MHYRQLGRSG LRVSVFTLGT MTFGGKGAFA KTGSTDVAGA KRQIDMCIEA GINMFDTADV 
YSSGVSEEIL GEALKGRAGE ILVGTKARFP MGKGQNDAGS SRYHLISAAE ASLRRLGRDH
IDLFYLHEWD GQTPLDETLE ALDTLTRAGK IRYAGVSNFS AWHIMKALST AERHRLIAPV
SQQIYYSLQA REAEYELLPL ALDQGIGVQV WSPMAGGLLS GKHRRGKPEP EGTRQLAQWN
EPPVYDVEKL YDVVEVLVAI GAERGVSAAQ VALAWVAHRP AITSVVIGAR TDAQLADNLK
AAELSLSAEE MARLDEASAP PLLYPYWHQA STASDRLSPA DLLLLGPAID RKKG
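
The gene and protein sequences are printed in blocks of ten characters, so the whitespace has to be removed before any computation. The sketch below is one possible way to clean a sequence, compute its GC fraction, and translate it. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers, not part of any tool behind this record. Translation table 11 assigns amino acids to codons in the same way as the standard code (the two differ in permitted start codons), so the standard codon table is used here.

```python
# Codon table in NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)

def translate(seq):
    """Translate in frame from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First three blocks and the final line of the gene sequence above.
first_blocks = "ATGCACTATC GCCAGCTCGG CCGTTCCGGC"
last_line = "GACCTGCTGC TGCTCGGCCC CGCGATCGAC AGGAAGAAGG GCTGA"

print(translate(first_blocks))  # MHYRQLGRSG
print(translate(last_line))     # DLLLLGPAIDRKKG
```

The two translated fragments correspond to the first block and the last fourteen residues of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.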