Gene Gdia_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1201 
Symbol 
ID6974605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1337728 
End bp1339254 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content69% 
IMG OID643390730 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002275599 
Protein GI209543370 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACC TTTCCGTCGA AGCCCGTTCG CTGCTCGCCA CGCTGGGGGT CGCGGACGAC 
GCCCTGACGG GCGACCTGCC TGTGCGTTCA CCCGTCAATG GGCAGGAAAT CGCGCGTGTT
GCCACCGCCA CGCGGCAGAT GGCGTGCGAC GGGATCGCCT CGGCCCACGC GGCCTTCCAG
GCGTGGCGGC TGGTGCCGGC GCCGCGCCGG GGCGAACTGG TGCGGCTGCT GGGCGAGGAA
TTGCGTGCCG GCAAGTCGGC CCTGGGCCGC CTGGTGTCGA TCGAGGCCGG GAAGTCCCCG
TCCGAGGGAC TGGGCGAAGT GCAGGAAATG ATCGATATCT GCGACTTCGC CGTCGGCCTG
TCCCGCCAGT TGCACGGCCT GACGATCGCG ACCGAACGTC CGGACCACCG GATGATGGAA
ACCTGGCATC CGCTGGGTGT TACCGGCGTG ATCTCCGCCT TCAATTTCCC CGTCGCAGTC
TGGTCCTGGA ACGCCGCGCT GGCGCTGGTA TGCGGGAACC CGGTGGTGTG GAAGCCGTCC
GAGAAGACGC CGCTGACCGC CCTGGCCTGC CAGGCGCTTT TCCACCGTGC CGCCGCCCGC
TTCGCGGCGT CGGGCACGGC GGTCCCCGAC GGCCTGTCGG TCCTGCTGAC CGGTGGACGC
GCGGTGGGTG AAATCCTGGT CGATCATCCG GATGTGAAGC TGGTATCGGC GACCGGGTCC
ACGGATATGG GCCGCGCGGT CGGCCAGCGC CTGGCTGCCC GCTTCGCGCG GGCCATCCTG
GAACTGGGCG GCAACAACGC CGCCATCGTT ACGCCGTCGG CGGATCTGGA ACTGGCCCTG
CGCGGCGTCG CCTTCGCCGC GATGGGCACG GCGGGCCAGC GCTGCACGAC GTTGCGTCGC
CTGTTCGTGC ATGACGCCAT CTATGACGGG TTCGTCGCGC GGCTGAAATC GGCCTATGCG
ACCGTGACGG TCGGCAGCCC CCTGGAAGAG GGCAATCTGG TCGGCCCCCT GATCGATGCC
GCGGCGATGG ACCGCATGCA GCGTGCGCTG GAATCGGCGC GTGCGTTGGG TGGCGTGGTC
ACGGGCGGCC ACAGGGAAGG CGCGGCCGAC TGGCCCGATG CGTATTATGT CCGTCCCGCC
CTGGTGGAAA TGCCCGAACA GGCCGGCCCG GTGCTGGACG AGACCTTCGC GCCGATCCTG
TATGTGATGA AATACTCCGA TTTCGACCGC GCCGTCGCGT TGCAGAACGC GGTTCCGCAG
GGCCTGTCCT CGGCCGTGTT CACCACCGAC CTTCGCCAGG CCGAACGGTT CCTGTCGGCC
GCCGGGTCCG ATTGCGGCAT TGCCAACGTC AATATCGGCA CATCCGGGGC CGAGATCGGC
GGGGCGTTCG GCGGCGAGAA GGAAACCGGC GGCGGCCGCG AATCGGGATC GGACGCCTGG
AAAGGCTATA TGCGCCGCGC CACCAACACG ATCAATTACG GTACGACCCT GCCGCTGGCG
CAGGGCGTAT CGTTCGATAT CGGTTAG
 
Protein sequence
MIDLSVEARS LLATLGVADD ALTGDLPVRS PVNGQEIARV ATATRQMACD GIASAHAAFQ 
AWRLVPAPRR GELVRLLGEE LRAGKSALGR LVSIEAGKSP SEGLGEVQEM IDICDFAVGL
SRQLHGLTIA TERPDHRMME TWHPLGVTGV ISAFNFPVAV WSWNAALALV CGNPVVWKPS
EKTPLTALAC QALFHRAAAR FAASGTAVPD GLSVLLTGGR AVGEILVDHP DVKLVSATGS
TDMGRAVGQR LAARFARAIL ELGGNNAAIV TPSADLELAL RGVAFAAMGT AGQRCTTLRR
LFVHDAIYDG FVARLKSAYA TVTVGSPLEE GNLVGPLIDA AAMDRMQRAL ESARALGGVV
TGGHREGAAD WPDAYYVRPA LVEMPEQAGP VLDETFAPIL YVMKYSDFDR AVALQNAVPQ
GLSSAVFTTD LRQAERFLSA AGSDCGIANV NIGTSGAEIG GAFGGEKETG GGRESGSDAW
KGYMRRATNT INYGTTLPLA QGVSFDIG