Gene Gdia_1366 details


Gene Information

Locus tag: Gdia_1366
Symbol:
ID: 6974774
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1521590
End bp: 1523125
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 66%
IMG OID: 643390897
Product: Aldehyde dehydrogenase (NAD(+))
Protein accession: YP_002275762
Protein GI: 209543533
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
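
The coordinate and length fields in this record are related by simple arithmetic, which the short Python sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1521590
end_bp = 1523125

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1536
print(protein_length)  # 511
```

Both results agree with the Gene Length (1536 bp) and Protein Length (511 aa) fields.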


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTCTC TCGAGAGCCC CCCGCTGGAA AGCCGCCGGA TCGAGCAGTC GCTACCGTTC 
AAGGCGCAAT ATGACAATTT CATCGGCGGC CGGTGGGTCC CGCCGCGCGA GGGGACGTAT
TTCGACGACC CTTCGCCGGT CGACGGGCGC ATCCTGTGCC GGGTCGCCCG GTCCGGGGCG
GCGGATGTCG AACAGGCGCT GGACGCGGCC CATGCGGCCC GGGAAGCCTG GGGCGAGACC
AGCCCCGCCC AGCGCGCCCT GCTGCTGAAC CGGATCGCGG ATCGGATCGA GGCCAACCTC
GACACCCTGG CGCGCGCCGA AAGCTGGGAC AACGGCAAGC CGCTGCGCGA GACGCTGAAC
GCGGACATTC CGCTGTGCGT CGACCATTTC CGCTATTTCG CCGCCTGCAT CCGCGCGCAG
GAAGGGGCAT TGTCGGAAAT CGACCACGAC ACCGTCGCCT ATCATTTCCA CGAACCGCTG
GGCGTGGTCG GCCAGATCAT TCCGTGGAAT TTCCCGCTGC TGATGGCAAG CTGGAAACTG
GCCCCGGCGC TTGCGGCGGG AAACTGCGTG GTCATGAAAC CGGCCGAGCA GACCCCGGCC
TCGATCATGG TGCTGGCCGA CCTGATCGCG GACATCCTGC CGCCCGGCGT GCTGAACATC
GTCAACGGTT TCGGCAGGGA AGCCGGCACG GCGCTGGCCT CCAGCAACCG GATCGCCAAG
ATCGCCTTCA CCGGGTCCAC GCCGACCGGC AGGACGATCG CCCACGCGGC GGCCGACAAC
CTGATCCCCG CGACGCTGGA ACTGGGCGGC AAGTCGCCCA ACATCTTCTT TTCCGACGTC
GCGGCCGAGG ACGACGATTT CCTGGACAAG GCGATCGAGG GCTTCGTCCT GTTCGCCTTC
AACCAGGGCG AGGTCTGCAC CTGCCCGTCA CGGGCGCTGA TCCATGAATC GGTCTATGAC
CGGTTCATGG AACGCGCGCT GCGCCGGGTG GCGGCGATCA AACAGGGCAA TCCGCTGGAC
ATGGCGACGA TGGTCGGCGC GCAGGCCTCG ACCGAGCAGG TCAGCAAGAT CCTGAACTAT
ATCGACATCG GACGGCAGGA AGGGGCGGAA CTGCTGATCG GCGGCGCGCG GGCCGAACTG
GGCGGCAGCC TGTCGAACGG GTGCTATATC CAGCCCACCG TGTTCAGGGG CGACAACCGG
ATGCGGATCT TCCAGGAGGA AATCTTCGGC CCCGTCGTCT CGGTCACCAC CTTCCGCGAC
GACGCCGAGG CCGTGGCGCT GGCCAATGAC ACGCTGTACG GCCTGGGGGC CGGGGTGTGG
ACCCGCGACA TCACCCGGGC CTATCGCATG GGACGCGCCA TCAAGGCCGG GCGCGTCTGG
ACCAACTGCT ATCACGCCTA TCCGGCGCAC GCGGCCTTCG GCGGATACAA GCAATCCGGA
ATCGGGCGGG AAAACCACCG CATGATGCTG GACCACTACC AGCAGACCAA GAACCTGCTG
GTCAGCTACA AGCCGCAGAA GCTGGGTTTC TTCTGA
 
Protein sequence
MDSLESPPLE SRRIEQSLPF KAQYDNFIGG RWVPPREGTY FDDPSPVDGR ILCRVARSGA 
ADVEQALDAA HAAREAWGET SPAQRALLLN RIADRIEANL DTLARAESWD NGKPLRETLN
ADIPLCVDHF RYFAACIRAQ EGALSEIDHD TVAYHFHEPL GVVGQIIPWN FPLLMASWKL
APALAAGNCV VMKPAEQTPA SIMVLADLIA DILPPGVLNI VNGFGREAGT ALASSNRIAK
IAFTGSTPTG RTIAHAAADN LIPATLELGG KSPNIFFSDV AAEDDDFLDK AIEGFVLFAF
NQGEVCTCPS RALIHESVYD RFMERALRRV AAIKQGNPLD MATMVGAQAS TEQVSKILNY
IDIGRQEGAE LLIGGARAEL GGSLSNGCYI QPTVFRGDNR MRIFQEEIFG PVVSVTTFRD
DAEAVALAND TLYGLGAGVW TRDITRAYRM GRAIKAGRVW TNCYHAYPAH AAFGGYKQSG
IGRENHRMML DHYQQTKNLL VSYKPQKLGF F
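
The gene and protein sequences above are printed in blocks of ten characters. The sketch below is one possible way to strip that spacing, compute the GC fraction, and translate the nucleotide sequence. It is an illustration, not the method used by the database. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in which codons may act as start codons). A stop codon is rendered as `*`.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order; translation
# table 11 uses the same assignments for elongation and stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence in this record.
first_blocks = "ATGGATTCTC TCGAGAGCCC CCCGCTGGAA"
print(translate(first_blocks))  # MDSLESPPLE
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.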