Gene Gdia_1997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1997 
Symbol 
ID6975423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2218395 
End bp2219837 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content67% 
IMG OID643391526 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002276372 
Protein GI209544143 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAGAAAA CCTATGATCT TTTCATCGAC GGCCGTTGGG TGCCGGCGGC CAAGGGCGAG 
CGCCTGGCCG TGGAAAACCC GGCGACGGGT GACGTCCTCG CCGAGGTCGC CAACGGCACG
TCCGAGGATG TCGACCGTGC CGTGGCGGCC GCGAAGCAGG CGATGCCGGG ATGGAGCCGC
AGGACCGCGA CCGAGCGGGC GGATGACCTG TATCGCCTCA TCGGCCTGAT CAAACGGGAT
GCCGAGCATC TGGCACGCAC GATCACGCGG GAAATGGGCA AGCCGATCCG CGAGGCGCGC
GTCGAAGTCG CGTTCGCCAC GGACCTGCTG CGCTTCGCGG CCGAAAATAC CCGCCGCCTG
GAAGGCGAGA TCCTTCCGGG CTCCCGCTCC GGCGAGAAGA TCCTGATCGA CCGCAAGCCG
GTCGGTGTCG TCGGCGCCAT CGCCGCCTGG AATTTCCCGC TGGCGCTGGT CGCGCGCAAG
CTGGGCCCGG CGCTGGCGGC GGGCAATGCG ATCGTCATCA AGCCGCATGA AATGACGCCG
CTGGCCGCGC TGGAACTGGC CAGGCTGGTG GCCGAGGCCG ACATCCCGGC GGGCGTGGTC
AATATCGTCA CCGGCGACGG TCCGCGCGTC GGCGTTCCGC TGGTGGCGCA TCCGGACACG
CGGCTGATCA CCATGACCGG CAGCACGTCC GCCGGGAAGA AGATCATGGC CGCCGCGGCC
GAGCACCTGA AGATCGTGCG CCTGGAACTG GGCGGCAAGG CGCCGTTCAT CGTGGCGGAC
GACGCGGACA TCGACCGCGC CGTGGAAGCC GCCGTGGTGT CGCGCTTCGG CAATGCCGGC
CAGGTCTGCA CGGCGAACGA GCGCACCTAT GTCGATGCGA AAATCTATGA CATCTTCGCG
GCCCGGCTGC GTGCCCGCAT CGAAAAGCTG AAGGTCGGCG ACCCGCTGGA CGAGGCGACG
GACATGGGGC CGAAGGTCTG CGGCCCGGAA CTCGAAAAGG TCGACCAGAT GGTCCGGCGC
GCGGTCGAAC AGGGCGCGAA GCTGGAACGG GGCGGCGCGC GGCTGACGGG CGGCCTCTAT
GACAGGGGGC AGTTCTATGC GCCCACGCTG CTGACCGGTG TCACCGGGAC GATGGACATC
GCCCGGAACG AGGTCTTCGG GCCGGTCCTC TCGCTGATCC GGGTGGACAG CTACGAGGAC
GCGATCCGCC AGGCCAATGC CTCGCGCTAT GGCCTGTCGG CCTACGTGTT CACCAACAGC
CTGGACCGGA TCATGAAGAT CAACGCCGAA CTGGAATTCG GCGAGGTCTA TGTGAACCGC
GAGAGCGGCG AGTCCGCGCA CGGCTTCCAT CACGGCTATC GCGACAGCGG CATCGGCGGT
GAAGACGGCC AGCACGGCCT GGAAGCCTAT GTCGAGACGC AGACCATCTA TCTGAACGCC
TGA
 
Protein sequence
MQKTYDLFID GRWVPAAKGE RLAVENPATG DVLAEVANGT SEDVDRAVAA AKQAMPGWSR 
RTATERADDL YRLIGLIKRD AEHLARTITR EMGKPIREAR VEVAFATDLL RFAAENTRRL
EGEILPGSRS GEKILIDRKP VGVVGAIAAW NFPLALVARK LGPALAAGNA IVIKPHEMTP
LAALELARLV AEADIPAGVV NIVTGDGPRV GVPLVAHPDT RLITMTGSTS AGKKIMAAAA
EHLKIVRLEL GGKAPFIVAD DADIDRAVEA AVVSRFGNAG QVCTANERTY VDAKIYDIFA
ARLRARIEKL KVGDPLDEAT DMGPKVCGPE LEKVDQMVRR AVEQGAKLER GGARLTGGLY
DRGQFYAPTL LTGVTGTMDI ARNEVFGPVL SLIRVDSYED AIRQANASRY GLSAYVFTNS
LDRIMKINAE LEFGEVYVNR ESGESAHGFH HGYRDSGIGG EDGQHGLEAY VETQTIYLNA