Gene Information |
Locus tag | Gdia_1997 |
Symbol | |
ID | 6975423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 2218395 |
End bp | 2219837 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643391526 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002276372 |
Protein GI | 209544143 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
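The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Residue count for an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if includes_stop else codons


length_bp = gene_length(2218395, 2219837)
print(length_bp)                  # 1443
print(protein_length(length_bp))  # 480
```

Both values agree with the Gene Length and Protein Length fields listed in the record.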
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCAGAAAA CCTATGATCT TTTCATCGAC GGCCGTTGGG TGCCGGCGGC CAAGGGCGAG CGCCTGGCCG TGGAAAACCC GGCGACGGGT GACGTCCTCG CCGAGGTCGC CAACGGCACG TCCGAGGATG TCGACCGTGC CGTGGCGGCC GCGAAGCAGG CGATGCCGGG ATGGAGCCGC AGGACCGCGA CCGAGCGGGC GGATGACCTG TATCGCCTCA TCGGCCTGAT CAAACGGGAT GCCGAGCATC TGGCACGCAC GATCACGCGG GAAATGGGCA AGCCGATCCG CGAGGCGCGC GTCGAAGTCG CGTTCGCCAC GGACCTGCTG CGCTTCGCGG CCGAAAATAC CCGCCGCCTG GAAGGCGAGA TCCTTCCGGG CTCCCGCTCC GGCGAGAAGA TCCTGATCGA CCGCAAGCCG GTCGGTGTCG TCGGCGCCAT CGCCGCCTGG AATTTCCCGC TGGCGCTGGT CGCGCGCAAG CTGGGCCCGG CGCTGGCGGC GGGCAATGCG ATCGTCATCA AGCCGCATGA AATGACGCCG CTGGCCGCGC TGGAACTGGC CAGGCTGGTG GCCGAGGCCG ACATCCCGGC GGGCGTGGTC AATATCGTCA CCGGCGACGG TCCGCGCGTC GGCGTTCCGC TGGTGGCGCA TCCGGACACG CGGCTGATCA CCATGACCGG CAGCACGTCC GCCGGGAAGA AGATCATGGC CGCCGCGGCC GAGCACCTGA AGATCGTGCG CCTGGAACTG GGCGGCAAGG CGCCGTTCAT CGTGGCGGAC GACGCGGACA TCGACCGCGC CGTGGAAGCC GCCGTGGTGT CGCGCTTCGG CAATGCCGGC CAGGTCTGCA CGGCGAACGA GCGCACCTAT GTCGATGCGA AAATCTATGA CATCTTCGCG GCCCGGCTGC GTGCCCGCAT CGAAAAGCTG AAGGTCGGCG ACCCGCTGGA CGAGGCGACG GACATGGGGC CGAAGGTCTG CGGCCCGGAA CTCGAAAAGG TCGACCAGAT GGTCCGGCGC GCGGTCGAAC AGGGCGCGAA GCTGGAACGG GGCGGCGCGC GGCTGACGGG CGGCCTCTAT GACAGGGGGC AGTTCTATGC GCCCACGCTG CTGACCGGTG TCACCGGGAC GATGGACATC GCCCGGAACG AGGTCTTCGG GCCGGTCCTC TCGCTGATCC GGGTGGACAG CTACGAGGAC GCGATCCGCC AGGCCAATGC CTCGCGCTAT GGCCTGTCGG CCTACGTGTT CACCAACAGC CTGGACCGGA TCATGAAGAT CAACGCCGAA CTGGAATTCG GCGAGGTCTA TGTGAACCGC GAGAGCGGCG AGTCCGCGCA CGGCTTCCAT CACGGCTATC GCGACAGCGG CATCGGCGGT GAAGACGGCC AGCACGGCCT GGAAGCCTAT GTCGAGACGC AGACCATCTA TCTGAACGCC TGA
|
Protein sequence | MQKTYDLFID GRWVPAAKGE RLAVENPATG DVLAEVANGT SEDVDRAVAA AKQAMPGWSR RTATERADDL YRLIGLIKRD AEHLARTITR EMGKPIREAR VEVAFATDLL RFAAENTRRL EGEILPGSRS GEKILIDRKP VGVVGAIAAW NFPLALVARK LGPALAAGNA IVIKPHEMTP LAALELARLV AEADIPAGVV NIVTGDGPRV GVPLVAHPDT RLITMTGSTS AGKKIMAAAA EHLKIVRLEL GGKAPFIVAD DADIDRAVEA AVVSRFGNAG QVCTANERTY VDAKIYDIFA ARLRARIEKL KVGDPLDEAT DMGPKVCGPE LEKVDQMVRR AVEQGAKLER GGARLTGGLY DRGQFYAPTL LTGVTGTMDI ARNEVFGPVL SLIRVDSYED AIRQANASRY GLSAYVFTNS LDRIMKINAE LEFGEVYVNR ESGESAHGFH HGYRDSGIGG EDGQHGLEAY VETQTIYLNA
|
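The gene and protein sequences above can be related with a short script. This is a minimal sketch, not the pipeline used to produce the record: it assumes translation table 11 (as listed in the record), under which the amino-acid assignments match the standard code and an alternative initiation codon such as TTG is read as methionine at the first position. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Amino-acid assignments in NCBI codon order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code; it differs
# in which codons may act as initiators.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [x + y + z for x in BASES for y in BASES for z in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # any valid initiator is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "TTGCAGAAAA CCTATGATCT TTTCATCGAC"
print(translate(prefix))  # MQKTYDLFID
```

The output matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field.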