Gene Information |
Locus tag | Gdia_0727 |
Symbol | |
ID | 6974124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 827083 |
End bp | 828528 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643390256 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002275132 |
Protein GI | 209542903 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
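The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it matches the listed gene length) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(827083, 828528)
residues = protein_length_aa(length)
print(length, residues)  # 1446 481
```

Both results agree with the Gene Length (1446 bp) and Protein Length (481 aa) fields in the record.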
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.505527 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCCAGT ATCCCGAGAC GCTGCTGTTC ATCGACGGGC AGTGGATCGC GGCCAGCGAC GGACGGTTCG TCGACGTGGT GAACCCCGCG ACGGAACAGG TCATCGGCCG CGTGGCCCAT GCCGGCCAGC CCGAACTGGC GGCGGCGGCC GCGGCGGCCG GGCGCGGCTT CGCGGTCTGG AGCCGGATGT CGGTGTTCGA CCGCTACAGG ATCATGCGGC AGGCGGCGAC CCTGCTGCGC GAGCGGGTCG AAACGGTGGC CCCGATCATG ACGATGGAAC AGGGCAAGCC CGTGGCCGAG GCCCGGACGG AACTGCTGGC GGCGGCGGAT ACGATCGACT GGCTGGCGGA AGAGGGGCGT CGCGCCTATG GCCGCCTGAT TCCCGCCCGC GCCGTGGGCG TGACCCAGGC GGTGATCCGC ACCCCGGTCG GTCCGGTCGC CGCGTTCTCG CCGTGGAATT TCCCGGTCAA CCAGGTGGTG CGCAAGGTGG GCGCGGCGCT GGCCACCGGG TGCTCGATCA TCGTCAAGGC CGCCGAGGAA ACGCCGGCCT CGCCGGCCGC CCTGGTGCGG GCCTTCGCCG ATGCCGGGGT ACCGGCGGGG GTGATCGGGC TGGTCTACGG CACGCCGTCC GAAATCTCGG AAACGCTGAT CGCGCATCCG GCCATCCGCA AGGTGACGTT CACCGGATCG ACCGCCGTGG GCAAGATGCT GGCGGCCCTG GCCGGATCGC ACATGAAGCG CGCGACGATG GAACTGGGCG GCCATGGTCC GGCGATCGTG TGCGCCGATG CCGATCTCGA CCGGGCCGCG ACCACGCTGG TGGCGGCGAA GTTCCGCAAT GCCGGACAGG TCTGCGTCTC GCCCACCCGC TTCCTGGTGG AACGCCCGGT CTTCGGCCGC TTCGTCGAAC GGTTCGCGGA ACTGGCCCGG AAGGTTCAGG TCGGCGACGG GCTGCAGTCC GGCACCACCA TGGGACCGCT GGCCAATGTC CGGCGCGTGG ACGCGATGGA GGCCCTGATC GGCGATGCCA CCGCCCGGGG GGCGGAGATC GTGACCGGCG GCCAGCGGGT GGGCAATGCG GGGTATTTCT TCGCGCCCAC GGTCCTGCGC GACCTGACCA CCGAGATGCG CATCATGAAC GAGGAGCCGT TCGGTCCCGT GGCGCTGATG TGCCCGTTCG ACACGCTGGA CGATGCGGTG GGCGAGGCGA ACCGCCTGCC TTACGGCCTG GCAGCCTATG CCTTTACCGG TTCGGGGCGC ACGGCGGCCC GCCTGAGGGA CGAGGTCCGG ACCGGCATGC TGACCGTCAA CCACCTGGGC CTGGGACTGC CCGAAGTCCC GTTCGGCGGA ATCGGCGATT CCGGGTACGG ATCGGAAGGC GGGACCGAAG CGCTGGATGC CTATCTGGAC ACCCGCTTCT TCTCGATGCG CGACTATCCG GCTTGA
Protein sequence | MSQYPETLLF IDGQWIAASD GRFVDVVNPA TEQVIGRVAH AGQPELAAAA AAAGRGFAVW SRMSVFDRYR IMRQAATLLR ERVETVAPIM TMEQGKPVAE ARTELLAAAD TIDWLAEEGR RAYGRLIPAR AVGVTQAVIR TPVGPVAAFS PWNFPVNQVV RKVGAALATG CSIIVKAAEE TPASPAALVR AFADAGVPAG VIGLVYGTPS EISETLIAHP AIRKVTFTGS TAVGKMLAAL AGSHMKRATM ELGGHGPAIV CADADLDRAA TTLVAAKFRN AGQVCVSPTR FLVERPVFGR FVERFAELAR KVQVGDGLQS GTTMGPLANV RRVDAMEALI GDATARGAEI VTGGQRVGNA GYFFAPTVLR DLTTEMRIMN EEPFGPVALM CPFDTLDDAV GEANRLPYGL AAYAFTGSGR TAARLRDEVR TGMLTVNHLG LGLPEVPFGG IGDSGYGSEG GTEALDAYLD TRFFSMRDYP A
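The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, with a GC-fraction helper and a basic completeness check for a coding sequence. All function names are hypothetical. The stop codons used (TAA, TAG, TGA) are those of translation table 11. The start codon is deliberately not checked, because table 11 permits alternative start codons.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) >= 3 and len(seq) % 3 == 0 and seq[-3:] in stops


# First two display blocks of the gene sequence, used here only as a short demo.
demo = join_blocks("ATGTCCCAGT ATCCCGAGAC")
print(demo)               # ATGTCCCAGTATCCCGAGAC
print(gc_fraction(demo))  # 0.55

# Toy sequence, not taken from the record.
print(looks_like_complete_cds("ATGGCTTGA"))  # True
```

Applying `join_blocks` to the full Gene sequence field and then calling `gc_fraction` would be one way to compare against the GC content field, which appears to be rounded to a whole percent.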