Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_51320 |
Symbol | galD |
ID | 7763972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5215262 |
End bp | 5216410 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643807952 |
Product | galactonate dehydratase |
Protein accession | YP_002802186 |
Protein GI | 226947113 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCA CCCGACTGAC CACCTACATA GTCCCGCCGC GCTGGCTGTT TTTGAAAGTC GAGACCGACG AGGGCATCGT CGGCTGGGGC GAGCCGATCG TCGAAGGGCG TGCGCACAGC GTGGCGGCGG CCGTCGAGGA ACTGGCCGAC TACCTGATCG GCAAGGACCC GCGGCTGATC GAGGATCACT GGAACGTGCT CTACCGCGCC GGCTTCTACC GTGGCGGGCC GATCCACATG AGCGCCCTGG CCGGCATCGA CCAGGCCCTC TGGGACATCA AGGGCAAGGC CCTCGGCGTG CCGGTGCACG CGCTGCTCGG CGGCCAGTGC CGCGAGCGCA TCAAGGTCTA TTCGTGGATC GGCGGCGACC GCCCGGCCGA CGTCGCCCGC GCCGCCCGCG AAGTGGTCGG CCGCGGCTTC AGCGCGGTGA AGATGAACGG CACCGAGGAG CTGCAGTTCG TCGACTCCCA CGCCAAGATC GACGCGGCGG TGGCCAACGT GGCGGCGGTG CGCGAGGCCG TGGGACCGGA CATCGGCATC GGCGTGGATT TCCACGGCCG GGTGCACAAG CCGATGGCCA AGATCCTCGC CAGGGAGCTG GAGCCGTACC GGCTGATGTT CATCGAGGAG CCGGTGCTCA GCGAGAACTA CGAGGCGCTG CGCGACATCC GCGAGCACAC CTCGACGCCC ATCGCCCTCG GCGAGCGGCT GTTCTCGCGC TGGGACTTCA AGCGCGTGCT GGCCGACGGC TTCGTCGACA TCATCCAGCC CGACCCCTCG CACTCCGGCG GCATCACCGA GACGCGCAAG ATCGCCGCCA TGGCCGAGGC CTACGACGTC GCCCTGGCGC TGCACTGCCC GCTGGGGCCG ATCGCCCTGG CCGCCAACCT GCAACTGGAC GCGGTCTGCT ACAACGCCTT CATCCAGGAG CAGAGCCTGG GCATCCACTA CAACGAGAGC AACGACATCC TCGACTATCT GGCCCGGCCG GAAGTCTTCG CCTACCGCGA CGGCTTCGTC GACATCCCCC AGGGGCCGGG GCTCGGCATC GAGGTCAACG AGGACTACGT GCTCGAGCGC GCCGATGTCG GCCACCGCTG GCGCAACCCG GTATGGCGCC ACGCCGACGG CAGCGTCGCC GAGTGGTAG
|
Protein sequence | MKITRLTTYI VPPRWLFLKV ETDEGIVGWG EPIVEGRAHS VAAAVEELAD YLIGKDPRLI EDHWNVLYRA GFYRGGPIHM SALAGIDQAL WDIKGKALGV PVHALLGGQC RERIKVYSWI GGDRPADVAR AAREVVGRGF SAVKMNGTEE LQFVDSHAKI DAAVANVAAV REAVGPDIGI GVDFHGRVHK PMAKILAREL EPYRLMFIEE PVLSENYEAL RDIREHTSTP IALGERLFSR WDFKRVLADG FVDIIQPDPS HSGGITETRK IAAMAEAYDV ALALHCPLGP IALAANLQLD AVCYNAFIQE QSLGIHYNES NDILDYLARP EVFAYRDGFV DIPQGPGLGI EVNEDYVLER ADVGHRWRNP VWRHADGSVA EW
|
| |