Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_08730 |
Symbol | xylG |
ID | 7759823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 827784 |
End bp | 829244 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643803787 |
Product | 2-hydroxymuconic semi-aldehyde dehydrogenase |
Protein accession | YP_002798089 |
Protein GI | 226943016 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAGA TCAAGCACTT CATCAACGGC GAATACGTCG GCTCTGCCAG CGGCAAGCTG TTCGACAACG TCAACCCGGC CAACGGCGAG GTGATCGCCA GGATCCACGA AGCCGGCGAG GCCGAGGTGG ACGCCGCGGT CAAGGCCGCC CGCGCCGCCC TGAAAGGCCC CTGGGGCAAG ATGAGCGTGG CCGAGCGCAC CGAGATCCTG CACCGCGTCG CCGCCGGCAT CACCGCGCGC TTCGACGAGT TCCTCGAAGC CGAGTGCCAG GACACCGGCA AGCCGAAATC GCTCGCCTCG CATATCGACA TCCCGCGCGG CGCGGCCAAC TTTTCGGTGT TCGCCGACCT GGTGAAGAAC GTCCCCACCG AGGCCTTCGA GATGGCCACC CCGGACGGCA GCGGCGCGCT CAACTACGGC GTGCGCCGGC CCAAGGGGGT GATCGGCGTG ATCAGCCCGT GGAACCTGCC GCTGCTGTTG ATGACCTGGA AGGTCGGCCC GGCACTGGCC TGCGGCAACA CCGTGGTGGT CAAGCCGTCC GAGGAAACCC CGAGCACCAC CGCGCTGCTC GGCGAGGTGA TGAACGCCGC CGGCGTGCCG GCCGGCGTCT ACAACGTGGT GCACGGCTTC GGCGGCAACT CGGCCGGCGC CTTCCTCACC GCCCACCCGG ACGTCGACGG CATCACCTTC ACCGGTGAAA CCGGCACCGG CGAAACCATC ATGCGTGCCG CCGCCAAGGG CGTGCGCCAG GTGTCCCTGG AGCTGGGCGG CAAGAACGCC GGCATCGTGT TCGCCGACGC CGACCTGGAC AAGGCTATCG AGGGCACCCT GCGTTCGGCC TTCGCCAACT GCGGCCAGGT CTGCCTGGGT ACCGAGCGGG TCTACGTGCA GCGGCCGATC TTCGACGCGT TCGTCGCCCG CCTGAAGGCC GGCGCCGAGG CGCTGGTAAT CGGCGAGCCG AACGATCCGA AGGCCAACTT CGGCCCGCTG GTCAGCCACA AGCACCGCGA GAAGGTGCTC AGCTACTACC AGAAGGCCAA GGACGAGGGC GCCACCATAG TCACCGGCGG CGGCGTGCCG GACATGCCTC AGCACCTGGC CGGCGGCGCC TGGGTGCAGC CGACCATCTG GACCGGCCTG AAGGACGATT CGCCGGTGGT CACCGAGGAA ATCTTCGGGC CCTGCTGCCA CATCCGCCCG TTCGATACCG AGGAAGAAGC CATCGAGCTG GCCAACAGCC TGCCCTATGG CCTGGCCTCG GCGATCTGGA CCGAGAACGC CTCGCGCGCC CACCGCGTCG CCGGGCGGAT CGAGGCCGGC ATCGTCTGGG TGAATAGCTG GTTCCTGCGC GACCTGCGCA CCGCCTTCGG CGGCGCCAAG CAGTCGGGTA TCGGCCGCGA GGGAGGGGTG CACTCGCTGG AGTTCTACAC CGAGCTGAAG AACATCTGCG TGAAGCTGTG A
|
Protein sequence | MKEIKHFING EYVGSASGKL FDNVNPANGE VIARIHEAGE AEVDAAVKAA RAALKGPWGK MSVAERTEIL HRVAAGITAR FDEFLEAECQ DTGKPKSLAS HIDIPRGAAN FSVFADLVKN VPTEAFEMAT PDGSGALNYG VRRPKGVIGV ISPWNLPLLL MTWKVGPALA CGNTVVVKPS EETPSTTALL GEVMNAAGVP AGVYNVVHGF GGNSAGAFLT AHPDVDGITF TGETGTGETI MRAAAKGVRQ VSLELGGKNA GIVFADADLD KAIEGTLRSA FANCGQVCLG TERVYVQRPI FDAFVARLKA GAEALVIGEP NDPKANFGPL VSHKHREKVL SYYQKAKDEG ATIVTGGGVP DMPQHLAGGA WVQPTIWTGL KDDSPVVTEE IFGPCCHIRP FDTEEEAIEL ANSLPYGLAS AIWTENASRA HRVAGRIEAG IVWVNSWFLR DLRTAFGGAK QSGIGREGGV HSLEFYTELK NICVKL
|
| |