Gene Avin_08730 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_08730 
SymbolxylG 
ID7759823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp827784 
End bp829244 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content68% 
IMG OID643803787 
Product2-hydroxymuconic semi-aldehyde dehydrogenase 
Protein accessionYP_002798089 
Protein GI226943016 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAGA TCAAGCACTT CATCAACGGC GAATACGTCG GCTCTGCCAG CGGCAAGCTG 
TTCGACAACG TCAACCCGGC CAACGGCGAG GTGATCGCCA GGATCCACGA AGCCGGCGAG
GCCGAGGTGG ACGCCGCGGT CAAGGCCGCC CGCGCCGCCC TGAAAGGCCC CTGGGGCAAG
ATGAGCGTGG CCGAGCGCAC CGAGATCCTG CACCGCGTCG CCGCCGGCAT CACCGCGCGC
TTCGACGAGT TCCTCGAAGC CGAGTGCCAG GACACCGGCA AGCCGAAATC GCTCGCCTCG
CATATCGACA TCCCGCGCGG CGCGGCCAAC TTTTCGGTGT TCGCCGACCT GGTGAAGAAC
GTCCCCACCG AGGCCTTCGA GATGGCCACC CCGGACGGCA GCGGCGCGCT CAACTACGGC
GTGCGCCGGC CCAAGGGGGT GATCGGCGTG ATCAGCCCGT GGAACCTGCC GCTGCTGTTG
ATGACCTGGA AGGTCGGCCC GGCACTGGCC TGCGGCAACA CCGTGGTGGT CAAGCCGTCC
GAGGAAACCC CGAGCACCAC CGCGCTGCTC GGCGAGGTGA TGAACGCCGC CGGCGTGCCG
GCCGGCGTCT ACAACGTGGT GCACGGCTTC GGCGGCAACT CGGCCGGCGC CTTCCTCACC
GCCCACCCGG ACGTCGACGG CATCACCTTC ACCGGTGAAA CCGGCACCGG CGAAACCATC
ATGCGTGCCG CCGCCAAGGG CGTGCGCCAG GTGTCCCTGG AGCTGGGCGG CAAGAACGCC
GGCATCGTGT TCGCCGACGC CGACCTGGAC AAGGCTATCG AGGGCACCCT GCGTTCGGCC
TTCGCCAACT GCGGCCAGGT CTGCCTGGGT ACCGAGCGGG TCTACGTGCA GCGGCCGATC
TTCGACGCGT TCGTCGCCCG CCTGAAGGCC GGCGCCGAGG CGCTGGTAAT CGGCGAGCCG
AACGATCCGA AGGCCAACTT CGGCCCGCTG GTCAGCCACA AGCACCGCGA GAAGGTGCTC
AGCTACTACC AGAAGGCCAA GGACGAGGGC GCCACCATAG TCACCGGCGG CGGCGTGCCG
GACATGCCTC AGCACCTGGC CGGCGGCGCC TGGGTGCAGC CGACCATCTG GACCGGCCTG
AAGGACGATT CGCCGGTGGT CACCGAGGAA ATCTTCGGGC CCTGCTGCCA CATCCGCCCG
TTCGATACCG AGGAAGAAGC CATCGAGCTG GCCAACAGCC TGCCCTATGG CCTGGCCTCG
GCGATCTGGA CCGAGAACGC CTCGCGCGCC CACCGCGTCG CCGGGCGGAT CGAGGCCGGC
ATCGTCTGGG TGAATAGCTG GTTCCTGCGC GACCTGCGCA CCGCCTTCGG CGGCGCCAAG
CAGTCGGGTA TCGGCCGCGA GGGAGGGGTG CACTCGCTGG AGTTCTACAC CGAGCTGAAG
AACATCTGCG TGAAGCTGTG A
 
Protein sequence
MKEIKHFING EYVGSASGKL FDNVNPANGE VIARIHEAGE AEVDAAVKAA RAALKGPWGK 
MSVAERTEIL HRVAAGITAR FDEFLEAECQ DTGKPKSLAS HIDIPRGAAN FSVFADLVKN
VPTEAFEMAT PDGSGALNYG VRRPKGVIGV ISPWNLPLLL MTWKVGPALA CGNTVVVKPS
EETPSTTALL GEVMNAAGVP AGVYNVVHGF GGNSAGAFLT AHPDVDGITF TGETGTGETI
MRAAAKGVRQ VSLELGGKNA GIVFADADLD KAIEGTLRSA FANCGQVCLG TERVYVQRPI
FDAFVARLKA GAEALVIGEP NDPKANFGPL VSHKHREKVL SYYQKAKDEG ATIVTGGGVP
DMPQHLAGGA WVQPTIWTGL KDDSPVVTEE IFGPCCHIRP FDTEEEAIEL ANSLPYGLAS
AIWTENASRA HRVAGRIEAG IVWVNSWFLR DLRTAFGGAK QSGIGREGGV HSLEFYTELK
NICVKL