Gene Avin_30620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_30620 
SymbollapC 
ID7761962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3169742 
End bp3171202 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content69% 
IMG OID643805938 
Product2-hydroxymuconic semialdehyde dehydrogenase; LapC 
Protein accessionYP_002800202 
Protein GI226945129 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.141179 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAAGA TCGAGAACTT CATTGCCGGC GAGTACGTCG CCGCCGCCAG CGGCAGGCGT 
TTCGACAAGC GTTCGCCGCT CGACAACCGG GTGATCGCCT CGATCGCCGA GGCCGGGCGC
GCCGAGGTGG ATGCGGCGGT CGGGGCCGCG CGCGGCGCGC TCGCCGGCGA CTGGGGCAGG
CTGAGTACCG AACAGCGCGT CGAGCTGCTG TACGGCGTGG CCAACGAGAT CACCCGCCGC
TTCGATGATT TCGTCGAGGC GGAGATGGCC GACACCGGCC AGCCGGCGCA CGTGATGAAG
CAGGTGTTCA TCCCGCGCGG CGCGGCCAAC TTCAAGGTGT TCGCCGACGT GGTGAAGAAC
GTCGCCAGCG AATCCTTCCA GACGGCCACC CCGGACGGGC GCGGCGCGCT CAATTACGCG
CTGCGCGTGC CCAAGGGGGT GATCGGGGTG ATCTGCCCGT GGAACGCGCC CTTCATGCTG
ATGACCTGGA AGGTCGGCCC GGCGCTGGCC TGCGGCAACG CGGTGGTGGT CAAGCCGTCC
GAGGAGTCGC CGCAGACCGC CGCGCTGCTC GGCGAGGTGA TGAACGCGGT GGGCATTCCC
AAGGGCGTCT ACAACGTGGT GCAGGGGTTC GGCCCGGATT CGGCGGGCGA ATTCGTCACC
CAGCACCCGG GCGTCGACGC CATCACCTTC ACCGGCGAAA CGCGCACCGG CGCGGCGATC
ATGAAGGCCG CCTCGGAAGG CATGCGCGAC GTATCCTTCG AGCTGGGCGG CAAGAACGCC
GGCATCGTCT TCGCCGATTG CGACTTCGAG GCGGCGGTGG AGGGCATCTT CCGCTCCGCT
TTCCTCAATT CCGGGCAGGT GTGCCTGGGC ACCGAACGGG TGTACGTCGA GCGGCCGATC
TTCGAGCGCT TCGTGCAGGC GCTCAAGGTC AAAGCGGAAA GCGTCCGCTT CGGCCGCCCG
GACGATCACG ACGCCAATTA TGGTCCGCTG ATCAGCCAGG AGCACCGCCA GAAGGTGCTG
TCCTACTACC GCAAGGCGCT GGAAGAGGGC GCCACGCTGG TCACCGGCGG CGGCGTGCCG
GAGATGCCGG GCGAACTGGC CGAGGGCGCC TGGGTGCAGC CGACCATCTG GACCGGCCTG
CCGGAAAGCG CCGCGGTGGT GCGCGAGGAG ATCTTCGGAC CCTGCTGCCA CATCCGCCCG
TTCGACGCCG AGGACGAGGT GGTGCAACTC GCCAACGCCA CCGACTACGG CCTGTCCACC
ACGCTCTGGA CCAACGACCT GGCCCGCGCC CACCGCCTGG CGGCGCGCGT CGAGGTGGGC
ATCACCTGGA TCAACAGTTG GTTCCTGCGC GACCTGCGCA CCGCCTTCGG CGGCGCCAAG
CAGTCCGGCA TCGGCCGCGA GGGCGGCGTG CATTCGCTGG AGTTCTACAC CGAGACGCGC
AACGTCTGCG TGAAGCTCTG A
 
Protein sequence
MRKIENFIAG EYVAAASGRR FDKRSPLDNR VIASIAEAGR AEVDAAVGAA RGALAGDWGR 
LSTEQRVELL YGVANEITRR FDDFVEAEMA DTGQPAHVMK QVFIPRGAAN FKVFADVVKN
VASESFQTAT PDGRGALNYA LRVPKGVIGV ICPWNAPFML MTWKVGPALA CGNAVVVKPS
EESPQTAALL GEVMNAVGIP KGVYNVVQGF GPDSAGEFVT QHPGVDAITF TGETRTGAAI
MKAASEGMRD VSFELGGKNA GIVFADCDFE AAVEGIFRSA FLNSGQVCLG TERVYVERPI
FERFVQALKV KAESVRFGRP DDHDANYGPL ISQEHRQKVL SYYRKALEEG ATLVTGGGVP
EMPGELAEGA WVQPTIWTGL PESAAVVREE IFGPCCHIRP FDAEDEVVQL ANATDYGLST
TLWTNDLARA HRLAARVEVG ITWINSWFLR DLRTAFGGAK QSGIGREGGV HSLEFYTETR
NVCVKL