Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_30620 |
Symbol | lapC |
ID | 7761962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3169742 |
End bp | 3171202 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643805938 |
Product | 2-hydroxymuconic semialdehyde dehydrogenase; LapC |
Protein accession | YP_002800202 |
Protein GI | 226945129 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.141179 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAAGA TCGAGAACTT CATTGCCGGC GAGTACGTCG CCGCCGCCAG CGGCAGGCGT TTCGACAAGC GTTCGCCGCT CGACAACCGG GTGATCGCCT CGATCGCCGA GGCCGGGCGC GCCGAGGTGG ATGCGGCGGT CGGGGCCGCG CGCGGCGCGC TCGCCGGCGA CTGGGGCAGG CTGAGTACCG AACAGCGCGT CGAGCTGCTG TACGGCGTGG CCAACGAGAT CACCCGCCGC TTCGATGATT TCGTCGAGGC GGAGATGGCC GACACCGGCC AGCCGGCGCA CGTGATGAAG CAGGTGTTCA TCCCGCGCGG CGCGGCCAAC TTCAAGGTGT TCGCCGACGT GGTGAAGAAC GTCGCCAGCG AATCCTTCCA GACGGCCACC CCGGACGGGC GCGGCGCGCT CAATTACGCG CTGCGCGTGC CCAAGGGGGT GATCGGGGTG ATCTGCCCGT GGAACGCGCC CTTCATGCTG ATGACCTGGA AGGTCGGCCC GGCGCTGGCC TGCGGCAACG CGGTGGTGGT CAAGCCGTCC GAGGAGTCGC CGCAGACCGC CGCGCTGCTC GGCGAGGTGA TGAACGCGGT GGGCATTCCC AAGGGCGTCT ACAACGTGGT GCAGGGGTTC GGCCCGGATT CGGCGGGCGA ATTCGTCACC CAGCACCCGG GCGTCGACGC CATCACCTTC ACCGGCGAAA CGCGCACCGG CGCGGCGATC ATGAAGGCCG CCTCGGAAGG CATGCGCGAC GTATCCTTCG AGCTGGGCGG CAAGAACGCC GGCATCGTCT TCGCCGATTG CGACTTCGAG GCGGCGGTGG AGGGCATCTT CCGCTCCGCT TTCCTCAATT CCGGGCAGGT GTGCCTGGGC ACCGAACGGG TGTACGTCGA GCGGCCGATC TTCGAGCGCT TCGTGCAGGC GCTCAAGGTC AAAGCGGAAA GCGTCCGCTT CGGCCGCCCG GACGATCACG ACGCCAATTA TGGTCCGCTG ATCAGCCAGG AGCACCGCCA GAAGGTGCTG TCCTACTACC GCAAGGCGCT GGAAGAGGGC GCCACGCTGG TCACCGGCGG CGGCGTGCCG GAGATGCCGG GCGAACTGGC CGAGGGCGCC TGGGTGCAGC CGACCATCTG GACCGGCCTG CCGGAAAGCG CCGCGGTGGT GCGCGAGGAG ATCTTCGGAC CCTGCTGCCA CATCCGCCCG TTCGACGCCG AGGACGAGGT GGTGCAACTC GCCAACGCCA CCGACTACGG CCTGTCCACC ACGCTCTGGA CCAACGACCT GGCCCGCGCC CACCGCCTGG CGGCGCGCGT CGAGGTGGGC ATCACCTGGA TCAACAGTTG GTTCCTGCGC GACCTGCGCA CCGCCTTCGG CGGCGCCAAG CAGTCCGGCA TCGGCCGCGA GGGCGGCGTG CATTCGCTGG AGTTCTACAC CGAGACGCGC AACGTCTGCG TGAAGCTCTG A
|
Protein sequence | MRKIENFIAG EYVAAASGRR FDKRSPLDNR VIASIAEAGR AEVDAAVGAA RGALAGDWGR LSTEQRVELL YGVANEITRR FDDFVEAEMA DTGQPAHVMK QVFIPRGAAN FKVFADVVKN VASESFQTAT PDGRGALNYA LRVPKGVIGV ICPWNAPFML MTWKVGPALA CGNAVVVKPS EESPQTAALL GEVMNAVGIP KGVYNVVQGF GPDSAGEFVT QHPGVDAITF TGETRTGAAI MKAASEGMRD VSFELGGKNA GIVFADCDFE AAVEGIFRSA FLNSGQVCLG TERVYVERPI FERFVQALKV KAESVRFGRP DDHDANYGPL ISQEHRQKVL SYYRKALEEG ATLVTGGGVP EMPGELAEGA WVQPTIWTGL PESAAVVREE IFGPCCHIRP FDAEDEVVQL ANATDYGLST TLWTNDLARA HRLAARVEVG ITWINSWFLR DLRTAFGGAK QSGIGREGGV HSLEFYTETR NVCVKL
|
| |