Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_22040 |
Symbol | |
ID | 7761122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2202322 |
End bp | 2203926 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643805089 |
Product | NAD-dependent aldehyde dehydrogenase |
Protein accession | YP_002799370 |
Protein GI | 226944297 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGTTT TTCCTAGTAT TCAGGACATT CCGGAGAAGT ACCGCCTGGG CGCGCCCATC GAACAGCGCG ACTACCTGGT CGACGGCGAA CTGCGCCGCT GGGACGGCCC GCTGGCCGCC GTGCGCAGCC CCATCCACCT GAAGACCGCC AAGGGCGACG AGCAGGTCGT CCTCGGCAGC ACCCCGCTGC TCGACGCCCA GGCTGCGCTG GGCGCCCTGG ACGCCGCGGT CAGGGCCTAC GACAACGGCC AGGGCCTGTG GCCGAGCATG CCGGTGGCCG GGCGCATCCA GCACGTCGAG ACCTTCCTGG CCCGCATGCG TGAACAGCGC GAGGCGGTGG TCAAACTGCT GATGTGGGAG ATCGGCAAGA ACCTCAAGGA CGCCGAGAAG GAATTCGACC GCACCTGCGA CTACATCGTC GACACCATCC ACGAACTCAA GGAACTCGAC CGCCGCTCCA GCCGCTTCGA GCTGGAGCAG GGCACCCTCG GCCAGATCCG CCGCGTGCCG CTGGGCGTGG CGCTGTGCAT GGGCCCCTAC AACTACCCGC TGAACGAGAC CTTCACCACC CTGATCCCGG CGCTGATCAT GGGCAACACC GTGGTGTTCA AGCCGGCCAA GTTCGGCGTG CTGCTGATCC GCCCGCTGCT CGAGGCGTTC CGCGACAGCT TCCCGGCCGG GGTGATCAAC GTCATCTACG GGCGCGGCCG CGAGACCGTC AGCGCGCTGA TGGAAAGCGG CAAGGTGGAC GTGTTCGCCT TCATCGGCAC CAACAAGGGC GCCAGCGACC TGAAGAAGCT GCACCCGCGC CCGCACCGCC TGCGCGCAAT CCTCGGCCTG GACGCCAAGA ACCCCGGCAT CGTCCTGCCC GAGGTGGACC TGGACAACGC GGTCGGCGAG GCGATCACCG GCGCCCTGTC GTTCAACGGC CAGCGCTGCA CGGCGCTGAA GATCCTCTTC GTCCACGAAC AAGTGGTCGA CGCCTTCCTG GAGAAGTTCA ACGCCAGGCT GGCCGCGCTC AAGTCGGGCA TGCCCTGGGA ACCAGGGGTG GCGCTGACCC CGTTGCCGGA GCCGGGCAAG ACCGATTTTC TCGCCACCCT GGTGGCCGAC GCCCTGGCCA AGGGCGCGAA GGTGGTCAAC CCCGGCGGTG GCGAGGTGCG CGAGACCTTC TTCTACCCGG CGCTGCTCTA CCCGGTGAGT CCGCAGATGC GCGTCTACCA GGAGGAGCAG TTCGGCCCGC TGATCCCGGT GGTGCCTTAC CGCGACCTGC AGACGGTGAT CGACTACGTG CGCGAGTCGG ACTTCGGCCA GCAGCTGTCG ATCTTCGGCA ACGACCCGCA GCAGGTCGCC AGGCTGGTGG ATGCCTTCGC CAACCAGGTC GGGCGGATCA ACCTCAACAC CCAGTGCCAG CGCGGGCCGG ACAGCTTCCC GTTCAACGGC CGCAAGAACT CGGCGGAGGG GACTCTGTCG GTGTACGACG CGCTGCGGGC GTTCTCGATC CGCACGCTGG TGGCGACCAA GCTTCAGGAG GACAACAAGC AGTTGATCAG CGACATCATC CGCAACCGCG AGTCGAGCTT CCTGACCACC GATTATCTTT TTTGA
|
Protein sequence | MIVFPSIQDI PEKYRLGAPI EQRDYLVDGE LRRWDGPLAA VRSPIHLKTA KGDEQVVLGS TPLLDAQAAL GALDAAVRAY DNGQGLWPSM PVAGRIQHVE TFLARMREQR EAVVKLLMWE IGKNLKDAEK EFDRTCDYIV DTIHELKELD RRSSRFELEQ GTLGQIRRVP LGVALCMGPY NYPLNETFTT LIPALIMGNT VVFKPAKFGV LLIRPLLEAF RDSFPAGVIN VIYGRGRETV SALMESGKVD VFAFIGTNKG ASDLKKLHPR PHRLRAILGL DAKNPGIVLP EVDLDNAVGE AITGALSFNG QRCTALKILF VHEQVVDAFL EKFNARLAAL KSGMPWEPGV ALTPLPEPGK TDFLATLVAD ALAKGAKVVN PGGGEVRETF FYPALLYPVS PQMRVYQEEQ FGPLIPVVPY RDLQTVIDYV RESDFGQQLS IFGNDPQQVA RLVDAFANQV GRINLNTQCQ RGPDSFPFNG RKNSAEGTLS VYDALRAFSI RTLVATKLQE DNKQLISDII RNRESSFLTT DYLF
|
| |