Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_49290 |
Symbol | |
ID | 7763784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4986610 |
End bp | 4988190 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643807765 |
Product | Aldehyde dehydrogenase |
Protein accession | YP_002802000 |
Protein GI | 226946927 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.213935 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCGGA TCATCGGTCA CAACTACATC GCGGGAACCC GCAGCAACGC GGGCGACATC CGCGTCCACA GCGTCGACGC CAGCACCGGC GAAAAGTTGT CCCATGACTT CTACCAGGCC ACCCCCGCCG AGGTGAACGC CGCCGCCGAG GCCGCCGCCA CCGCCTACCC GACCTACCGC GCCCTGCCCG CCGCCCGCCG CGCCGACTTC CTCGACGCCA TCGCCGCCGA ACTCGACGCC CTCGGCGACG ACTTCGTCGC CCTGGTCGGC CGCGAGACCG CCCTGCCCGC CGCGCGCATC CAGGGCGAGC GCGGCCGCAC CAGCAACCAG ATGCGCCTGT TCGCCCAGCT CCTGCGCCGC GGCGACTTCC ACGGCGCGCG CATCGACCGC GCCCTGCCCG AGCGTCAGCC GCTGCCGCGC GTCGACCTGC GCCAGTGCCG CATCGGCCTC GGCCCGGTCG CCGTGTTCGG CGCCTCCAAC TTCCCGCTGG CCTTTTCCAC CGCCGGCGGC GACACCGCCG CCGCCCTCGC CGCCGGCTGC CCGGTGGTGT TCAAGGCGCA CAGCGGCCAC ATGGCCACCG CCGAGTGCGT GGCCGACGCC ATCGTCCGCG CCGCCGAAAA GACCGGCATG CCCGCCGGGG TGTTCAACAT GATCTACGGC AACGGCGTCG GCGAGGCGCT GGTCAAGCAC CCGGCGATCC AGGCGGTCGG TTTCACCGGC TCGCTCAAGG GCGGCCGCGC CCTCTGCGAC ATGGCCGCCG CCCGCGCGCA GCCGATCCCG GTGTTCGCCG AGATGTCGAG CATCAACCCG GTGCTGCTGC TGCCCGAGGC GCTCAAGCGA CGCGGCGAGC AGATCGCCCG GGAACTGAGC GCGTCGGTGA CGATGGGTTG CGGGCAGTTC TGCACCAACC CCGGGCTGAT CCTCGGCCTG CGCTCGGCGC AGTTCTCCGC CTTCCTGGAG GTCTTCGTCG CCGCCATGGC CGAACAGGGC CCGCAGACCA TGCTCAACGC CGGCACCCTG AAGAGCTACG AGCAGGGCAT CGCCGCCCTG CATGCCCACT CCGGGGTCAG GCACCTGGCC GGCCGGAAAC AGGAAGGCGG GCAGGCCCGG CCGCAACTGT TCCAGGCCGA CGTCGCCCTG CTGCTGGACG GCGACGAACT GCTGCAGGAA GAGGTCTTCG GCCCGACCAC CGTGGTCGTC GAGGTCGCCG ACCAGGCCGA GCTGGTCCGC GCCCTGCAGG CCCTGCACGG CCAGCTCACC GCCACCCTGA TCGCCGAGCC GGCGGACCTG TCCGCCTTCG CGTCCCTGGT GCCGGTGCTC GAGCAGAAGG CCGGGCGCCT GTTGGTCAAC GGCTACCCGA CCGGGGTGGA GGTGTGCGAT GCGATGGTCC ACGGCGGCCC TTACCCGGCT ACCTCGGACG CCCGCGGCAG CTCGGTGGGC ACGCTGGCGA TCGAGCGCTT CCTGCGCCCG CTGTGCTACC AGAACTACCC GGACGAACTG CTGCCGGACG CGCTGAAGAA CGCCAACCCG CTGGGCATCC TGCGCCTGGT CGACGGCCAG CCGAGCCGCG AGGCGCTGTA A
|
Protein sequence | MPRIIGHNYI AGTRSNAGDI RVHSVDASTG EKLSHDFYQA TPAEVNAAAE AAATAYPTYR ALPAARRADF LDAIAAELDA LGDDFVALVG RETALPAARI QGERGRTSNQ MRLFAQLLRR GDFHGARIDR ALPERQPLPR VDLRQCRIGL GPVAVFGASN FPLAFSTAGG DTAAALAAGC PVVFKAHSGH MATAECVADA IVRAAEKTGM PAGVFNMIYG NGVGEALVKH PAIQAVGFTG SLKGGRALCD MAAARAQPIP VFAEMSSINP VLLLPEALKR RGEQIARELS ASVTMGCGQF CTNPGLILGL RSAQFSAFLE VFVAAMAEQG PQTMLNAGTL KSYEQGIAAL HAHSGVRHLA GRKQEGGQAR PQLFQADVAL LLDGDELLQE EVFGPTTVVV EVADQAELVR ALQALHGQLT ATLIAEPADL SAFASLVPVL EQKAGRLLVN GYPTGVEVCD AMVHGGPYPA TSDARGSSVG TLAIERFLRP LCYQNYPDEL LPDALKNANP LGILRLVDGQ PSREAL
|
| |