Gene Information |
Locus tag | Avin_32550 |
Symbol | |
ID | 7762154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3340794 |
End bp | 3342041 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643806125 |
Product | Aldo/keto reductase protein |
Protein accession | YP_002800389 |
Protein GI | 226945316 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
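The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table.
START_BP = 3340794
END_BP = 3342041


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                   # 1248
print(protein_length(gene_length(START_BP, END_BP)))   # 415
```

Under these assumptions both results agree with the "Gene Length" and "Protein Length" rows of the record.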
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCGGA ACGACAAACA CGACCCCCAC GGCCACGGGC CGGCCGACCC GCAACGCCGC CAGTTGCTGG CGACTGCGGC TATCCTCGGC GTCGCGCCGT GGCTGCTGTC GGCCTGCACC GCGCCGTCGG CTTCCACGGC CAATACCGCC GCGCCGGCAC AACCCGCCGG CACCGGCAGC GCCACCACCA CCGGCATCTT GCCGACCAGC CAGCGCCGCA AGCTGGGCGC GCTGGAGGTT TCCTCCATCG GCCTGGGGTG CCAGTGGGTG CCCGCCGCGG TCGAGGGGTC GGTCTCCGAC CGTTATGGCA GCACCATCGA CCGGAAAACC GCGATCAACC TGATTCGCAC GGCCGTCGAT TCCGGCGTGA CCCTGTTCGA CACCGCCGAA GCCTATGGCC CGTACCTGTC CGAGGAAGTC GTCGGCGAAG CCTTGCAGGG CGTGCGCGAT CAGGTGGTCA TCGAGACCAA GTACGGCTTC AGCTTCGACC CGAAGGTCGC GGCGGCGCGC GGCGGCCGCG ACAGCCGGCC CGAACACATC AAGCAGGTGG TCGAGGGCAT GCTCAAGCGC CTGCGCACCG ACCGCATCGA CCTGCTGTAC CAGCACCGCG TCGATCCGCA GGTGCCGATC GAGGACGTGG CCGGGGCGAT CAAGGACCTG ATCGCCGAGG GCAAGGTGCT GAACTACGGT CTTTCCGAGC CGGGCATCCA GACCATCCGC CGTGCCCATG CCGAACATCC GCTGGCGGCG ATCCAGAACG AATACTCCAT GCTCTGGCGC GGGCCGGAAG CGGAGGTACT GCCGGTGTGC GAGGAACTGG GCATCGGCTT TGTGCCGTGG AGCCCGATGG GCATGGGTTT CCTCAGCGGC ACGATCACGG CCGAGACGCG CTTCGTCCCC GATGGCGACC GTGAATTCCG TGTCGCCGTG CCGCGCTTCG CCCCCGACAA CCTGCGCGCG AACATGGCGC TGGTGGAGGT GGTCAAGACC TGGGCGCAGC GCAAGAACGC GACGCCGGCC CAGCTCGCGC TGGCCTGGCT GCTGGCGCAG AAGCCGTGGA TCGTGCCGAT TCCGGGCACG ACCAAGATCG CCCACCTGAA GGAGAACCTC GGCGCCGCCG CGATCACCTT CAGCGGCGAG GAACTGCGCG AACTCAATGC CACCGTGGCC GCCGTCCCGA TCCAGGGCGA CCGACTGCCT CCGGGGGTCA TGCAGTTGTC CGGCGTGGAA GCGCCGCCGA AGCGCTGA
|
Protein sequence | MTRNDKHDPH GHGPADPQRR QLLATAAILG VAPWLLSACT APSASTANTA APAQPAGTGS ATTTGILPTS QRRKLGALEV SSIGLGCQWV PAAVEGSVSD RYGSTIDRKT AINLIRTAVD SGVTLFDTAE AYGPYLSEEV VGEALQGVRD QVVIETKYGF SFDPKVAAAR GGRDSRPEHI KQVVEGMLKR LRTDRIDLLY QHRVDPQVPI EDVAGAIKDL IAEGKVLNYG LSEPGIQTIR RAHAEHPLAA IQNEYSMLWR GPEAEVLPVC EELGIGFVPW SPMGMGFLSG TITAETRFVP DGDREFRVAV PRFAPDNLRA NMALVEVVKT WAQRKNATPA QLALAWLLAQ KPWIVPIPGT TKIAHLKENL GAAAITFSGE ELRELNATVA AVPIQGDRLP PGVMQLSGVE APPKR
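The sequence fields are printed in space-separated blocks of ten characters, so any script that reads them has to strip the whitespace first. The following is a minimal sketch, using only the Python standard library, of how the GC content and the translation could be recomputed from the gene sequence. The helper names `clean`, `gc_content`, and `translate` are hypothetical. The codon table is the standard one, which is assumed to be sufficient here because translation table 11 assigns the same amino acids to codons as the standard table and differs in its permitted start codons.

```python
from itertools import product

# Standard codon table built in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGACCCGGA ACGACAAACA CGACCCCCAC"))  # MTRNDKHDPH
```

Passing the full "Gene sequence" value to `translate` and `gc_content` would allow a comparison with the "Protein sequence" and "GC content" fields; the short example only covers the first ten codons, which match the first block of the protein sequence.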