Gene Information |
Locus tag | Avin_38050 |
Symbol | |
ID | 7762696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3850165 |
End bp | 3851370 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643806669 |
Product | Aldo/keto reductase |
Protein accession | YP_002800922 |
Protein GI | 226945849 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
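The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3850165, 3851370)
print(length)                     # 1206
print(protein_length_aa(length))  # 401
```

Both values match the Gene Length (1206 bp) and Protein Length (401 aa) fields of this record.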
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGTTG AAGATGAAAA AGATAGGCCT GGCTCCGATC TGACGCTGCC CGGCCGGAGA AAGGTGTTGG CTACGGGCGC GGTATTGGCG GCCTTGCCGC TGTTGGCGAG TTTGCCATCG ATGGGTTTCG CACAGCAGAC GACGGCGGGC ACCGGGGGCC GTGCAAAGGC CGATATCCAT AGCGCCCGCC GCAGGCCCGG TGCGCTGGAG GTCTCCGCCC TGGGCCTGGG CTGCATGAGC ATGAACGGCG GCCAGTACAA CCCGCCCGGG GACAAGCGCG AGATGATCCG CGTCATCCAC GCCGCCATCG ATCGCGGGGT GGACTTCTTC GATACCGCCG AAGTCTACGG CCCGTTCATC AACGAGGAAC TGCTTGGCGA GGCGCTCGCC CCGTATCGGG ACAAAGCGGT CATCGCGACC AAGTTCGGCT TCGGCATCGA CCCGGCGAGT GCATTGCGCA TCGGCGGCCT CGACAGTCGA CCGGAGCATA TCCGGGCGGT CGCGGAGACG TCGCTCAGGC GCCTGCGGAC CGACCGCATC GACCTGTTCT ACCAGCACCG CGTCGACCCG GCCGTGCCGA TCGAGGACGT GGCCGGCACG GTGAAAGACC TGATCGCCGA AGGCAAGGTC AAGCACTTCG GTCTCTCCGA GCCTGGCCTG CAGACCGTGC GCCGGGCGCA CGCGGTACAG CCCGTGGCAG CGATTCAGAA CGAGTACTCA CTGCTGTGGC GGGGACCGGA ACTGGGCTTG CTGGAGTCGT GCGAGGAGCT CGGTATCGGC CTGGTGCCCT GGAGTCCGTT AGGCGCCGGC CTGCTCACCG GCACGCTCGA CGCCGATACC CGCTTCGACG CCCCCGGATA CACGGACTAC CGCCGCACCA ACCCGCGCTT CGCCCCCGAA GCGCTCACGG GCAACATGGC ATTGGTCGAG CTGGCCCGCG AATGGGCGCA ACGCAAGGAA GCCACGCCGT CGCAGATCGC GCTGGCCTGG CTGCTGGCTC AGCGACCGTG GATCGTGCCC ATTCCCGGCA CCACCAACAT CCAGCACCTG GACGAGAACC TCGGCGCGAT CAACCTCCAG TTCAGCGCAG CGGAGATGCA GGCGTTCAAC ACCGCGTTGG CGCAGATCGT GGTGCATGGC GAAAGAGGAA CCCCGAGGCT GCTGGAGATG GTCGGGCGGG ATACGCCCCT GCCAAAAGGG CGGTGA
Protein sequence | MYVEDEKDRP GSDLTLPGRR KVLATGAVLA ALPLLASLPS MGFAQQTTAG TGGRAKADIH SARRRPGALE VSALGLGCMS MNGGQYNPPG DKREMIRVIH AAIDRGVDFF DTAEVYGPFI NEELLGEALA PYRDKAVIAT KFGFGIDPAS ALRIGGLDSR PEHIRAVAET SLRRLRTDRI DLFYQHRVDP AVPIEDVAGT VKDLIAEGKV KHFGLSEPGL QTVRRAHAVQ PVAAIQNEYS LLWRGPELGL LESCEELGIG LVPWSPLGAG LLTGTLDADT RFDAPGYTDY RRTNPRFAPE ALTGNMALVE LAREWAQRKE ATPSQIALAW LLAQRPWIVP IPGTTNIQHL DENLGAINLQ FSAAEMQAFN TALAQIVVHG ERGTPRLLEM VGRDTPLPKG R
| |
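The sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that grouping, compute a GC fraction, and translate a CDS. It assumes translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter for a gene beginning with ATG, as this one does). The helper names are hypothetical, and only the first 30 bases of the gene are used as a demonstration input.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for block formatting and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
fragment = "ATGTACGTTG AAGATGAAAA AGATAGGCCT"
print(translate(fragment))  # MYVEDEKDRP
```

The output agrees with the first block of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.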