Gene Avin_32550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_32550 
Symbol 
ID7762154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3340794 
End bp3342041 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content69% 
IMG OID643806125 
ProductAldo/keto reductase protein 
Protein accessionYP_002800389 
Protein GI226945316 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCGGA ACGACAAACA CGACCCCCAC GGCCACGGGC CGGCCGACCC GCAACGCCGC 
CAGTTGCTGG CGACTGCGGC TATCCTCGGC GTCGCGCCGT GGCTGCTGTC GGCCTGCACC
GCGCCGTCGG CTTCCACGGC CAATACCGCC GCGCCGGCAC AACCCGCCGG CACCGGCAGC
GCCACCACCA CCGGCATCTT GCCGACCAGC CAGCGCCGCA AGCTGGGCGC GCTGGAGGTT
TCCTCCATCG GCCTGGGGTG CCAGTGGGTG CCCGCCGCGG TCGAGGGGTC GGTCTCCGAC
CGTTATGGCA GCACCATCGA CCGGAAAACC GCGATCAACC TGATTCGCAC GGCCGTCGAT
TCCGGCGTGA CCCTGTTCGA CACCGCCGAA GCCTATGGCC CGTACCTGTC CGAGGAAGTC
GTCGGCGAAG CCTTGCAGGG CGTGCGCGAT CAGGTGGTCA TCGAGACCAA GTACGGCTTC
AGCTTCGACC CGAAGGTCGC GGCGGCGCGC GGCGGCCGCG ACAGCCGGCC CGAACACATC
AAGCAGGTGG TCGAGGGCAT GCTCAAGCGC CTGCGCACCG ACCGCATCGA CCTGCTGTAC
CAGCACCGCG TCGATCCGCA GGTGCCGATC GAGGACGTGG CCGGGGCGAT CAAGGACCTG
ATCGCCGAGG GCAAGGTGCT GAACTACGGT CTTTCCGAGC CGGGCATCCA GACCATCCGC
CGTGCCCATG CCGAACATCC GCTGGCGGCG ATCCAGAACG AATACTCCAT GCTCTGGCGC
GGGCCGGAAG CGGAGGTACT GCCGGTGTGC GAGGAACTGG GCATCGGCTT TGTGCCGTGG
AGCCCGATGG GCATGGGTTT CCTCAGCGGC ACGATCACGG CCGAGACGCG CTTCGTCCCC
GATGGCGACC GTGAATTCCG TGTCGCCGTG CCGCGCTTCG CCCCCGACAA CCTGCGCGCG
AACATGGCGC TGGTGGAGGT GGTCAAGACC TGGGCGCAGC GCAAGAACGC GACGCCGGCC
CAGCTCGCGC TGGCCTGGCT GCTGGCGCAG AAGCCGTGGA TCGTGCCGAT TCCGGGCACG
ACCAAGATCG CCCACCTGAA GGAGAACCTC GGCGCCGCCG CGATCACCTT CAGCGGCGAG
GAACTGCGCG AACTCAATGC CACCGTGGCC GCCGTCCCGA TCCAGGGCGA CCGACTGCCT
CCGGGGGTCA TGCAGTTGTC CGGCGTGGAA GCGCCGCCGA AGCGCTGA
 
Protein sequence
MTRNDKHDPH GHGPADPQRR QLLATAAILG VAPWLLSACT APSASTANTA APAQPAGTGS 
ATTTGILPTS QRRKLGALEV SSIGLGCQWV PAAVEGSVSD RYGSTIDRKT AINLIRTAVD
SGVTLFDTAE AYGPYLSEEV VGEALQGVRD QVVIETKYGF SFDPKVAAAR GGRDSRPEHI
KQVVEGMLKR LRTDRIDLLY QHRVDPQVPI EDVAGAIKDL IAEGKVLNYG LSEPGIQTIR
RAHAEHPLAA IQNEYSMLWR GPEAEVLPVC EELGIGFVPW SPMGMGFLSG TITAETRFVP
DGDREFRVAV PRFAPDNLRA NMALVEVVKT WAQRKNATPA QLALAWLLAQ KPWIVPIPGT
TKIAHLKENL GAAAITFSGE ELRELNATVA AVPIQGDRLP PGVMQLSGVE APPKR