Gene Avin_51520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_51520 
Symbol 
ID7763991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5238683 
End bp5239726 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content69% 
IMG OID643807970 
ProductAldo/keto reductase 
Protein accessionYP_002802204 
Protein GI226947131 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTATA GCGCCGCGCC GAACCGCTAC GAGCGCATCC CCTACCGCCG CGTCGGCCGC 
AGCGGCCTGG TGCTGCCGGC CCTGTCCCTG GGCCTGTGGC ACAACTTCGG TGACAGCACC
CCGCTCGATA CCCAGCGCGC CCTGCTGCGC ACCGCCTTCG ACCTCGGCAT CAACCACTTC
GACCTGGCGA ACAACTACGG GCCGCCCTAC GGCAGCGCCG AGACCAACTT CGGCCGCCTG
CTGCGCGAGG ATTTCCGCGC CTATCGCGAC GAGCTGATCC TCTCCACCAA GGCCGGCTGG
GACATGTGGC CCGGCCCCTA CGGCCAGGGC GGCGGTTCGC GCAAGTACGT ACTCGCCAGC
CTCGACCAGA GCCTGCGGCG CATGGGCGTC GACTACGTGG ACATCTTCTA TTCGCACCGC
TTCGATCCGC ACACGCCACT GGAGGAAACC GCCGGCGCCC TGGCCGACAC CGTGCGCCAG
GGCAAGGCGC TATACGTGGG CATCTCCGCC TATTCGGAAG CCAAGACACG GGAAATGGCC
GCCCTGTTGC ACGAGCACAG GGTGCCGCTG CTGATCCACC AGCCGGCCTA CAACCTGTTC
AACCGCTGGA TCGAGAAGGA CCTGCTGGCC ACCACCGAGG ACCTCGGCGC CGGCGTGATC
GCCTTCACCC CCCTGGCCCA GGGGCTGCTC ACCGACAAGT ACCTGGATGG CATCCCCGCC
AATGCGCGGA TCAACCGTCC CGGCGGCGCC TCGCTGCGCC CCGAGCACCT GTCCGAGGCG
AACATTCGGC GTGCCCGGGC GCTCGCCGAG ATTGCCCGCC GGCGCGGGCA GAGCCTGGCC
CAGTTGGCCC TCGCCTGGCT GCTGCGCGAT GCGCGGGTGA CTTCGGCGCT GATCGGCGCC
AGCCGCCCGG AACAGCTCGT CGAGAATGTC GCGGCGCTGG ACAACCTGGC ATTCAGCCCC
GAAGAACTGG CGGAGATCGA CCGTCACGCC GCCGCAAGCG GCGTCAATCT CTGGGACAGG
CCCTACACCG ACTGGCCAGC GTGA
 
Protein sequence
MSYSAAPNRY ERIPYRRVGR SGLVLPALSL GLWHNFGDST PLDTQRALLR TAFDLGINHF 
DLANNYGPPY GSAETNFGRL LREDFRAYRD ELILSTKAGW DMWPGPYGQG GGSRKYVLAS
LDQSLRRMGV DYVDIFYSHR FDPHTPLEET AGALADTVRQ GKALYVGISA YSEAKTREMA
ALLHEHRVPL LIHQPAYNLF NRWIEKDLLA TTEDLGAGVI AFTPLAQGLL TDKYLDGIPA
NARINRPGGA SLRPEHLSEA NIRRARALAE IARRRGQSLA QLALAWLLRD ARVTSALIGA
SRPEQLVENV AALDNLAFSP EELAEIDRHA AASGVNLWDR PYTDWPA