Gene Avin_38050 details

Gene Information

Locus tag: Avin_38050
Symbol:
ID: 7762696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3850165
End bp: 3851370
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 67%
IMG OID: 643806669
Product: Aldo/keto reductase
Protein accession: YP_002800922
Protein GI: 226945849
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
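
The coordinate and length fields above are related by simple arithmetic. The sketch below shows that relationship under two assumptions that the record itself does not state: that the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
print(gene_length(3850165, 3851370))  # 1206
print(protein_length(1206))           # 401
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.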


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACGTTG AAGATGAAAA AGATAGGCCT GGCTCCGATC TGACGCTGCC CGGCCGGAGA 
AAGGTGTTGG CTACGGGCGC GGTATTGGCG GCCTTGCCGC TGTTGGCGAG TTTGCCATCG
ATGGGTTTCG CACAGCAGAC GACGGCGGGC ACCGGGGGCC GTGCAAAGGC CGATATCCAT
AGCGCCCGCC GCAGGCCCGG TGCGCTGGAG GTCTCCGCCC TGGGCCTGGG CTGCATGAGC
ATGAACGGCG GCCAGTACAA CCCGCCCGGG GACAAGCGCG AGATGATCCG CGTCATCCAC
GCCGCCATCG ATCGCGGGGT GGACTTCTTC GATACCGCCG AAGTCTACGG CCCGTTCATC
AACGAGGAAC TGCTTGGCGA GGCGCTCGCC CCGTATCGGG ACAAAGCGGT CATCGCGACC
AAGTTCGGCT TCGGCATCGA CCCGGCGAGT GCATTGCGCA TCGGCGGCCT CGACAGTCGA
CCGGAGCATA TCCGGGCGGT CGCGGAGACG TCGCTCAGGC GCCTGCGGAC CGACCGCATC
GACCTGTTCT ACCAGCACCG CGTCGACCCG GCCGTGCCGA TCGAGGACGT GGCCGGCACG
GTGAAAGACC TGATCGCCGA AGGCAAGGTC AAGCACTTCG GTCTCTCCGA GCCTGGCCTG
CAGACCGTGC GCCGGGCGCA CGCGGTACAG CCCGTGGCAG CGATTCAGAA CGAGTACTCA
CTGCTGTGGC GGGGACCGGA ACTGGGCTTG CTGGAGTCGT GCGAGGAGCT CGGTATCGGC
CTGGTGCCCT GGAGTCCGTT AGGCGCCGGC CTGCTCACCG GCACGCTCGA CGCCGATACC
CGCTTCGACG CCCCCGGATA CACGGACTAC CGCCGCACCA ACCCGCGCTT CGCCCCCGAA
GCGCTCACGG GCAACATGGC ATTGGTCGAG CTGGCCCGCG AATGGGCGCA ACGCAAGGAA
GCCACGCCGT CGCAGATCGC GCTGGCCTGG CTGCTGGCTC AGCGACCGTG GATCGTGCCC
ATTCCCGGCA CCACCAACAT CCAGCACCTG GACGAGAACC TCGGCGCGAT CAACCTCCAG
TTCAGCGCAG CGGAGATGCA GGCGTTCAAC ACCGCGTTGG CGCAGATCGT GGTGCATGGC
GAAAGAGGAA CCCCGAGGCT GCTGGAGATG GTCGGGCGGG ATACGCCCCT GCCAAAAGGG
CGGTGA
 
Protein sequence
MYVEDEKDRP GSDLTLPGRR KVLATGAVLA ALPLLASLPS MGFAQQTTAG TGGRAKADIH 
SARRRPGALE VSALGLGCMS MNGGQYNPPG DKREMIRVIH AAIDRGVDFF DTAEVYGPFI
NEELLGEALA PYRDKAVIAT KFGFGIDPAS ALRIGGLDSR PEHIRAVAET SLRRLRTDRI
DLFYQHRVDP AVPIEDVAGT VKDLIAEGKV KHFGLSEPGL QTVRRAHAVQ PVAAIQNEYS
LLWRGPELGL LESCEELGIG LVPWSPLGAG LLTGTLDADT RFDAPGYTDY RRTNPRFAPE
ALTGNMALVE LAREWAQRKE ATPSQIALAW LLAQRPWIVP IPGTTNIQHL DENLGAINLQ
FSAAEMQAFN TALAQIVVHG ERGTPRLLEM VGRDTPLPKG R
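
The sequences above are printed in blocks of ten characters, so they need to be joined before any calculation such as GC content. The following is a minimal sketch of that step; the helper names are hypothetical, and the toy input stands in for the full gene sequence, which can be pasted in as is.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a normalized nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input in the same blocked layout as the record (not the real gene).
toy = """ATGC GGCC
TTAA"""
clean = normalize_sequence(toy)
print(clean)               # ATGCGGCCTTAA
print(gc_fraction(clean))  # 0.5
```

Applying the same two calls to the gene sequence gives a value that can be compared with the GC content field of the record, which is reported as a rounded percentage.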