Gene Avin_15120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_15120 
SymbolmhpF 
ID7760447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1488179 
End bp1489114 
Gene Length936 bp 
Protein Length311 aa 
Translation table11 
GC content70% 
IMG OID643804409 
Productacetaldehyde dehydrogenase 
Protein accessionYP_002798702 
Protein GI226943629 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4569] Acetaldehyde dehydrogenase (acetylating) 
TIGRFAM ID[TIGR03215] acetaldehyde dehydrogenase (acetylating) 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.179269 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC TCAAAGTCGC CATCGTCGGC TCCGGCAACA TCGGCACCGA CCTGATGATC 
AAGATCCTGC GCCACGGCCA GCACCTGGAA ATGGGCGCCC TGGTCGGCAT CGACCCGGAT
TCCGACGGCC TGGCCCGCGC CGCGCGCCTC GGCGTGGCCA CCACCGCCGA GGGCGTCGAG
GGCCTGGCCC GCCTGCCGGG GTTCGGCGAG ATCGATTTCG TCTTCGACGC CACCTCGGCC
GCCGCCCACG TGAAGAACGA CCCGTTCCTG CGCGGCCTCA GGCCCGGCCT GCGGCTGATC
GACCTGACCC CGGCGGCCGT CGGCCCCTAC TGCGTGCCGG TGGTGAATCT CGAGCAGAAC
CTGCGCGAAC CCAACGTCAA CATGGTCACC TGCGGCGGCC AGGCGACCAT CCCCATGGTC
GCCGCGGTGT CGCGGGTGGC CAGGGTCCAC TACGCCGAGA TCGTCGCCTC GATCGCCAGC
CGGTCGGCCG GCCCCGGCAC CCGCGCCAAC ATCGACGAAT TCACCGAGAC CACCTCGAAA
GCCATCGAGG CGATCGGCGG GGCGCGCAAG GGCAAGGCGA TCATCGTCCT CAACCCGGCC
GAGCCGCCGC TGATCATGCG CGACACCGTC TACGTGCTCT CCGCGCCGGC CGACCAGGCC
CGGGTCGAGG CCTCCCTCGC GGAAATGGCC CAGGCGGTAC AGGGCTACGT GCCGGGCTAT
CGCCTCAAGC AGCGGGTGCA GTTCGACGAG ATCCCCGACG CCGCGCCGCT GAACATCCCC
GGCCTCGGCC GCCTGTCCGG CCTGAAGACC TCGGTGTTCC TCGAGGTCGA GGGCGCCGCC
CATTACCTGC CGGCCTACGC CGGCAACCTG GACATCATGA CCTCCGCCGC GCTGGCTACC
GCCGAGCGCA TGGCGCAATC CATGCTGAAC GCCTGA
 
Protein sequence
MKKLKVAIVG SGNIGTDLMI KILRHGQHLE MGALVGIDPD SDGLARAARL GVATTAEGVE 
GLARLPGFGE IDFVFDATSA AAHVKNDPFL RGLRPGLRLI DLTPAAVGPY CVPVVNLEQN
LREPNVNMVT CGGQATIPMV AAVSRVARVH YAEIVASIAS RSAGPGTRAN IDEFTETTSK
AIEAIGGARK GKAIIVLNPA EPPLIMRDTV YVLSAPADQA RVEASLAEMA QAVQGYVPGY
RLKQRVQFDE IPDAAPLNIP GLGRLSGLKT SVFLEVEGAA HYLPAYAGNL DIMTSAALAT
AERMAQSMLN A