Gene Avin_49950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_49950 
Symbol 
ID7763847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5061798 
End bp5063423 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content67% 
IMG OID643807827 
ProductAldehyde dehydrogenase 
Protein accessionYP_002802061 
Protein GI226946988 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.489954 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCG AGAGCCGTCT GCACACCCTT TTCCCCGCCG CCGCCGATAT CCCCGAGCAA 
TACCGCCCGG GCGCGCCCCT CGAACAGCGC GACTACCTGG TCGACGGTGA ACTGCGCCGC
TGGGACGGCC CGCTGGCCGC CGTGCGCAGT CCCATCCACC TGAAGACCGC CAAGGGCGAC
GAGCAGGTCG TCCTCGGCAG CACCCCGCTG CTCGACGCCG GCGCCGCGCT AAGCGCGCTG
GACGCCGCGG TCAGGGCCTA CGACAACGGC CAGGGCCTGT GGCCGAGCCT GCCGGTGGCC
GGGCGCATCC AGCACGTCGA GACCTTCCTG GCGCGTATGC GCGAGCAGCG CGAGGCGGTG
GTCAAACTGC TGATGTGGGA GATCGGCAAG AACCTCAAGG ACGCCGAGAA GGAATTCGAC
CGCACCTGCG ACTACATCGT CGACACCATC CACGAACTCA AGGAACTCGA CCGCCGCTCC
AGCCGCTTCG AGCTGGAGCA GGGCACCCTC GGCCAGATCC GCCGCGTGCC GCTGGGCGTG
GCGCTGTGCA TGGGCCCCTA CAACTACCCG CTGAACGAGA CCTTCACCAC CCTGATCCCG
GCGCTGATCA TGGGCAACAC CGTGGTGTTC AAGCCGGCCA AGTTCGGCGT GCTGCTGATC
CGCCCGCTGC TCGAGGCGTT CCGCGACAGC TTCCCGGCCG GGGTGATCAA CGTCATCTAC
GGGCGCGGCC GCGAGACCGT CAGCGCGCTG ATGGAAAGCG GCAAGGTGGA CGTGTTCGCC
TTCATCGGCA CCAACAAGGG CGCCAGCGAC CTGAAGAAGC TGCACCCACG CCCGCACCGC
CTGCGCGCCG CGCTCGGCTT GGACGCCAAG AACCCCGGCA TCGTGCTGCC CGAGGTGGAC
CTGGACAACG CGGTCGGCGA GGCGATCACC GGCGCGCTGT CGTTCAACGG CCAGCGCTGC
ACGGCGCTGA AGATTCTCTT CGTCCACGAA CAGGTGGTCG ACGCCTTCCT CGAGAAATTC
AACCAGAAGC TCGCCGCGCT CAAGCCGGGC ATGCCCTGGG AGCCGGGGGT GGCGCTGACC
CCGTTGCCGG AGCCGGGCAA GACCGATTTT CTCGCCACCC TGGTGGCCGA CGCCCTGGCC
AAGGGGGCGA AGGTGGTCAA CCCCGGCGGC GGCGAAGTGC GCGAGACCTT CTTCTACCCG
GCGCTGCTCT ACCCGGTGAG CCCGCAGATG CGCGTCTACC AGGAGGAGCA GTTCGGCCCG
CTGATCCCGG TGGTGCCCTA CCGCGACCTG CAGACGGTGA TCGACTACGT GCGCGAGTCG
GACTTCGGCC AGCAGTTGTC GATCTTCGGC AACGATCCGA AGCAGGTCGG CCGGCTGGTG
GACGCTTTCG CCAATCAGGT CGGACGGATC AACATCAACG CCCAGTGCCA GCGCGGCCCG
GATAGCTTTC CGTTCAACGG CCGCAAGAAC TCGGCGGAAG GGACCCTGTC GGTGTACGAC
GCGCTGCGCG TGTTCTCGAT CCGCACCCTG GTGGCGACCA AGTTCCAGGA GGATAACAAG
CGGCTGATCA GCGAGATCCT GCGCCACCGC GCGTCGAGCT TCCTGAGTAC CGACTACATC
TTCTGA
 
Protein sequence
MSTESRLHTL FPAAADIPEQ YRPGAPLEQR DYLVDGELRR WDGPLAAVRS PIHLKTAKGD 
EQVVLGSTPL LDAGAALSAL DAAVRAYDNG QGLWPSLPVA GRIQHVETFL ARMREQREAV
VKLLMWEIGK NLKDAEKEFD RTCDYIVDTI HELKELDRRS SRFELEQGTL GQIRRVPLGV
ALCMGPYNYP LNETFTTLIP ALIMGNTVVF KPAKFGVLLI RPLLEAFRDS FPAGVINVIY
GRGRETVSAL MESGKVDVFA FIGTNKGASD LKKLHPRPHR LRAALGLDAK NPGIVLPEVD
LDNAVGEAIT GALSFNGQRC TALKILFVHE QVVDAFLEKF NQKLAALKPG MPWEPGVALT
PLPEPGKTDF LATLVADALA KGAKVVNPGG GEVRETFFYP ALLYPVSPQM RVYQEEQFGP
LIPVVPYRDL QTVIDYVRES DFGQQLSIFG NDPKQVGRLV DAFANQVGRI NINAQCQRGP
DSFPFNGRKN SAEGTLSVYD ALRVFSIRTL VATKFQEDNK RLISEILRHR ASSFLSTDYI
F