Gene Avi_5053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5053 
Symbolvdh 
ID7381210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp44354 
End bp45802 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content62% 
IMG OID643648724 
Productvanillin: NAD oxidoreductase 
Protein accessionYP_002546961 
Protein GI222106170 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.207548 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGAAA TCCAGCAAAT CATTGGTGGC AAGAAAGTCG GAGCCTTGTC GGGCAAGACC 
TTCGACCGGA TCGATCCATT CAATGGCGAG ATTGCTTCCC GCGCCCCGGC CTCGAGCCTG
GACGATGTCA AGGCAGCAGT TGCCGCAGCC CAGGCGGCCT TTCCCGCCTG GTCGCGCACC
GGCCCCGGCG AAAGGCGGGC GCTGCTGCTG AAAGCCGCCG ACATCATGGC CTCGAAGGCG
GCAGACTTCA CCGCGCTGAT GATCACGGAG ACGGGTGCAA CCGGTCCCTG GGCCGGCTTC
AACACCATGC TTGCCGCTGG CGTATTGCGG GAAGCCGCCA GCATGACCAG CCAGATCCAG
GGCGAAGTCA TCCCCTCCGA CAAGCCCGGT ACATTGTCCA TGGCGGTGCG CCAGGCAGCA
GGCGTCTGCC TTGGCATTGC GCCCTGGAAC GCACCAATCA TTTTGGGCAC ACGCGCCATT
GCCATGGCAA TTGCCTGCGG CAACAGCGTC ATCCTGAAGG CATCCGAAGC CTGCCCCGGC
GTTCACGTCC TGATCGGCCA GGTGCTGGTC GAAGCTGGCC TGCCGGATGG CGTCATCAAT
GTCATCACCA ATGCGCCTGA AGATGCTGCC CAGGTGGTGG AGGCGCTGGT CAGCGCACCG
GAAGTTCGCC GCGTCAATTT CACCGGTTCC ACCAAGGTCG GACGCATTAT CGGCGAATTG
TGCGGTCGCC ACCTGAAGCC CGCCCTGCTT GAACTCGGCG GAAAAGCACC CTTTCTGGTG
CTCGAAGATG CCGATATCGA CGCTGCCGTC AATGCGGCGG TGTTTGGCTG CTACATGAAC
ATGGGCCAGA TCTGCATGTC CACGGAGCGG TTGATTGTCC ACGAAAAGGT CGCCGACGAA
TTCGTGGCAA AGCTGGCCGC CCGGGCAGCC TCGCTTCCCG CTGGCGATCC GCGCGGCCAT
GTCGTGCTGG GCTCGCTGGT TAATCCTCAG GCCGCCATCA AAATGCAGGA ATTCATCGAC
GATGCCGTCG GCAAGGGCGC AACCCTCGCG GCTGGCGGCA AGGTCACGGG CAGCGTGGTG
GAAGCGACGC TTCTCGACCA TGTCACATCA GGAATGCGCA GTTTCGATGA GGAAAGCTTC
GGCCCGGTCA AGCCGGTCAT CCGGGTCAAG GACGAGGAAG AGGCCATCCG CATCGCCAAT
GACAGCGAAT ACGGCCTGTC CTCGGCGATT TTCAGCCGCG ATATCCAACG CGCCCTGGCG
ATTGCGGCCC GTATCGAAGC CGGCATTTGC CATATCAACG GCCCGACCGT TGCCGATGAG
GCGCAAATGC CGTTTGGCGG TGTGAAAAGC TCCGGTTTCG GTCGGTTCGG CGGCAAGGCG
GCGATCAACG AATTCACCGA CCTGCGCTGG ATCACCATCG AAGATCCGAA CCAGCACTAT
CCGTTCTGA
 
Protein sequence
MHEIQQIIGG KKVGALSGKT FDRIDPFNGE IASRAPASSL DDVKAAVAAA QAAFPAWSRT 
GPGERRALLL KAADIMASKA ADFTALMITE TGATGPWAGF NTMLAAGVLR EAASMTSQIQ
GEVIPSDKPG TLSMAVRQAA GVCLGIAPWN APIILGTRAI AMAIACGNSV ILKASEACPG
VHVLIGQVLV EAGLPDGVIN VITNAPEDAA QVVEALVSAP EVRRVNFTGS TKVGRIIGEL
CGRHLKPALL ELGGKAPFLV LEDADIDAAV NAAVFGCYMN MGQICMSTER LIVHEKVADE
FVAKLAARAA SLPAGDPRGH VVLGSLVNPQ AAIKMQEFID DAVGKGATLA AGGKVTGSVV
EATLLDHVTS GMRSFDEESF GPVKPVIRVK DEEEAIRIAN DSEYGLSSAI FSRDIQRALA
IAARIEAGIC HINGPTVADE AQMPFGGVKS SGFGRFGGKA AINEFTDLRW ITIEDPNQHY
PF