Gene Avi_1147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_1147 
SymbolbetB 
ID7386211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp968092 
End bp969558 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content59% 
IMG OID643650617 
Productbetaine aldehyde dehydrogenase 
Protein accessionYP_002548823 
Protein GI222147866 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01804] glycine betaine aldehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAG CCCAACCCAA AGCCTCCCAC TTCATTGATG GCGAATACGT TGAAGACGCA 
AGCGGCACCG TGATTGACAG CATCTACCCC GCAACGGGCG AGGTGATTGC CCGGCTGCAC
GCCGCAACGC CTGCGATTGT GGAGCGGGCC ATTGCAGCGG CCAAGCGGGC GCAGAAGGAA
TGGGCAGCTC TTAGCCCCAC TGCCCGTGGC CGGGTGCTGA AAAAGGCCGC CGAGATCATG
CGGGAGCGCA ACCGCGAACT GTCGGAACTG GAAACGATGG ACACGGGCAA ACCGATTCAA
GAAACCATTG TGGCGGACCC AACATCGGGC GCGGATAGTT TTGAGTTTTT TGGCGGAATT
GCCGCCGCTG GACTCAACGG CAGCCACATT CCGCTGGGCA ATGATTTTGC CTATACCAAA
CGGGTTCCCC TTGGCGTCTG TGTTGGCATT GGCGCGTGGA ACTATCCGCA GCAGATCGCC
TGCTGGAAAT CGGCCCCGGC GCTTGCCGCT GGCAATGCCA TGGTGTTCAA GCCATCAGAA
ATGACGCCGC TGGGGGCTTT GAAGATTGCC GAGATTTTGA TCGAGGCAGG TGCGCCCAAG
GGCGTGTTCA ACGTCATTCA GGGCGATCGG GAGACAGGGC CATTGCTGGT CAATCATCCC
GATGTGGCCA AGGTGTCTTT GACTGGCTCA GTGCCAACGG GGCGGAAAGT TGCGGCGGCG
GCGGCAGGGC ATTTGAAGCA CGTCACCATG GAACTTGGCG GCAAATCGCC TCTGATCGTG
TTTGATGATG CGGATGTGGA CAGCGCCATT TCCGGGGCGA TGCTGGCGAA TTTTTACTCC
ACGGGACAGG TTTGCTCCAA CGGCACGCGG GTGTTTGTTC ACACAGCCAT CAAGCAGGTC
TTCCTTGAGC GCTTGAAGGC CCGCACGGAA GCGATTGTGA TTGGTGATCC GCAAGATGAG
GCCACCCAGA TGGGGCCTTT GGTGTCGATG GCGCAGCGGG AAAAAGTGCT GTCCTATATC
GACAAGGGCA AGGCCGAGGG CGCGACACTG ATCACGGGCG GCGGCATTCC CAACAGCGCG
TCGGGCACTG GTGCGTTTAT TCAGCCCACG GTGTTTGCCG ATGTGACTGA TAGCATGACC
ATCGCCCGCG AAGAGATTTT TGGCCCCGTG ATGTGCGTGC TGGATTTTGA CGACGAGGCT
GACGTGATTG CCCGCGCCAA TGCATCCGAA TTTGGCCTCG CGGGCGGCGT GTTTACCGCC
GACATCACCC GCGCCCACCG CGTGGTGGAT CAGTTGGAAG CCGGAACGCT GTGGATCAAC
ACTTATAATC TCTGCCCGGT GGAAATGCCG TTTGGTGGCT CCAAACAATC CGGTTTTGGC
CGCGAGAATT CGCTGGCGGC GCTGGAGCAT TATTCGGAGT TGAAGACGGT TTATGTGGGC
ATGGGCAAGT GTGAAGCGCC GTATTGA
 
Protein sequence
MMKAQPKASH FIDGEYVEDA SGTVIDSIYP ATGEVIARLH AATPAIVERA IAAAKRAQKE 
WAALSPTARG RVLKKAAEIM RERNRELSEL ETMDTGKPIQ ETIVADPTSG ADSFEFFGGI
AAAGLNGSHI PLGNDFAYTK RVPLGVCVGI GAWNYPQQIA CWKSAPALAA GNAMVFKPSE
MTPLGALKIA EILIEAGAPK GVFNVIQGDR ETGPLLVNHP DVAKVSLTGS VPTGRKVAAA
AAGHLKHVTM ELGGKSPLIV FDDADVDSAI SGAMLANFYS TGQVCSNGTR VFVHTAIKQV
FLERLKARTE AIVIGDPQDE ATQMGPLVSM AQREKVLSYI DKGKAEGATL ITGGGIPNSA
SGTGAFIQPT VFADVTDSMT IAREEIFGPV MCVLDFDDEA DVIARANASE FGLAGGVFTA
DITRAHRVVD QLEAGTLWIN TYNLCPVEMP FGGSKQSGFG RENSLAALEH YSELKTVYVG
MGKCEAPY