Gene YpsIP31758_2830 details


Gene Information

Locus tag: YpsIP31758_2830
Symbol: betB
ID: 5387151
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 3196970
End bp: 3198442
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 54%
IMG OID: 640865822
Product: betaine aldehyde dehydrogenase
Protein accession: YP_001401793
Protein GI: 153950641
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01804] glycine betaine aldehyde dehydrogenase
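
The coordinates and lengths in this record agree with each other if the start and end positions are read as inclusive, 1-based coordinates. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the check follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Feature length for inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(3196970, 3198442)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1473 490
```

Both values match the Gene Length and Protein Length fields of the record.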


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.143897
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCGCT ACGGCTTGCA AAAACTTTAT ATCAACGGCG CGTACACCGA CAGCGCCAGT 
GGTGATACTT TCGATGCCGT GAACCCTGCG AATGGCGAAT GCATTGCACA ACTCCAGGCC
GCCAACGCAC AGGATGTTGA TAAGGCGGTA GCCGCCGCCA AACAAGGTCA GCCTGTGTGG
GCAGCGATGA CCGCGATGGA ACGTTCACGC ATCTTGCGCC GGGCCGTGGA TATCTTGCGT
GATCGTAATG ATGAACTGGC GGCAATTGAA ACGGCAGACA CCGGTAAACC CCTATCCGAA
ACCCGATCTG TGGACATCGT TACCGGTGCC GATGTGCTGG AATATTATGC TGGCCTAATC
CCCGCCCTTG AAGGGCAACA GATCCCACTA CGTGGCAGCG CATTTGTCTA TACCCGCCGT
GAGCCACTGG GCGTGGTTGC CGGTATTGGT GCCTGGAACT ATCCCATTCA GATCGCTTTG
TGGAAATCTG CACCGGCGCT GGCGGCGGGC AACGCTATGA TCTTCAAACC AAGTGAAGTG
ACATCGCTGA CCGCACTGAA ATTGGCGGAA ATCTACACCG AAGCGGGTTT ACCGGCTGGC
GTATTTAACG TATTGACCGG CAGTGGTGAC CAGGTTGGGC AGATGCTGAC AGAGCATCCG
GGTATTGCAA AGGTCTCCTT CACCGGGGGG ATTGCCAGCG GTAAAAAGGT GATGGCTAAC
GCCGCGGGAT CGACCCTGAA AGATGTCACC ATGGAGTTGG GCGGAAAGTC CCCACTAATT
ATTTTTGCTG ATGCCGATCT CGATAAGGCC GCTGATATTG CGATGATGGC CAATTTCTAC
AGCTCAGGAC AAGTCTGCAC CAACGGCACG CGAGTTTTTG TCCCGCAGGC GTTACAGGCC
GCGTTTGAGC AGAAAATCGT TGAACGGGTC AAGCGGATTC ATATTGGTGA CCCAAGCGAT
GAACGAACCA ACTTTGGCCC GTTGGTTAGC TTCCAGCACC GCGATTCAGT GATGCGTTAC
ATTGACAGCG GTAAACGGGA AGGCGCAACC CTGCTGATCG GGGGATACAG TCTGACCGAG
GACGCACTGG CACACGGGGC CTATGTCGCC CCTACGGTAT TCACCCACTG CCGTGATGAC
ATGCAAATTG TGCGTGAAGA GATCTTCGGA CCGGTGATGA GCATTCTTAG TTATCAAAGC
GAAGAGGAAG TCATTCGCCG CGCCAATGAT ACCGAGTACG GTTTAGCGGC GGGGGTCGTT
ACACAGGATT TGAACCGTGC CCATCGCGTG ATTCATCAAC TGCAAGCGGG TATCTGCTGG
ATCAATACTT GGGGCGAATC GGCACCAGAG ATGCCTGTAG GCGGATATAA GCATTCTGGT
GTAGGCCGTG AAAACGGTAT CAGCACGCTG GAACATTACA CGCAAATCAA ATCAATTCAG
GTTGAGTTAG GCAGCTTCAA TTCTGTTTTT TAA
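
A GC content figure such as the one in the Gene Information fields can be recomputed from the gene sequence. The sketch below assumes GC content means the fraction of G and C bases over all bases, which is the usual definition but is not spelled out in this record. The function name is hypothetical, and the example runs on only the first 10-base block of the sequence.

```python
def gc_content(seq):
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


first_block = "ATGTCCCGCT"  # first 10 bases of the gene sequence
print(round(gc_content(first_block), 2))  # 0.6
```

Passing the full gene sequence, spaces and line breaks included, gives the gene-wide fraction.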
 
Protein sequence
MSRYGLQKLY INGAYTDSAS GDTFDAVNPA NGECIAQLQA ANAQDVDKAV AAAKQGQPVW 
AAMTAMERSR ILRRAVDILR DRNDELAAIE TADTGKPLSE TRSVDIVTGA DVLEYYAGLI
PALEGQQIPL RGSAFVYTRR EPLGVVAGIG AWNYPIQIAL WKSAPALAAG NAMIFKPSEV
TSLTALKLAE IYTEAGLPAG VFNVLTGSGD QVGQMLTEHP GIAKVSFTGG IASGKKVMAN
AAGSTLKDVT MELGGKSPLI IFADADLDKA ADIAMMANFY SSGQVCTNGT RVFVPQALQA
AFEQKIVERV KRIHIGDPSD ERTNFGPLVS FQHRDSVMRY IDSGKREGAT LLIGGYSLTE
DALAHGAYVA PTVFTHCRDD MQIVREEIFG PVMSILSYQS EEEVIRRAND TEYGLAAGVV
TQDLNRAHRV IHQLQAGICW INTWGESAPE MPVGGYKHSG VGRENGISTL EHYTQIKSIQ
VELGSFNSVF
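
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The record specifies translation table 11, whose codon-to-amino-acid assignments are the same as the standard code (the two tables differ in permitted start codons). The sketch below builds that codon table in plain Python. The names `CODON_TABLE` and `translate` are illustrative, and the function assumes the input contains only A, C, G and T and is already in frame.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGTCCCGCT ACGGCTTGCA A"))              # MSRYGLQ
print(translate("GTTGAGTTAG GCAGCTTCAA TTCTGTTTTT TAA"))  # VELGSFNSVF
```

The two fragments are the first 21 bases and the last line of the gene sequence. Their translations match the first seven and last ten residues of the protein sequence.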