Gene BURPS668_A1097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1097 
SymbolmmsA 
ID4886669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1053386 
End bp1054981 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content69% 
IMG OID640131037 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_001062096 
Protein GI126442504 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.856592 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCGTTC CTCGCGCGCG CGCCGGCGCC GGCGGCGCGA CCCATCCCAC GCAACATACG 
AATTCAGGAA ACACCGCGAT GAAACACGAC AGCAACGTCA CTTCCCCCCT CGGCCATCTG
ATCGACGGCA AGCGCGTCGA CGGCGGCGAG CGCGTCCAGC CCGTGTTCGA TCCGGCGACG
GGCGCATCGA CGAAGCGCGT CCGCATGGCC GACCGCCCGA GCGTCGAGGC GGCGATCGCC
GCCGCGCAGG CCGCGTATCC GGCCTGGCGC AACACGCCGC CGCTCAAGCG CGCGCGGGTG
ATGAGCCGCT TCAAGACGCT GCTCGAGGAG CACGCGAACG AGCTGTGCGC GCTGATCACG
GCCGAGCACG GCAAGGTGCT CGCCGATGCG ATGGGCGAGT TGCAGCGCGG GATCGAGAAC
GTCGAGTACG CGAGCTACGC GCCCGAGCTG CTCAAGGGCG AGCACAGCAA GAACGTCGGC
CCGGCGATCG ACTCGTGGAG CGAGTTCCAG GCGCTCGGCG TGGTGGCGGG GATCACGCCG
TTCAATTTCC CGATCATGGT GCCGCTGTGG ATGTGGCCGA TGGCTGTCGC GTGCGGCAAC
ACGTTCGTGC TGAAGCCTTC CGAGCGCACG CCGTCGTCGA CGCTGCGCAT GGCCGAGCTC
GCGCTCGAAG CGGGCCTGCC GCCGGGTGTG CTGAACGTCG TGAACGGCGA CAAGGAAGCC
GTCGACACGA TCCTGACCGA TTCGCGCGTG AAGGCGGTGA GCTTCGTCGG CTCGACGCCG
ATCGCCGAGT CCATCTACGC GACGGGCTGC GCGCACGGCA AGCGCGTGCA GGCGCTGGGC
GGCGCGAAGA ACTTCGCGAT CGTGATGCCG GACGCCGACA TCGGCAACGC GGTCAACGCG
CTGATGGGCG CGGCGTACGG TTCGTGCGGC GAGCGGTGCA TGGCGATTCC GCTCGTCGTC
GCGATCGGCG ACGACACGGC GGAGCAGGTC GTCGACGGCC TGAAGGCCGA GATCGCGAAG
ATGAAGGTCG GCCCGGGCAC GGGCGAGCAG GTCGACATGG GGCCGCTCGT CACGCGGCAG
CACTTCGAGA AGGTGACGGG CTTCGTCGAG GCGGGCATCG CCGCGGGCGC GACGCTCGTC
GTCGACGGGC GCGGCGTGAA GGTGGACGGC CACGAAGGCG GCTATTACCT CGGCCCGTGC
CTGTTCGATC ACGTGAAGCC CGGCATGCCG ATCTATCAGC ACGAGATCTT CGGGCCGGTG
CTGGGCGTCG TGCGCGTCGC GTCGCTCGCC GAGGCGATGG CGCTCGTCGA CGCGCACGAG
TACGGCAACG GCACGTGCCT CTTCACGCGC GACGGCGAGG CCGCGCGCTT TTTCAGCGAC
AACATCCAGG TCGGGATGGT CGGCATCAAC GTGCCGCTGC CCGTGCCCGT TGCTTATCAC
TCGTTCGGCG GCTGGAAGCG CTCGCTGTTC GGCGATCTGC ACGCATACGG ACCCGACGCG
GTGCGCTTCT ATACGAAGCG CAAGACGATC ACGCAGCGCT GGCCGTCGGC CGGCGTGCGC
GAGGGGACGG TGTTCAGCTT CCCGTCGAGC CGCTGA
 
Protein sequence
MRVPRARAGA GGATHPTQHT NSGNTAMKHD SNVTSPLGHL IDGKRVDGGE RVQPVFDPAT 
GASTKRVRMA DRPSVEAAIA AAQAAYPAWR NTPPLKRARV MSRFKTLLEE HANELCALIT
AEHGKVLADA MGELQRGIEN VEYASYAPEL LKGEHSKNVG PAIDSWSEFQ ALGVVAGITP
FNFPIMVPLW MWPMAVACGN TFVLKPSERT PSSTLRMAEL ALEAGLPPGV LNVVNGDKEA
VDTILTDSRV KAVSFVGSTP IAESIYATGC AHGKRVQALG GAKNFAIVMP DADIGNAVNA
LMGAAYGSCG ERCMAIPLVV AIGDDTAEQV VDGLKAEIAK MKVGPGTGEQ VDMGPLVTRQ
HFEKVTGFVE AGIAAGATLV VDGRGVKVDG HEGGYYLGPC LFDHVKPGMP IYQHEIFGPV
LGVVRVASLA EAMALVDAHE YGNGTCLFTR DGEAARFFSD NIQVGMVGIN VPLPVPVAYH
SFGGWKRSLF GDLHAYGPDA VRFYTKRKTI TQRWPSAGVR EGTVFSFPSS R