Gene BURPS1106A_A1010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1010 
SymbolmmsA 
ID4906151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp978188 
End bp979870 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content70% 
IMG OID640144116 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_001075046 
Protein GI126456253 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.291559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCCGGA GCCGCGCGGC AGGCACGCGT TCGGCGCCGG CGCGGCGGAC CCGGCCCGAT 
CGGCGTCGGT CCGCTTCGAT CCGCGCGGTG CGCGTTCCTC GCGCGCGCGC CGGCGCCGGC
GGCTCGGCGC ATCCCACGCA ACATACGAAT TCAGGAAACA CCGCGATGAA ACACGACAGC
AACGTCACTT CCCCCCTCGG CCATCTGATC GACGGCAAGC GCGTCGACGG CGGCGAGCGC
GTCCAGCCCG TGTTCGATCC GGCGACGGGC GCATCGACGA AGCGCGTCCG CATGGCCGAC
CGCCCGAGCG TCGAGGCGGC GATCGCCGCC GCGCAGGCCG CGTATCCGGC CTGGCGCAAC
ACGCCGCCGC TCAAGCGCGC GCGGGTGATG AGCCGCTTCA AGACGCTGCT CGAGGAGCAT
GCGAACGAGC TGTGCGCGCT GATCACGGCC GAGCACGGCA AGGTGCTCGC CGATGCGATG
GGCGAGTTGC AGCGCGGGAT CGAGAACGTC GAGTACGCGA GCTACGCGCC CGAGCTGCTC
AAGGGCGAGC ACAGCAAGAA CGTCGGCCCG GCGATCGACT CGTGGAGCGA GTTCCAGGCG
CTCGGCGTGG TGGCGGGGAT CACGCCGTTC AATTTCCCGA TCATGGTGCC GCTGTGGATG
TGGCCGATGG CCGTCGCGTG CGGCAACACG TTCGTGCTGA AGCCTTCCGA GCGCACGCCG
TCGTCGACGC TGCGCATGGC CGAGCTCGCG CTCGAAGCGT GCCTGCCGCC GGGCGTGCTG
AACGTCGTGA ACGGCGACAA GGAAGCCGTC GACACGATCC TGACCGATTC GCGCGTGAAG
GCGGTGAGCT TCGTCGGCTC GACGCCGATC GCCGAGTCCA TCTACACGAC GGGCTGCGCG
CACGGCAAGC GCGTGCAGGC GCTGGGCGGC GCGAAGAACT TCGCGATCGT GATGCCGGAC
GCCGACATCG GCAACGCGGT CAACGCGCTG ATGGGCGCGG CGTACGGTTC GTGCGGCGAG
CGGTGCATGG CGATTCCGCT CGTCGTCGCG ATCGGCGACG ACACGGCGGA GCAGGTCGTC
GACGGCCTGA AGGCCGAGAT CGCGAAGATG AAGGTCGGTC CGGGCACGGG CGAGCAGGTC
GACATGGGGC CGCTCGTCAC GCGGCAGCAC TTCGAGAAGG TGACGGGCTT CGTCGAGGCG
GGCATCGCCG CGGGCGCGAC GCTCGTCGTC GACGGGCGCG GCGTGAAGGT GGACGGCCAC
GAAGGCGGCT ATTACCTCGG CCCGTGCCTG TTCGACCACG TGAAGCCCGG CATGCCGATC
TATCAGCACG AGATCTTCGG GCCGGTGCTG GGCGTCGTGC GCGTCGCGTC GCTCGCCGAG
GCGATGGCGC TCGTCGACGC GCACGAGTAC GGCAACGGCA CGTGCCTCTT CACGCGCGAC
GGCGAGGCCG CGCGCTTTTT CAGCGACAAC ATCCAGGTCG GGATGGTCGG CATCAACGTG
CCGCTGCCCG TGCCCGTTGC TTATCACTCG TTCGGCGGCT GGAAGCGCTC GCTGTTCGGC
GATCTGCACG CATACGGACC GGACGCGGTG CGCTTCTATA CGAAGCGCAA GACGATCACG
CAGCGCTGGC CGTCGGCCGG CGTGCGCGAG GGGACGGTGT TCAGCTTCCC GTCGAGCCGC
TGA
 
Protein sequence
MRRSRAAGTR SAPARRTRPD RRRSASIRAV RVPRARAGAG GSAHPTQHTN SGNTAMKHDS 
NVTSPLGHLI DGKRVDGGER VQPVFDPATG ASTKRVRMAD RPSVEAAIAA AQAAYPAWRN
TPPLKRARVM SRFKTLLEEH ANELCALITA EHGKVLADAM GELQRGIENV EYASYAPELL
KGEHSKNVGP AIDSWSEFQA LGVVAGITPF NFPIMVPLWM WPMAVACGNT FVLKPSERTP
SSTLRMAELA LEACLPPGVL NVVNGDKEAV DTILTDSRVK AVSFVGSTPI AESIYTTGCA
HGKRVQALGG AKNFAIVMPD ADIGNAVNAL MGAAYGSCGE RCMAIPLVVA IGDDTAEQVV
DGLKAEIAKM KVGPGTGEQV DMGPLVTRQH FEKVTGFVEA GIAAGATLVV DGRGVKVDGH
EGGYYLGPCL FDHVKPGMPI YQHEIFGPVL GVVRVASLAE AMALVDAHEY GNGTCLFTRD
GEAARFFSDN IQVGMVGINV PLPVPVAYHS FGGWKRSLFG DLHAYGPDAV RFYTKRKTIT
QRWPSAGVRE GTVFSFPSSR