Gene Dshi_1747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1747 
SymbolmmsA 
ID5713314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1812332 
End bp1813831 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content66% 
IMG OID641267665 
Productacylating methylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_001533090 
Protein GI159044296 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.207366 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.6572 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAGT TGAGCCATTT CATCAACGGC AAGCGCGTGG CAGGCACGTC GGGCCGGTTC 
GCGGATGTCA TGAACCCGGC CACGGGCGAG GTGCAGGCCA GGGTGCCACT TGCGTCGCCC
GAAGAGCTGG ACGCCGCTGT GGCCGCCGCG GCCGCAGCAC AGCCCGCCTG GGCCGCCACC
AATCCCCAGC GCCGCGCGCG GGTGCTGATG GAATTCGTGC GCCTGTTGAA CCGAGACATG
GACAAGCTGG CCGAGGCCCT GAGCCGCGAG CATGGCAAGA CCCTGCCCGA TGCCAAGGGC
GACGTGGTGC GCGGGCTGGA GGTGGTCGAG TTCTGCATCG GCGCGCCCCA TCTTCTGAAG
GGCGAATTCA CCGACAGCGC GGGTCCGGGC ATCGACATGT ATTCCATGCG CCAGGCCTTG
GGCGTGGTGG CAGGGATCAC CCCGTTCAAC TTCCCCGCCA TGATCCCGAT GTGGAAGATG
GCCCCCGCCC TGGCCTGCGG CAACGCCTTT ATCCTCAAGC CGTCGGAGCG CGATCCGTCC
GTGCCGCTGA TGCTGGCCGA GCTCATGACC GAGGCCGGGC TGCCCGACGG GCTGCTGCAG
GTGATCAACG GCGACAAGGG CGCGGTGGAT GCGATCCTCG ACAATGACAC GATCCAGGCG
ATCGGCTTCG TCGGCTCCAC CCCGATCGCG GAATACATCT ATTCCCGCGG CTGTGCCAAC
GGCAAGCGCG TACAGTGTTT CGGGGGGGCC AAGAACCACA TGATCATCAT GCCGGACGCC
GATCTCGACC AGGCGGCGGA CGCGCTGGTC GGCGCAGGCT ACGGGGCTGC GGGCGAGCGG
TGCATGGCGA TCTCGGTCGC GGTGCCCGTC GGCGAAGAGA CCGCGGACCG GCTGATCGAG
AAGCTGGTGC CGCGGGTCGA GGCGCTGAAG GTCGGGCCCT ATACCTCGGG CACGGATGTC
GATTACGGCC CCGTGGTGAC CGCCGCGGCC AAGGCCAATA TCGAGCGGCT GGTGCAATCC
GGCGTCGATC AGGGCGCCAA GCTGGTGGTG GACGGGCGGG ACTTCTCGCT GCAGGGCTAT
GAAAACGGGT TCTTCGTCGG CCCGCACCTG TTCGACAATG TCTCCAAAGA AATGGACATC
TATCGGACCG AGATTTTCGG CCCCGTTCTG TGCACCGTGC GCGCCAAATC CTACGAGGAA
GCCCTCGGTC TGGCGATGGA CCACGAATAC GGCAACGGCA CCGCGATCTT CACCCGCGAC
GGGGACGCGG CGCGCGACTT CGCCAACCGC ATCAATATCG GTATGGTGGG GATCAACGTG
CCGATCCCGG TGCCGCTGGC CTACCACACC TTCGGCGGCT GGAAGAAGTC CGGATTCGGC
GACCTGAACC AGCACGGGCC AGACGCGTTC CGGTTCTATA CGCGGACCAA GACCGTCACC
GCCCGCTGGC CCTCGGGCAT CAAGGAAGGC GGAGAATTCT CGATCCCCGT GATGGAGTGA
 
Protein sequence
MEELSHFING KRVAGTSGRF ADVMNPATGE VQARVPLASP EELDAAVAAA AAAQPAWAAT 
NPQRRARVLM EFVRLLNRDM DKLAEALSRE HGKTLPDAKG DVVRGLEVVE FCIGAPHLLK
GEFTDSAGPG IDMYSMRQAL GVVAGITPFN FPAMIPMWKM APALACGNAF ILKPSERDPS
VPLMLAELMT EAGLPDGLLQ VINGDKGAVD AILDNDTIQA IGFVGSTPIA EYIYSRGCAN
GKRVQCFGGA KNHMIIMPDA DLDQAADALV GAGYGAAGER CMAISVAVPV GEETADRLIE
KLVPRVEALK VGPYTSGTDV DYGPVVTAAA KANIERLVQS GVDQGAKLVV DGRDFSLQGY
ENGFFVGPHL FDNVSKEMDI YRTEIFGPVL CTVRAKSYEE ALGLAMDHEY GNGTAIFTRD
GDAARDFANR INIGMVGINV PIPVPLAYHT FGGWKKSGFG DLNQHGPDAF RFYTRTKTVT
ARWPSGIKEG GEFSIPVME