Gene Namu_4538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4538 
Symbol 
ID8450166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5045387 
End bp5046949 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content69% 
IMG OID645043579 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_003203806 
Protein GI258654650 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGA TCAGCGATCC TGAGGCGGTT CCGGCCGCGG CCGAGCAGGT ACGGGGGACC 
GGCGGGGTGC TGCCGCACTG GAAGGACGGG GCCCCCTTCG CCGGCACCTC GTCCCGCACC
GCGCCGGTGT ACGACCCGGC CACCGGCCAG GTCACCAAGC AGGTCGCGTT GGCCTCGGAG
GCCGACGCGG ACGCGGTGAT CGCCTCGGCG GCCGCGGCGT TCCCGGGGTG GCGGGACACC
TCGTTGGCCC GGCGCACCCA GATCCTGTTC CGGTTCCGCG AGCTGCTCAA CGCCCGCAAG
CACGAGGCCG CCGAGATTAT CACCAGCGAG CACGGCAAGG TCCTCTCGGA CGCCCTGGGC
GAGATCACCC GGGGCCAGGA GATCGTCGAG CTGGCCTGCG GCCTGGCCCA TTTGCTCAAG
GGCAGCCTGA CCGAGAACGC CTCGACCAAG GTCGACGTCT CCTCGATCCG GCAGCCACTG
GGCGTCGTCG GAATCATCTC CCCGTTCAAC TTCCCGGCGA TGGTGCCGAT GTGGTTCTTC
CCGATCGCGA TCGCCGCCGG CAACACGGTG GTCCTCAAAC CGAGCGAGAA GGACCCCTCG
ACGGCGAACT GGCTGGCCGA CCTGTGGAAG GAGGCCGGCC TGCCCGATGG GGTGTTCACC
GTGCTCCACG GCGACAAGGT CGCGGTGGAC GCGCTCCTGA CCGATTCGCG GGTCAAGGCG
ATCAGCTTCG TCGGCAGCAC GCCGATCGCC CAGTACGTGT ACGCCACCGG CACCGACAAC
GGCAAACGGG TGCAGGCCCT GGGCGGGGCC AAGAACCACA TGCTGGTGCT GCCCGATGCC
GACCTGGACC TGGCCGCCGA CGCCGCGGTG AACGCCGGCT TCGGCTCGGC CGGCGAACGG
TGCATGGCCA TCTCGGCGTT GGTGGTGGTG GACGCGGTGG CCGACGAGCT GATCGCCAAG
ATCGGTGAAC GGGTCGCCAC CCTGCGCACC GGCGACGGGC GCCGCGGCTG CGACATGGGA
CCACTGGTCA CCGGCGCGCA CCGGGACAAG GTCGCCGGCT ACGTCCAGGC CGGGGTCGAT
GCCGGCGCCG AACTGGTCAT CGACGGCCGG GTCAGCAACC CGAACCAGCA CTTCGACGGC
GCGGCCGACG GGTTCTGGCT CGGCCCAACC CTGTTCGACC GGGTCACCCC CGACATGAGC
ATCTACACCG ACGAGATCTT CGGGCCGGTA CTCAGCGTGG TGCGGGTGGA CTCCTACGAC
CAGGGTCTGG ACCTGATCAA CAGCAACCCG TACGGCAACG GCACCGCGAT CTTCACCAAC
GACGGCGGCG CCGCCCGCCG CTTCCAGAAC GAGGTCCAGG TCGGCATGGT CGGCATCAAC
GTGCCCGTGC CCGTCCCCGT CGGCTACTAC AGCTTCGGCG GGTGGAAAGC ATCGCTGTTC
GGCGACACCC ACGCCCACGG CACCGAGGGC TTCCACTTCT ACACCCGCGG CAAGGTCATC
ACCACCCGCT GGCTCGACCC CAGCCATGGC GGCCTGAACC TCGGTTTCCC CCAGAACATC
TGA
 
Protein sequence
MTTISDPEAV PAAAEQVRGT GGVLPHWKDG APFAGTSSRT APVYDPATGQ VTKQVALASE 
ADADAVIASA AAAFPGWRDT SLARRTQILF RFRELLNARK HEAAEIITSE HGKVLSDALG
EITRGQEIVE LACGLAHLLK GSLTENASTK VDVSSIRQPL GVVGIISPFN FPAMVPMWFF
PIAIAAGNTV VLKPSEKDPS TANWLADLWK EAGLPDGVFT VLHGDKVAVD ALLTDSRVKA
ISFVGSTPIA QYVYATGTDN GKRVQALGGA KNHMLVLPDA DLDLAADAAV NAGFGSAGER
CMAISALVVV DAVADELIAK IGERVATLRT GDGRRGCDMG PLVTGAHRDK VAGYVQAGVD
AGAELVIDGR VSNPNQHFDG AADGFWLGPT LFDRVTPDMS IYTDEIFGPV LSVVRVDSYD
QGLDLINSNP YGNGTAIFTN DGGAARRFQN EVQVGMVGIN VPVPVPVGYY SFGGWKASLF
GDTHAHGTEG FHFYTRGKVI TTRWLDPSHG GLNLGFPQNI