Gene Namu_0529 details

Gene Information

Locus tag: Namu_0529
Symbol:
ID: 8446112
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 588439
End bp: 589644
Gene length: 1206 bp
Protein length: 401 aa
Translation table: 11
GC content: 76%
IMG OID: 645039664
Product: fumarylacetoacetase
Protein accession: YP_003199936
Protein GI: 258650780
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID: [TIGR01266] fumarylacetoacetase
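
The coordinate and length fields in this record agree with one another, and the check is simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon excluded from the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record.
start_bp = 588439
end_bp = 589644

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1206 401
```

Under these assumptions the result matches the reported gene length of 1206 bp and protein length of 401 aa.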


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTTCCT GGGTGCCCGG TGCCGCCGGG TCGGGGTTCG ACGACGACCA CCTGCCGTAC 
GGGGTGTTCG ACGCGGGTGC CGGCCGCCGG GTGGGCGTGC GGATCGGCTC CTCCGTGCTC
GACCTGGCCG CGGTGGCCGA CACCCCTGAG CTGGCCGGCG TTCTGGCGGC CGGTTCGCTG
GATCCGCTGC TGGCCGCCGG CCCGGCGACC TGGGCGGCCG CCCGCTCCCT GGCGCACCGG
GCGGTGACCG ATCCGGACTG CCGCACCCTC GTCGAGTCCC ACCTGCACCC GCTGGAGTCG
GTGCGCCTGT TGCTGCCATT CACGGTCGCC GACTACGTCG ACTTCTACGC CAGCCAGTGG
CACGCGACCG CGGTCGGCCG GATGTTCCGG CCGGACGCGG ACCCGTTGCC GCCCAACTGG
AAACATCTGC CGATCGGCTA CCACGGCCGG GCCGGCAGCG TGGTCGTGTC CGGCACCCCG
GTCAGCCGGC CCCGCGGGCA GACCCGGTTG CCCGGCGCGG CGCCGACGTT CGGTCCGACG
CAGCGGCTGG ACCTGGAGGC GGAGGTCGCG TTCGTCGTCG GGGTCGGCTC GCCGCTGGGC
TCCCCGGTGC CGGCCGGCGC GTTCGCCCGG CACGTGTTCG GCGTCGGCCT GCTCAACGAC
TGGAGCGCCC GCGACATCCA GGCCTGGGAG TACCGCCCGC TCGGGCCGAT GCTCGGCAAA
TCCTTTGCCA CTTCGGTCGG CCCCTGGATC ACCCCGCTCG CCGCGCTGGC CGCCGCCCGG
GTCGCCCCGC CGCCGCGCAC CCACCGGCTG CTGCCCTACC TGGCCGACGA TGCCGGGCTG
CCTTGGGGCC TGGATCTGGC CCTGACCGTC GAGGTGAACG GGACCGTGGT CAGCCGGCCG
CCTTTCGCCG CCATGTACTG GACCGGGCCC CAGTTGATCG CGCACCTGAC CAGCAACGGC
GCGCGCCTGC GCACCGGGGA TCTGCTGGCG TCCGGCACCG TGTCCGGGCC CGCCGCCGAC
CAGGCCGGTT CGCTGCTGGA GCTCTCGGCC AACGGGACCC GGCCGGTGCC GCTGGGCGAC
GGCACGTCGC GGACCTTCCT GGCCGACGGC GACGTCGTCA CGATCACGGC GACCGCCCCG
TCGACCGGTG GCGGCCGGTT GACCCTGGGC GAGGTGACCG GGGCTGTGCG GCCGGCCGCG
GGCTGA
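
The record reports a GC content of 76% for this gene. A figure of this kind could be computed along the following lines. This is a sketch only: the function name `gc_content` is hypothetical, and how the source database counts or rounds is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example input: the last 26 bases of the gene sequence above.
tail = "GGGCTGTGCG GCCGGCCGCG GGCTGA"
print(round(gc_content(tail), 1))
```

Passing the full 1206-base gene sequence to the same function would give the gene-wide value to compare against the reported 76%.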
 
Protein sequence
MASWVPGAAG SGFDDDHLPY GVFDAGAGRR VGVRIGSSVL DLAAVADTPE LAGVLAAGSL 
DPLLAAGPAT WAAARSLAHR AVTDPDCRTL VESHLHPLES VRLLLPFTVA DYVDFYASQW
HATAVGRMFR PDADPLPPNW KHLPIGYHGR AGSVVVSGTP VSRPRGQTRL PGAAPTFGPT
QRLDLEAEVA FVVGVGSPLG SPVPAGAFAR HVFGVGLLND WSARDIQAWE YRPLGPMLGK
SFATSVGPWI TPLAALAAAR VAPPPRTHRL LPYLADDAGL PWGLDLALTV EVNGTVVSRP
PFAAMYWTGP QLIAHLTSNG ARLRTGDLLA SGTVSGPAAD QAGSLLELSA NGTRPVPLGD
GTSRTFLADG DVVTITATAP STGGGRLTLG EVTGAVRPAA G
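
The gene sequence begins with GTG, while the protein begins with M. That is consistent with translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene. It assumes the standard codon-to-amino-acid assignments, which table 11 shares with the standard code, and it lists only a subset of the alternative start codons. The names `translate` and `ALT_STARTS` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only a subset of the table 11 alternative start codons is listed.
ALT_STARTS = {"GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in ALT_STARTS:
            amino_acid = "M"  # an initiation codon is read as methionine
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_block = "GTGGCTTCCT GGGTGCCCGG TGCCGCCGGG"
print(translate(first_block))  # MASWVPGAAG
```

The output matches the first ten residues of the protein sequence. The same function applied to the final line of the gene sequence, GGCTGA, yields a single G followed by the stop codon, matching the last residue of the protein.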