Gene Namu_4033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4033 
Symbol 
ID8449652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4447195 
End bp4448397 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content76% 
IMG OID645043078 
Productbenzoate transporter 
Protein accessionYP_003203314 
Protein GI258654158 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3135] Uncharacterized protein involved in benzoate metabolism 
TIGRFAM ID[TIGR00843] benzoate transporter 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.485429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0404711 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACGTT CGTGGACGCA GCCCGTCGGC GTCGGCGTGG TAACGGCGCT GGTCGGGTTC 
ACCAGCTCGT TCGCGGTGGT GCTGGCCGGC CTGCAGGCGG TGGGGGCCGA CACCGCCCAG
GCCGCCTCCG GGCTGGCCGT GCTCTGCGTG CTGCAGGGCG CCGGCACGAT CTGGCTGAGC
CAGCGGCACC GCACCCCGCT GACCCTGGCC TGGTCGACGC CCGGCGCGGC GCTGCTGGTC
GCCGCGGCCG GCCTGCAGAT CGGGTGGTCG GCGGCCGTCG GGGCGTTCGT CGTGACCGGG
GCATTGCTGG CCATCACCGG GCTGTGGCCG TGGTTGGGCC GGACGGTCGC CCGGATCCCG
GCCCCGCTGG CCCAGGCGAT GCTGGCCGGC GTGCTGCTGA CCCTGTGCCT GCAGCCGATC
ACCGCGCTGA CGGTCAGCCC GTTGCTGGTC GCCCCCGTGA TCGTCGTCTG GCTGGGGTTG
CAGCGGCTGG CCCCGCGCTG GTCGACCCCG GCCGCGTTCC TGCTCGCGCT GGCCCTGATC
GTGGGTGATG CCGTGCTGTC CGGTCAGGGC GTCACACTGC TGGCGCCGGT CGTGAGCCTC
ACCGTCCCCA CCGTCACCTG GACCGCGGTC GTCGGCATCG CGATACCGCT GTACGTGGTG
ACCATGGCCT CGCAGAACGT GCCCGGGGTG GCCGTGATGA GCGCCGCCGG GTACGCGGTG
CCCTGGCGGG AGTCCTTGCT GCTGACCGGC CTGGGCACGA TGGCCGGTGC CGGCGCCGGG
GCCCACGCGG TCAACCTGGC CGCGATCAGC GCGGCGCTGC CGGCGTCCGC CGAGGCCCAC
CCGGACCCCC GCCGCCGGTG GATCGCCTCG ACCACCGCCG GCGTGACCTA CCTGTTGCTG
GCCCCGCTGG CGGCCACGTT GACCGCCCTG GTGGCCGGGG CTCCGCCCGG CGTCATCGAG
TCGGTGGCCG GGCTGGCCCT GCTCGGCACC CTGGCCGCCT GCCTGGCCGC CGCGACCGCC
GATCCGGGCG AGCGGCTGCC GGCGGTGGCG GCGTTCCTGG TCGCGGCCAG CGGGGTGAGC
GCGCTGGGCA TCGGTGCGGC GTTCTGGGCG CTGCTGGCCG GGTTGGCGGT GCGGACCGTG
CTGCGGCCCC GCGATCCTCG AGCCGCCGAG AAGCCTCGAT CGGGCCGGCA CGCCCGCGTC
TAA
 
Protein sequence
MERSWTQPVG VGVVTALVGF TSSFAVVLAG LQAVGADTAQ AASGLAVLCV LQGAGTIWLS 
QRHRTPLTLA WSTPGAALLV AAAGLQIGWS AAVGAFVVTG ALLAITGLWP WLGRTVARIP
APLAQAMLAG VLLTLCLQPI TALTVSPLLV APVIVVWLGL QRLAPRWSTP AAFLLALALI
VGDAVLSGQG VTLLAPVVSL TVPTVTWTAV VGIAIPLYVV TMASQNVPGV AVMSAAGYAV
PWRESLLLTG LGTMAGAGAG AHAVNLAAIS AALPASAEAH PDPRRRWIAS TTAGVTYLLL
APLAATLTAL VAGAPPGVIE SVAGLALLGT LAACLAAATA DPGERLPAVA AFLVAASGVS
ALGIGAAFWA LLAGLAVRTV LRPRDPRAAE KPRSGRHARV