Gene Namu_0188 details

Gene Information

Locus tag: Namu_0188
Symbol:
ID: 8445768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 211807
End bp: 213225
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 71%
IMG OID: 645039335
Product: beta-galactosidase
Protein accession: YP_003199610
Protein GI: 258650454
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
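
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(211807, 213225)
print(length)                     # 1419
print(protein_length_aa(length))  # 472
```

Both results agree with the Gene Length and Protein Length values listed in the record.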


Plasmid Coverage information

Num covering plasmid clones: 71
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGTA CGCCGCGCAC ATTCCCCGAG GACTTCTTGT GGGGGTCGGC GACCGCGTCG 
TACCAGATCG AGGGAGCGGT CACCGAGGAC GGCCGCGGGC CGTCGATCTG GGACACCTTC
AGCCACACCC CCGGCAAGAC GATGAACGGC GACACCGGTG ACGTCGCCGA CGATCACTAC
CACCGGTGGT CCGCCGACCT GGACCTGATC AAGGGGCTGG GCCTGCAGGC CTACCGGTTC
TCGCTGGCCT GGCCGCGGAT CCAGCCGACC GGGTCCGGGG CGGTCAACGC CAAGGGCGTC
GACTTCTACT CGCGGCTGGT CGACGGTTTG CTCGAGCGCG GGGTCAAGCC CGTCGTCACG
CTGTACCACT GGGACCTGCC GCAGGCCCTG GAGGACGAGG GCGGCTGGAC GAACCGGGAC
ACCGCGTTGC GGTTCGCCGA CTACGCCGCG CATGTCGCCG GAGCGCTCGG GGACCGGGTG
GAGATGTGGA CGACCCTGAA CGAGCCATGG TGCTCGGCGT TCCTGGGCTA TGCCTCGGGC
GTGCACGCGC CGGGCCGCAC CGATGGGGAG GCGGCGCTGC GGGCGGCGCA CCACCTGAAC
CTGGGCCACG GGCTGGCCGG CCGGGCGGTG CGCGAGGTAC TCGGGGCCGA CACCAAGTTG
TCGGTGACGC TGAACCTGCA CGTGACCCGG CCGGTCGACC CGGACTCGGC CGCCGACCGG
GACGCGATCC GGCAGCTGGA CGCGGTCGGC AACCGGGTCT TCCTGGGTCC GATGCTGGAC
GGCGCCTACC CGGCCGACCT GCTGGCCGAC ACCGCGTCGG TCACCGACTG GTCGTTCGTG
CGGGACGGCG ACGAGGCCGC CTGCGCGGTC CCGATCGACG TGCTGGGCAT CAACTACTAC
TCGACCTCCC GGGCTCGCCG GCACACCGGC GACGGGCCGA TGGAGCACGC CGACGGGCAC
GGGGACACCG GCTTCAGCCC GTGGGTGGGG GCGGACGACA TCGAGTTCCT GCGCCAGCCC
GGGCCGTACA CCGCGATGGG CTGGAACATC GACCCGTCCG GCATGCTCGA GCTGCTCACC
GACATCAGCA CCCGCTACCC GAGCGTGCCG CTGATGGTCA CCGAGAACGG CGCGGCCTTC
TACGACACGG TGAGCGAGGA CGGCCACGTG CACGACGCCG ACCGGGTCGC CTACCTACAC
GGGCACATCG ACGCGGTCGG CCAGGCCATC GACGCCGGGG CCGACGTGCG CGGCTACTTC
CTGTGGTCGC TGCTGGACAA CTTCGAATGG GCCTGGGGCT ACGACCGCCG CTTCGGGATC
ATCCGCGTCG ACTACGACAC CCAGGAGCGC ACCGTCAAGG ACTCGGCCAC GTGGTACTCC
CGGCTGATCG CCACCCGCGA GCTGCCGCCG GTCGACTGA
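
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the GC content field in the record. The helper names are hypothetical, and only the first displayed row is used as sample input.

```python
def join_sequence_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First displayed row of the gene sequence, copied from the record.
first_row = "ATGACCAGTA CGCCGCGCAC ATTCCCCGAG GACTTCTTGT GGGGGTCGGC GACCGCGTCG"
seq = join_sequence_blocks(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the complete gene sequence instead of the first row gives the figure to compare against the reported GC content.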
 
Protein sequence
MTSTPRTFPE DFLWGSATAS YQIEGAVTED GRGPSIWDTF SHTPGKTMNG DTGDVADDHY 
HRWSADLDLI KGLGLQAYRF SLAWPRIQPT GSGAVNAKGV DFYSRLVDGL LERGVKPVVT
LYHWDLPQAL EDEGGWTNRD TALRFADYAA HVAGALGDRV EMWTTLNEPW CSAFLGYASG
VHAPGRTDGE AALRAAHHLN LGHGLAGRAV REVLGADTKL SVTLNLHVTR PVDPDSAADR
DAIRQLDAVG NRVFLGPMLD GAYPADLLAD TASVTDWSFV RDGDEAACAV PIDVLGINYY
STSRARRHTG DGPMEHADGH GDTGFSPWVG ADDIEFLRQP GPYTAMGWNI DPSGMLELLT
DISTRYPSVP LMVTENGAAF YDTVSEDGHV HDADRVAYLH GHIDAVGQAI DAGADVRGYF
LWSLLDNFEW AWGYDRRFGI IRVDYDTQER TVKDSATWYS RLIATRELPP VD
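
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes that translation table 11 assigns amino acids to codons exactly as the standard genetic code does (the two tables differ in which codons may act as start codons). Since this gene starts with ATG, no special handling of alternative start codons is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest)
# paired with the amino acids of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last blocks of the gene sequence shown above.
print(translate("ATGACCAGTA CGCCGCGCAC ATTCCCCGAG"))  # MTSTPRTFPE
print(translate("GTCGACTGA"))                         # VD
```

The two outputs match the first and last residues of the protein sequence listed in the record.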