Gene Namu_0889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0889 
Symbol 
ID8446481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp979516 
End bp980958 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content67% 
IMG OID645040026 
Productglycoside hydrolase family 1 
Protein accessionYP_003200289 
Protein GI258651133 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCA TTTCCGCACC AACGCAGTTC CCGCCCGGCT TCCTGTGGGG CGGCGCCACC 
GCCGCCAACC AGGTGGAGGG CGGCTACGAC CAGGGCGGCA AGGGCCTGTC CATCCAGGAC
GTGATGCCCC AGGGCATCGT CGGTCCGCGC ACCGACGGCC CCACCCCGGA GAACCTCAAG
CTGACCGGCA TCGACTTCTA CCACCGCTAT GCCGAGGACA TCGCCCTGTT CGCCGAGATG
GGCTTCTCGG TCTACCGGTT CTCCATCGCC TGGAGCCGGA TCTTCCCGGG CGGTGACGAC
GCCGAGCCCA ACGAGGAGGG CCTGGCCTTC TACGACCGGG TGCTCGACGA GCTGGAGCGG
CACGGCATCG AGCCGCTGAT CACGCTGTCG CACTACGAGA CCCCGCTGGC CATCGCCGAG
AAGTACGACG GCTGGGTCTC CCGGGACGTC ATCGCCCTGT TCGAGCGGTA CGTGCGGGTC
GTGTTCGCCC GCTACGGCCA CCGGGTGAAG TACTGGCTGA CCTTCAACGA GATCAACTCG
GTGATCCACG CGCCGTTCAT GAGCGGCGGC ATCAACACCC CCAAGGACGA GCTGACCCCG
ACCGACCTGT ACCAGGCGAT CCACCATGAG CTGGTGGCCA GCGCGCTGGC CACCAAGATC
GCGCACGAGA CCGACCCGCA GATCAAGGTC GGCTGCATGG TGCTGGGCAT GCCGATCTAC
CCGCTGTCCC CCGACCCGAA CGATCTGCTG GCGTCGATGA CCGCCGATCA CGCCAACCTG
ATGTTCAGCG ACGTGCACAC CCGCGGCGAG TACCCCGGAT ACGCCCTGCG GTACTTCCGG
GAGAACGGCA TCGAGCTGCA GATCACCGAA CAGGACCGGG AACTGCTGGC CGCGCACACC
GTCGACTTCG TCTCGTTCAG CTACTACATG AGCATCTGCG AGACCGGGGA TCCGGCCAGG
CGCCTGGCCG GTGCGGGCAA CATCATGGGC GGGGTGCCGA ACCCGCACCT GCCGGCCTCC
GAATGGGGCT GGCAGATCGA CCCGGTCGGG CTTCGGGTCA TCCTCAACCA GTTCTGGGAC
CGCTGGGGCA AGCCGCTGTT CATCGTCGAG AACGGGCTCG GCGCCCGGGA CGAGCTGGTC
GAGTCGGCTG ACGCCGTCGA CGGTTTCACC GTCCTGGACG ACTACCGGAT CGACTACCTG
CGCGCGCATC TGCAGCAGGT CGGCGAGGCC ATCGCCGACG GCGTCCAGGT GCTGGGCTAC
ACCAGTTGGG GCCCGATCGA TCTGGTCAGC GCCTCGACCG CGCAGATGTC CAAGCGGTAC
GGGTTCATCT ACGTCGACCG CAACGACGAC GGCACCGGCA CGCTGGCCCG CTACCGCAAG
AAGTCGTTCC ACTGGTACGC GCAGGTCATC GCGACCAACG GCGCCACCCT CCGGCAGAAC
TGA
 
Protein sequence
MTSISAPTQF PPGFLWGGAT AANQVEGGYD QGGKGLSIQD VMPQGIVGPR TDGPTPENLK 
LTGIDFYHRY AEDIALFAEM GFSVYRFSIA WSRIFPGGDD AEPNEEGLAF YDRVLDELER
HGIEPLITLS HYETPLAIAE KYDGWVSRDV IALFERYVRV VFARYGHRVK YWLTFNEINS
VIHAPFMSGG INTPKDELTP TDLYQAIHHE LVASALATKI AHETDPQIKV GCMVLGMPIY
PLSPDPNDLL ASMTADHANL MFSDVHTRGE YPGYALRYFR ENGIELQITE QDRELLAAHT
VDFVSFSYYM SICETGDPAR RLAGAGNIMG GVPNPHLPAS EWGWQIDPVG LRVILNQFWD
RWGKPLFIVE NGLGARDELV ESADAVDGFT VLDDYRIDYL RAHLQQVGEA IADGVQVLGY
TSWGPIDLVS ASTAQMSKRY GFIYVDRNDD GTGTLARYRK KSFHWYAQVI ATNGATLRQN