Gene Namu_0575 details


Gene Information

Locus tag: Namu_0575
Symbol:
ID: 8446159
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 635809
End bp: 637149
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 76%
IMG OID: 645039708
Product: glycoside hydrolase family 1
Protein accession: YP_003199979
Protein GI: 258650823
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID:
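
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the helper names gene_length and protein_length are hypothetical, and it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends with a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(635809, 637149)
print(length_bp)                  # 1341
print(protein_length(length_bp))  # 446
```

Under those assumptions the result matches the listed Gene Length (1341 bp) and Protein Length (446 aa).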


Plasmid Coverage information

Num covering plasmid clones: 49
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCGGC TGTTCGGCGT CGCGTCGGCG GCCATGCAGA TCGAGGGAGC GGCCGGGACC 
GCCGGCCGGA CGGCGTCGAT CTGGGACGTC TTCGCCACCC TGCCCGGCCG GATCGAGGAC
GGCAGCAGCC CGGCCCGGGC GGCCGACCAC CACGCCCGGT GGTCGACGGA CCTGGATCTG
CTGGCCGCGC TGGGGGTGGA CGCCTACCGG CTGACGCTGT CCTGGTCGCG GGTGCAGCCG
GGCGGGACCG GACCGGTCAA CCCGGCCGGG TTGGACTTCT ACGACGCCCT GGTCGACGGC
CTGCTGGCGC GGGGCATCGA GCCGTGGGTG TCGTTGTACC GGTGGGACCT GCCGCTGGAG
CTGATGCTGG AGGGCGGCTG GCTGCGGCGG GACACCGCCG AGCGGTTCGG CGACTACGCC
GCCGCGGTGG CCGCCCGGCT GGGCGATCGG GTCGGCGCCT GGGCGAGCCT GGAGGATCCG
TTCCCGCACC TGGCCCTGGG GCACGCGGTC GGCGTGGACG CGCCTGGATT GACCCTGCTC
GGCGGGGCCC TTCCGGTCAC CCACCACCTG CTGCTCGGGC ACGCCCGGGC CACCGCCGCG
CTGCGGGCGG CCGGCGCGGG GCGGGTCGGC CTGATCAACC ACCACACGAC CGTGCGGCCG
GCCGGCCCGT CCGGCCGGGA CCGGTGGGCG GCAGCGTTCT ACGACGAGTA CCACAACCGC
CAGTTCGCCG ACCCGGTGCT GCTCGGCCGC TACCCGGACC GGTTGCTGGC CCTGCCCGGG
GTCCCCGACG GGGTGATCGC CGACGGCGAC CTGGCCGCGA TCGCCGCCCC GCTGGACTTC
TACGGGGTGA GCTACGAGCA TCCGGTGGTG GTCGCCGCGG TGCCCGAGAA CCGGTCCGTG
CCGCTGACCC TGGTCCCGCT GGAAGGTGTG CCGCGCACCG CCGGCGACCT GCCGATCGAC
CCGCCGGCCC TGGAGCAGGT GCTGGTCGAC CTGGCCCGCC GGTACCCGCA CTTGCCGCCG
GTGGTGGTCA CCACCGGCGG CGCCTTCGAC GACCGTCCGG ACGGCGACGA CCGTCCGCGG
ATCGCGTTCC TGGACGAGCA TCTGGCCGCC GTCGACCGGG CCGCGGGCCG CGGCGTGCCG
GTCGCCGGCT ACTTCCACTG GTCGCTGCTG GACGGCTGGG CCGGCACCCC GGGCCACGTC
CGGACCGGCC TGGTCCGGGT CGATCCGGAC ACCCTGGAGC GCACCCCGCG GGCCGCATTC
GCCCACTACC GCGACCTGAT CGCGCATCGG TCGGGTGACA CATCGGTCAC GGCGCCGGCC
GATCGGCCGG CCAGGTCCTA G
 
Protein sequence
MTRLFGVASA AMQIEGAAGT AGRTASIWDV FATLPGRIED GSSPARAADH HARWSTDLDL 
LAALGVDAYR LTLSWSRVQP GGTGPVNPAG LDFYDALVDG LLARGIEPWV SLYRWDLPLE
LMLEGGWLRR DTAERFGDYA AAVAARLGDR VGAWASLEDP FPHLALGHAV GVDAPGLTLL
GGALPVTHHL LLGHARATAA LRAAGAGRVG LINHHTTVRP AGPSGRDRWA AAFYDEYHNR
QFADPVLLGR YPDRLLALPG VPDGVIADGD LAAIAAPLDF YGVSYEHPVV VAAVPENRSV
PLTLVPLEGV PRTAGDLPID PPALEQVLVD LARRYPHLPP VVVTTGGAFD DRPDGDDRPR
IAFLDEHLAA VDRAAGRGVP VAGYFHWSLL DGWAGTPGHV RTGLVRVDPD TLERTPRAAF
AHYRDLIAHR SGDTSVTAPA DRPARS
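
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the function names are hypothetical, the codon table is the standard mapping used by translation table 11, and the set of start codons is a simplified subset (ATG, GTG, TTG) of those that table 11 permits. Only the first and last fragments of the gene sequence are used, copied from the listing above.

```python
from itertools import product

# Standard codon-to-amino-acid mapping (shared by translation tables 1 and 11),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the 10-base spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    A recognised start codon at position 0 is read as methionine.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = "GTGACCCGGC TGTTCGGCGT CGCGTCGGCG"  # first 30 bases of the gene
tail = "GATCGGCCGG CCAGGTCCTA G"            # last 21 bases of the gene

print(translate(head))  # MTRLFGVASA
print(translate(tail))  # DRPARS
```

The two fragments reproduce the first ten and last six residues of the protein sequence: the gene starts with GTG, which is read as M at the initiation position, and ends with the stop codon TAG. Passing the full gene sequence to gc_fraction would give the value to compare against the listed GC content.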