Gene Namu_3901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3901 
Symbol 
ID8449520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4302880 
End bp4304319 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content68% 
IMG OID645042947 
Productglycoside hydrolase family 1 
Protein accessionYP_003203183 
Protein GI258654027 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.00796853 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0223593 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGAT TTCCCGAGGG GTTCCTGTGG GGCGGCGCGG TGGCGGCGCA CCAGTTCGAG 
GGCGGCTGGG ACGCCGGCGG CAAGGGACCG AACGTCGTCG ACGTGCTGAC CGCCGGCGCG
CACGGCGTGC CCCGCCGGCT GACCGACTCC GTCGAGCCGG GCACGTTCTA CCCCAACCAC
GAGGCGATCG ACTTCTACCA CCGGTTCCGT TCGGACATCG CGCTGTTCGC CGAGCTCGGA
CTGCGCTGCT TTCGCACGTC CATCTCCTGG GCCCGGATCT TCCCCCGCGG GGACGAGACC
GAGCCCAACG AGGAGGGTCT GGCCTTCTAC GACGCCGTGT TCGACGAGCT GATCGCGCAC
GGCATCGCCC CGGTCATCAC CCTGTCGCAC TTCGAGTTGC CGCTGCACCT GGCCCGCGAG
TACGGGGGCT TTCGCAACCG CGCCCTGGTC GAGCTGTTCG CCCGGTTCGC CGAGGTGTGC
TTTCGCCGGT ACCGGCACAA GGTCCGGTAC TGGATGACCT TCAACGAGAT CAACAACCAG
ATGGACACCG ACAACTGGCT GTTCCTGTGG ACCAACTCCG GAGTGCTGGT CGGACCGGAG
GAGAACGCCC GCGAGGTGAT GTTCCAGACC GCCCACCACG AGCTGCTGGC CAGCGCCAGG
GCGGTCGCCA TCGGGCATGC GATCGACCCC GACCTGCAGA TCGGGGCGAT GGTCTCGCAC
GTGCCGATCT ACCCGTTCTC CTGCGACCCG CAGGACGTGA TGGCCGCCCA GATCGCGATG
CGGCAGCGGT TCTTCTTCCC CGACGTGCAG GTGCGCGGCG CCTACCCGGC CTACGCGCTC
AAGGAGTTCG AGCGCGAGGG CTACCGGATC GCGATGGATC CGCAGGACGC GCAGATCCTG
GCCGCCGGCA CGGTCGACTA CCTGGGCTTC AGCTACTACA TGTCCACCGT GGTCAAGGCT
GACGCGGTGA ACGAGAACAC CGGCGAGTCG GTCGATTTCA CCCTGCCCAA CGGGGTGCCC
AACCCGTACC TGACGGCCAG CGACTGGGGC TGGCAGATCG ACCCGGTCGG CCTGCGGTAC
ACGCTGAACA CCCTGTCCGA GCGCTACCAG CTACCGTTGT TCATCGTCGA GAACGGTTTC
GGCGCGGTCG ATGTGGTCGC CGACGACGGC ACCATCGACG ACGCCGAGCG GATCGACTAC
CTGCGCGCGC ACATCGAGGC GATGCGGGAC GCGATCGACC AGGACGGCGT CGACCTGATC
GGCTACACCC CCTGGGGCAT CATCGATCTC GTCTCGTTCA CGACGGGGGA GATGCGCAAG
CGGTACGGGA TGATCCACGT CGACCGGGAC AACGAGGGCC ACGGCACGCT GGCCCGGACC
CGCAAGCGGT CCTTCGGCTG GTACCGGGAC GTCATCGCCG CCAACGGCGC CGCGCTCTAG
 
Protein sequence
MSGFPEGFLW GGAVAAHQFE GGWDAGGKGP NVVDVLTAGA HGVPRRLTDS VEPGTFYPNH 
EAIDFYHRFR SDIALFAELG LRCFRTSISW ARIFPRGDET EPNEEGLAFY DAVFDELIAH
GIAPVITLSH FELPLHLARE YGGFRNRALV ELFARFAEVC FRRYRHKVRY WMTFNEINNQ
MDTDNWLFLW TNSGVLVGPE ENAREVMFQT AHHELLASAR AVAIGHAIDP DLQIGAMVSH
VPIYPFSCDP QDVMAAQIAM RQRFFFPDVQ VRGAYPAYAL KEFEREGYRI AMDPQDAQIL
AAGTVDYLGF SYYMSTVVKA DAVNENTGES VDFTLPNGVP NPYLTASDWG WQIDPVGLRY
TLNTLSERYQ LPLFIVENGF GAVDVVADDG TIDDAERIDY LRAHIEAMRD AIDQDGVDLI
GYTPWGIIDL VSFTTGEMRK RYGMIHVDRD NEGHGTLART RKRSFGWYRD VIAANGAAL