Gene Namu_4758 details


Gene Information

Locus tag: Namu_4758
Symbol:
ID: 8450388
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5291397
End bp: 5293682
Gene Length: 2286 bp
Protein Length: 761 aa
Translation table: 11
GC content: 72%
IMG OID: 645043798
Product: glycoside hydrolase family 31
Protein accession: YP_003204023
Protein GI: 258654867
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
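
The coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. Both are assumptions about how this record reports its values, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5291397, 5293682)
print(length)                           # 2286
print(expected_protein_length(length))  # 761
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.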


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGGCC ACCGCCTGCC CGATCCGCTG CCCGTCTCCC CGCTGGCCGA CCCGCGCGCG 
GTCGTCGCCG GATCGCGGTA CCGGATCACT GTTCTCACCG ACGGTTTGCT GCGCCTGGAG
TACGCCGAGG ATGGGGTGTT CGAGGACCGG GCGTCGGCGT TCGCGCTGCA CCGGGACCTG
CCGGTGCCGG CGTTCACCGT TCGCGAGACC GACGCCGCGC TGGAGATCGT GACCGAGCGG
CTGCACCTGG TCTACGACCG TGGCCCGTTC ACCACCAGCG GCCTGTCCGT GCAGGTGCGG
GGCAACATCA GCACCTACCA CTCGGTCTGG CGGTACGGCG AGCCGGCCGC CGACCTGGGG
GGCACCGCCC GGACGCTGGA CAACGCCGAC GGACGGGTCC CGCTCGAACC CGGGGTGGCC
TCGCGGTTCG GGTTCGCCCT GCTGGACGAC TCGACCAGCC TGCTGCTGGA ACCGGACGGC
TGGGTCGCGC CCCGACCGCC GGGCCGCACC GACCTGTACC TGTTCGCCTA CGGGCACGAT
TACGCCGCCG CGGTCCGGGC GCTGTATGCG GTCAGCGGGG CTCCGCCGGT GCTGCCGCGG
TGGGCGCTGG GCAACTGGTG GAGCCGCTAT CACCGGTACA CCGCGCAGGC TTACTCCGAG
CTGATCGAGC GGTTCCGCGC GCAGGGCCTG CCGTTCTCGG TCGCCGTCAT CGACATGGAT
TGGCATCTGG TCGATGTCGA TGCGGCACAC GGCTCCGGCT GGACCGGGTA CACCTGGAAC
CGGGAGCTCA TCCCCGAGCC CGAACAGCTG CTGGAATGGT TGCACGCCAA CGGGTTGCGT
ATCACGTTGA ACGTGCACCC GGCCGACGGG GTCCGCGCCT TCGAGGATGC CTACCCGGCG
ATGGCCACGG CGCTGGGTCG CGACGCGCAG GCGGGGGAGC CGATCGCCTT CGACGTCACC
GACCGCGAGT TCCTGGCCGC CTATCTCGAG GTGCTGCACC GCGATCTGGA GCGGCAGGGC
GTCGACTTCT GGTGGCTGGA CTGGCAGTCC GGGCCGCACT CGCGCGTCAT CGGTATCGAC
CCGCTCTGGA TGCTCAACCA CTTCCACTTC CTGGACAGCG TCCGGGCCGG GCCCGGGTTG
ACCTTCTCCC GGTACGCCGG ACCCGGCAGC CACCGGTATC CGGTCGGCTT CTCCGGGGAC
ACGGTGATCA GCTGGGCATC GTTGAACTTC CAGCCCGAGT TCACCGCCAC CGCGGCCAAC
ATCGGCTACG GCTGGTGGAG TCACGACATC GGCGGCCACA TGTTTGGCGC CAAGGACGAC
GAGCTGACCG CCAGATGGGT GCAGTACGGC GTCTTCTCGC CGATCCTGCG GTTGCATTCG
GGAGCGAACC CGTTCATCCA CAAGGAACCG TGGACGCTCG AACCGGACGC CGCGGCGGTG
ATGACGCAGT CCTTGCGGCT GCGGCACCGG CTGGTCCCGT ACCTGCACAC GATGAATCAC
CTGGCCGCGC AGGGCACCCC GCTGGTCCGG CCGATGTACT GGGAGCAGCC GGACCGCGCG
CCCGCCTACC GGGTATCCCA TCAGTTCCGG TTCGGCACCG AGTTGATCGT CGCGCCGATC
ACTACCCCGG CCGATCCGAT CAGCCGGCTG GGGGCGGTCC GCGTGTGGCT GCCACCGGGG
GAATGGGTCG ACATCGCCTG CGGGCGGCGC TATTCGGGCG ACCGGGAGCT GGTCGTGCAC
CGGGCCCTGG CCGACATCCC GGTGTTCGCC GCGCCCGGCG CCATCGTGCC GCTGGACGCC
GCGGCGGTGC CGGACAACGA CCCGGTCAAC CCGACCGAGC TGGAGCTGCT GGTGGTCCCC
GGGGCGGACG GGCGGTACGA GCTGATCGAG GACGACGGTG CCGGCCGGGT GGCCCGCACG
CCGATCCGCT ACGACGCCGC CACCGGGCGG GTGACGATCG GCCCGGCCCA GGGGCAACTG
GACGGGCTGC CGGCGAGCCG GACCTGGACC GTCCGCTCGC CGGGCGGGGA CGGCTCGGAC
CCGGTGACCG CGGCGCCGGG CGAGGCGGTG GTGCTGGACC TGGGCGCGGC CCCGGTCACC
GATCCGGGCC GGGAATTGTT CGACCGGCTC GATCGGGCCC GGCTCGACCA CGAACTCAAG
GTGCAGGCGC TGGCGGCGGT CACCGCGGAT CGACCGGCTG GGGCGCGAAT CGGTCACCTG
CATGCCCTGG CCCTGCCGCG GGCCGTCGAA TCGGCCGTGG TCGAGATCCT GAGCGCGCTC
GACTGA
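
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted in the blocked layout used above (10-base groups separated by spaces and line breaks) and that GC content means the share of G and C among all bases. The function names are hypothetical, and how the record rounds its 72% figure is not stated.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Usage: paste the full gene sequence between the triple quotes.
example = """
ATGCCCGGCC ACCGCCTGCC CGATCCGCTG
"""
print(f"{gc_fraction(example):.0%}")
```

The same `clean_sequence` helper can be used to confirm that the pasted sequence has the expected length before computing anything else.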
 
Protein sequence
MPGHRLPDPL PVSPLADPRA VVAGSRYRIT VLTDGLLRLE YAEDGVFEDR ASAFALHRDL 
PVPAFTVRET DAALEIVTER LHLVYDRGPF TTSGLSVQVR GNISTYHSVW RYGEPAADLG
GTARTLDNAD GRVPLEPGVA SRFGFALLDD STSLLLEPDG WVAPRPPGRT DLYLFAYGHD
YAAAVRALYA VSGAPPVLPR WALGNWWSRY HRYTAQAYSE LIERFRAQGL PFSVAVIDMD
WHLVDVDAAH GSGWTGYTWN RELIPEPEQL LEWLHANGLR ITLNVHPADG VRAFEDAYPA
MATALGRDAQ AGEPIAFDVT DREFLAAYLE VLHRDLERQG VDFWWLDWQS GPHSRVIGID
PLWMLNHFHF LDSVRAGPGL TFSRYAGPGS HRYPVGFSGD TVISWASLNF QPEFTATAAN
IGYGWWSHDI GGHMFGAKDD ELTARWVQYG VFSPILRLHS GANPFIHKEP WTLEPDAAAV
MTQSLRLRHR LVPYLHTMNH LAAQGTPLVR PMYWEQPDRA PAYRVSHQFR FGTELIVAPI
TTPADPISRL GAVRVWLPPG EWVDIACGRR YSGDRELVVH RALADIPVFA APGAIVPLDA
AAVPDNDPVN PTELELLVVP GADGRYELIE DDGAGRVART PIRYDAATGR VTIGPAQGQL
DGLPASRTWT VRSPGGDGSD PVTAAPGEAV VLDLGAAPVT DPGRELFDRL DRARLDHELK
VQALAAVTAD RPAGARIGHL HALALPRAVE SAVVEILSAL D
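
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the compact NCBI ordering (bases T, C, A, G). Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons it allows, so this table is sufficient for a sequence that begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG; '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGCCCGGCC ACCGCCTGCC CGATCCGCTG"))  # MPGHRLPDPL
```

The output matches the first ten residues of the protein sequence listed above.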