Gene Namu_2793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2793 
Symbol 
ID8448406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3059669 
End bp3061429 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content72% 
IMG OID645041885 
Productmalto-oligosyltrehalose trehalohydrolase 
Protein accessionYP_003202127 
Protein GI258652971 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR02402] malto-oligosyltrehalose trehalohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000584085 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000363421 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCGCAGA CCATCCGTGT CTGGGCCCCC CGCGCGGAGC GGGTCGCCCT GGTCACCGCC 
GGCACGGACG CGCCGATGAC CGCGGCCGAG GGCGGCTGGT GGACCATCGC GACGCCGGAC
CAGCTGGGCG ACTACGGGTT TCGGCTCGAC GACGACGACA CGGTGCGCCC CGACCCGCGA
TCGCGCTGGC AGCCGACGGG CGTCGACGGG CCGACCCGCC CGTTCGACCC GGCCGAGTAC
GAGTGGGGCG ACGCGGCGTG GACGGGCCGG CGGTTGGCCG GCAGCGTCGT CTACGAACTG
CACCTGGGCA CCTTCACCCC CGAGGGCACC CTGGACGCGG CGATCGGCAA GCTCGACCAC
CTGGTCGACC TGGGCGTGGA CATGGTCGAG CTACTGCCGG TCAACGCCTT CGCCGGCACC
CACAACTGGG GCTACGACGG GGTGCTCTGG TTCGCTGTGC AGGACAGCTA CGGCGGGCCG
CGGGCCTACC AGCGGTTCGT CGACGCCTGC CACCAGCGCG GCATCGGGGT CATCCAGGAC
GTCGTCTACA ACCACCTCGG CGCTGGCGGC AACCACATCC CGCTGTTCGG GCCGTATCTC
AACCCGACCG CGGGCGGCAG CCCGTGGGGC GACAGCATCA ACCTGGACGG GCCGGACTCG
GGCGAGGTCC GCCGCTACAT CCTGGACAAC GTAGTCATGT GGCTGCAGGA CTACCACGTG
GACGGGCTGC GGCTGGACGC GGTGCACGCG CTCAACGACA GCCACGCCAC CCACCTGCTC
GAGGACATCG CCAAGCGGGT CGACGCGCTG GCCCCGCACG CGCGGCGGCC GCTGTCGCTG
ATCGCCGAGT CCGATCTGAA CGACCCGAAG CTGATCACTC CGCGCGAGGC CGGCGGCTAC
GGGCTGACCG CGCAGTGGAG CGACGACTTC CACCACGTCC TGCACGTCGC CCTGACCGGC
GAGACCGACG GCTACTACGC CGATTTCGGC AAGATGTCGG ACATCGTGAA GGTGCTGAGC
CGGGCCTTCT TCCACGACGG TACGTTCTCC AGCTTCCGCG GCCGCGATCA CGGCCGGCCG
GTGGACACCC TGACCACGCC GGCCTGGCGG TTCCTGGGGT ACGCGCAGAA CCACGACCAG
GTCGGCAACC GGGCCGTCGG CGACCGGCTC ACCGCCCAGC TGTCCCCGGA CGACCTGGCC
ATCGCCGCGG TGCTGGTGCT GACCAGCCCG TTCACCCCGA TGCTGTTCAT GGGCGAGGAG
TGGGCGGCCG GCACGCCGTG GCAGTTCTTC ACCTCGCACA CCGACCAGTT CTTGGCCGAC
GCCACCCGGG AGGGCCGGCT GGAGGAGTTC GCCCGGATGG GCTGGGACAA GGACCTGGTC
CCCGACCCGC AGGCCGAGTC GACCTTCCTG GACTCCAAGC TCGACTGGTC CGAACTCGGC
CGGGAACCGC ACGCCCGGTT ACTCGCCCTG CACCGGGACC TGATCGCGCT GCGCCGGGCG
CGGCCGGAAC TGACCGACCC CTGGTTCGGT GACCTGACCG CGACCGGGGA CGACGAGGCC
CGCTGGCTGC TGGTCGACCG GTCCGGCGTA CGCATCGCCG CCAACCTGTC CGACCAGGAG
CGCCGGATCC CGCTGGGCGG CCCGGCCGGT GCGCTGCTGC TGGCCACGCG GGACGGGGTG
CGGGTGGACC GGGCGTCCGA GCCGGGGGCC ACCCTGACGC TGCCCCCGCA TTCGGCCGCG
GTGCTGGCTC CGGCAAGCTG A
 
Protein sequence
MSQTIRVWAP RAERVALVTA GTDAPMTAAE GGWWTIATPD QLGDYGFRLD DDDTVRPDPR 
SRWQPTGVDG PTRPFDPAEY EWGDAAWTGR RLAGSVVYEL HLGTFTPEGT LDAAIGKLDH
LVDLGVDMVE LLPVNAFAGT HNWGYDGVLW FAVQDSYGGP RAYQRFVDAC HQRGIGVIQD
VVYNHLGAGG NHIPLFGPYL NPTAGGSPWG DSINLDGPDS GEVRRYILDN VVMWLQDYHV
DGLRLDAVHA LNDSHATHLL EDIAKRVDAL APHARRPLSL IAESDLNDPK LITPREAGGY
GLTAQWSDDF HHVLHVALTG ETDGYYADFG KMSDIVKVLS RAFFHDGTFS SFRGRDHGRP
VDTLTTPAWR FLGYAQNHDQ VGNRAVGDRL TAQLSPDDLA IAAVLVLTSP FTPMLFMGEE
WAAGTPWQFF TSHTDQFLAD ATREGRLEEF ARMGWDKDLV PDPQAESTFL DSKLDWSELG
REPHARLLAL HRDLIALRRA RPELTDPWFG DLTATGDDEA RWLLVDRSGV RIAANLSDQE
RRIPLGGPAG ALLLATRDGV RVDRASEPGA TLTLPPHSAA VLAPAS