Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2793 |
Symbol | |
ID | 8448406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3059669 |
End bp | 3061429 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645041885 |
Product | malto-oligosyltrehalose trehalohydrolase |
Protein accession | YP_003202127 |
Protein GI | 258652971 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | [TIGR02402] malto-oligosyltrehalose trehalohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000000584085 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00000363421 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCGCAGA CCATCCGTGT CTGGGCCCCC CGCGCGGAGC GGGTCGCCCT GGTCACCGCC GGCACGGACG CGCCGATGAC CGCGGCCGAG GGCGGCTGGT GGACCATCGC GACGCCGGAC CAGCTGGGCG ACTACGGGTT TCGGCTCGAC GACGACGACA CGGTGCGCCC CGACCCGCGA TCGCGCTGGC AGCCGACGGG CGTCGACGGG CCGACCCGCC CGTTCGACCC GGCCGAGTAC GAGTGGGGCG ACGCGGCGTG GACGGGCCGG CGGTTGGCCG GCAGCGTCGT CTACGAACTG CACCTGGGCA CCTTCACCCC CGAGGGCACC CTGGACGCGG CGATCGGCAA GCTCGACCAC CTGGTCGACC TGGGCGTGGA CATGGTCGAG CTACTGCCGG TCAACGCCTT CGCCGGCACC CACAACTGGG GCTACGACGG GGTGCTCTGG TTCGCTGTGC AGGACAGCTA CGGCGGGCCG CGGGCCTACC AGCGGTTCGT CGACGCCTGC CACCAGCGCG GCATCGGGGT CATCCAGGAC GTCGTCTACA ACCACCTCGG CGCTGGCGGC AACCACATCC CGCTGTTCGG GCCGTATCTC AACCCGACCG CGGGCGGCAG CCCGTGGGGC GACAGCATCA ACCTGGACGG GCCGGACTCG GGCGAGGTCC GCCGCTACAT CCTGGACAAC GTAGTCATGT GGCTGCAGGA CTACCACGTG GACGGGCTGC GGCTGGACGC GGTGCACGCG CTCAACGACA GCCACGCCAC CCACCTGCTC GAGGACATCG CCAAGCGGGT CGACGCGCTG GCCCCGCACG CGCGGCGGCC GCTGTCGCTG ATCGCCGAGT CCGATCTGAA CGACCCGAAG CTGATCACTC CGCGCGAGGC CGGCGGCTAC GGGCTGACCG CGCAGTGGAG CGACGACTTC CACCACGTCC TGCACGTCGC CCTGACCGGC GAGACCGACG GCTACTACGC CGATTTCGGC AAGATGTCGG ACATCGTGAA GGTGCTGAGC CGGGCCTTCT TCCACGACGG TACGTTCTCC AGCTTCCGCG GCCGCGATCA CGGCCGGCCG GTGGACACCC TGACCACGCC GGCCTGGCGG TTCCTGGGGT ACGCGCAGAA CCACGACCAG GTCGGCAACC GGGCCGTCGG CGACCGGCTC ACCGCCCAGC TGTCCCCGGA CGACCTGGCC ATCGCCGCGG TGCTGGTGCT GACCAGCCCG TTCACCCCGA TGCTGTTCAT GGGCGAGGAG TGGGCGGCCG GCACGCCGTG GCAGTTCTTC ACCTCGCACA CCGACCAGTT CTTGGCCGAC GCCACCCGGG AGGGCCGGCT GGAGGAGTTC GCCCGGATGG GCTGGGACAA GGACCTGGTC CCCGACCCGC AGGCCGAGTC GACCTTCCTG GACTCCAAGC TCGACTGGTC CGAACTCGGC CGGGAACCGC ACGCCCGGTT ACTCGCCCTG CACCGGGACC TGATCGCGCT GCGCCGGGCG CGGCCGGAAC TGACCGACCC CTGGTTCGGT GACCTGACCG CGACCGGGGA CGACGAGGCC CGCTGGCTGC TGGTCGACCG GTCCGGCGTA CGCATCGCCG CCAACCTGTC CGACCAGGAG CGCCGGATCC CGCTGGGCGG CCCGGCCGGT GCGCTGCTGC TGGCCACGCG GGACGGGGTG CGGGTGGACC GGGCGTCCGA GCCGGGGGCC ACCCTGACGC TGCCCCCGCA TTCGGCCGCG GTGCTGGCTC CGGCAAGCTG A
|
Protein sequence | MSQTIRVWAP RAERVALVTA GTDAPMTAAE GGWWTIATPD QLGDYGFRLD DDDTVRPDPR SRWQPTGVDG PTRPFDPAEY EWGDAAWTGR RLAGSVVYEL HLGTFTPEGT LDAAIGKLDH LVDLGVDMVE LLPVNAFAGT HNWGYDGVLW FAVQDSYGGP RAYQRFVDAC HQRGIGVIQD VVYNHLGAGG NHIPLFGPYL NPTAGGSPWG DSINLDGPDS GEVRRYILDN VVMWLQDYHV DGLRLDAVHA LNDSHATHLL EDIAKRVDAL APHARRPLSL IAESDLNDPK LITPREAGGY GLTAQWSDDF HHVLHVALTG ETDGYYADFG KMSDIVKVLS RAFFHDGTFS SFRGRDHGRP VDTLTTPAWR FLGYAQNHDQ VGNRAVGDRL TAQLSPDDLA IAAVLVLTSP FTPMLFMGEE WAAGTPWQFF TSHTDQFLAD ATREGRLEEF ARMGWDKDLV PDPQAESTFL DSKLDWSELG REPHARLLAL HRDLIALRRA RPELTDPWFG DLTATGDDEA RWLLVDRSGV RIAANLSDQE RRIPLGGPAG ALLLATRDGV RVDRASEPGA TLTLPPHSAA VLAPAS
|
| |