Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1712 |
Symbol | |
ID | 8447314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 1874680 |
End bp | 1876281 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645040838 |
Product | glycoside hydrolase family 31 |
Protein accession | YP_003201091 |
Protein GI | 258651935 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000051017 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0363963 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGC GCACCGCCCG CCCGCCGGCC CTGTCCCTTG GAGTGCTGTC CCTGGAGCTG CCGGCCGGGG AACGCTGGTG GGGCGGCGCC GTGCCGGACG GCGGCGTCAT GCCGTTCGGC GATCGGCCGC ACCACCGCGA TCTGGCGGTG AACGCGGGCC TGCTCGACGA CCCGACCGGC GGGGCCAACC AGTCCGCTCC CCTGCTGCTG TCCAACCGCG GTCGTTACGT CTGGTCCGAC CGGCCGTTCG CCTTCACCGT GACCGGCGGC CGGCTGCAGG TGCAGGGCCC CGATCTGGTG GTCGGACAGG GCGCGGAGCC GACCCTGGCC GCGGCCTTCC GGGCGGCGTC GGCGGCCCAC TTCCCGCCCG CCGGCCGGGC CCCGGCCCGG GCGATGTTCA CCGGTCCGCA GTACAACACC TGGATCGAGA TGCCGTTCGC GCCCACCCAG CGCAAGGTGC TCGACTATGT CCGCGGCATG CTCGATGCCG GACTGCCGCC CGGTCCGGTG ATGATCGACG ACCTGTGGGC GCAGGACTAC GGCAGCTGGC AGTTCGACCG GAGCGCCTTC CCCGACCCCA CCGCGATGGT TGCCCAGCTG CACGACTGGG GCTGCCCGGT GCTGCTCTGG GTGGTCCCGT TCATCAGCCC GGACAGCCGC GCCTTCCGGG CGGTGCGGCC GCGCGGCCTG CTCATCCGGC GGCGGAGCGG CCGGATCGCC ATCCGCGAGT GGTGGAACGG GTTCAGCGCG ATGCTCGACC TGACCAACCC GGCGGCCGTG GACTGGTTGT GCGGGGAGCT GGACACGCTG CGCGACGAGC ACGGCATCGA CGGGTTCAAG TTCGACGCGG GCGACGTGCG CGACTACCGC AGCGACGACG TCACCCTCGG TGGAGCCGAG CCGGTCGACC TGTGCCAGGC CTGGGCCGAA CTGGGACAGC GCTACGCCTA CAACGAGTTC CGGGCCTGCT GGCGCTCCGG CGGTGCCCCG CTGGCCCAGC GGCTGCACGA CAAGCCGGCC ACCTGGGACG AGCGGGGGCT GGGCTCGCTC ATCCCCGAGT CGATCGCCCA GGGCCTGATC GGCCATCCCT TCAGCTGCCC GGACATGATC GGCGGCGGCG ATCTGTGGGG GGCCGAGGAC GAGGTCGACC AGGAGCTGTT CGTCCGCTAC GCCCAGGTCG CCGCGCTGCA CCCGATGATG CAGTTCTCCC GGGCCCCCCA GCGCGTGCTC GACGCCGAGC ACCTGGCCAT CGTCCGGGCC GCCGTCGCCC TGCGCGAGCG GCTGCTGCCC GACCTGCTGG CCCTGGTCGA CCACGCGGCC CGCACCGGCG AGCCGATCCT GCGGTCACTG GCCTGGCTGG ATCCGGACGA TCCCGGTCCG GCCGATCAGT TCCTGCTCGG TTCCGACCTG CTCGTCGCCC CGGTGCTCGA ACCCGCGTCG CCGCATTCGG TTCCGACCCG CGGTCAGACC CGCAGCATCC GGCTGCCGCC GGGTCGTTGG CAGGCGCAAT GGGACGGCGC CGATCGCACC GTGCTGGCCG GGCCGACCCG GATCACCGTG CCGGTGGACC TGAGCACGCT GCCCTGGTTC CGCCGGGTCT GA
|
Protein sequence | MTERTARPPA LSLGVLSLEL PAGERWWGGA VPDGGVMPFG DRPHHRDLAV NAGLLDDPTG GANQSAPLLL SNRGRYVWSD RPFAFTVTGG RLQVQGPDLV VGQGAEPTLA AAFRAASAAH FPPAGRAPAR AMFTGPQYNT WIEMPFAPTQ RKVLDYVRGM LDAGLPPGPV MIDDLWAQDY GSWQFDRSAF PDPTAMVAQL HDWGCPVLLW VVPFISPDSR AFRAVRPRGL LIRRRSGRIA IREWWNGFSA MLDLTNPAAV DWLCGELDTL RDEHGIDGFK FDAGDVRDYR SDDVTLGGAE PVDLCQAWAE LGQRYAYNEF RACWRSGGAP LAQRLHDKPA TWDERGLGSL IPESIAQGLI GHPFSCPDMI GGGDLWGAED EVDQELFVRY AQVAALHPMM QFSRAPQRVL DAEHLAIVRA AVALRERLLP DLLALVDHAA RTGEPILRSL AWLDPDDPGP ADQFLLGSDL LVAPVLEPAS PHSVPTRGQT RSIRLPPGRW QAQWDGADRT VLAGPTRITV PVDLSTLPWF RRV
|
| |