Gene Information |
Locus tag | Namu_1770 |
Symbol | |
ID | 8447372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 1938889 |
End bp | 1940436 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645040896 |
Product | glycoside hydrolase family 31 |
Protein accession | YP_003201149 |
Protein GI | 258651993 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
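The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it matches the listed Gene Length) and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information table for Namu_1770.
length = gene_length(1938889, 1940436)
print(length, protein_length(length))  # 1548 515
```

Both results agree with the Gene Length (1548 bp) and Protein Length (515 aa) fields in the record.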
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.134766 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.116734 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCCG CCGAGGAGCC AGCGGGCCGG ACCATGCCGC TGCTGGATGG CGAGTTGTGG TGGGGCGGCG CGGTCGCCGA CGGGACCGTC ATGCCGTTCG GTGTCGGCTC CCGGCACCAC CGGGACCTGT CGACCAACGC GGGGTTTGTC GGTGATCCCG CCGCGGGAGC GAACCAGTCT GCGCCATTGC TGGTCTCGAG CCGGGGGCGG TACGTGTGGT CGGCTCAGGC GTTCGCGTTC GGCTTCGCCG ACGGTCAACT TGCCGTTTCC GGCACCGATG TCGTGGTGGG CGAGGGTGAT ACCCCAACGC TGGCGGGGGC TTTCCGCGCC GCCCGGGTGA ACTTCCCCGC GCTGGGCCGG GCCCCGGCGG CCCCGCTATT CGCCGGGCCG CAGTACAACA CGTGGATGGA GCTGCCGTAC CGGCCCACCC AGGACGGTGT GCTGGCCTAC GTCCGGGGGC TGCTGGACGC CGGGTTCCCG CCCGGGGTGG TGATGATCGA CGATCGCTGG AGCGTCGACT ACGGAGTCTG GCGCTTCGAT CCGGCCGCGT TCCCGGACCC GTCCGGCATG ATCTCGACCC TGCATGATTG GGGCTGCCCG GTGATGCTGT GGGTGGTGCC CTTCATCAGC CCGGATAGTG CGACGTTCCG GGACCTGGCC GGCCGGGGGC TGCTCATTCG CCGACCGCAC GGTGAGATCG CCGTCCGGCA GTGGTGGAAC GGGTACAGCG CGATGCTCGA CCTGACCACA CCCGACGCGA TCGCCTGGTT CACCGGCGAG CTGGACGAAC TCCGCGAGCG GTATGGGGTG GACGGCTTCA AGTTCGACGC GGGCGACCTG CGCGACTACC GGCTCGACGA CGTGACGGCG AAAAGTGCGA CCCCCACCGA GCTCTGCGAA GCCTGGGCGC GCGTCGGGCT GCGGTACTCG TTCAACGAAT ACCGCGCCGG CTGGAAGATG GGTGGCTCCC CACTCGCGCA ACGTCTGCAC GACAAACCGC CGACCTGGGA CGGCCACGGG CTGGCCTCGC TCATCCCCGA GTCGATCGCC CAAGGTCTGA TCGGCCACCC GTTCGTCTGC CCGGACATGA TCGGCGGCGG CGACCTGGCC GCCGCCGCGG CCGGCGTCGA TCAGGAACTG TTCGTCCGCT ACGCCCAGCT CGCCGCACTG CATCCGATGA TGCAGTTCTC TCTGGCCCCG CACCGGGTGC TGGACGCCGA TCATCTGATG GCGGTGCGAC AGGCCGTCGA CCTGCGCCAA ACGCTGCTGG CCGAGCTGAC CGCAATGGTC CACGACGCCG CCCGCACCGG TGAGCCCATC CTGCGGTCGC TGGCCTACGA CGATCCCGAC GACCCCGGCA CCACCGACCA GTACACCCTC GGCGGCGACA TCCTGGTCGC GCCGGTTCTG GAGCCTGGTG CGACGACCCG GCGGGTCCGA TTTCCCGCCG GGTGCTGGGT GGCCCCGGAC CGAGCCCGAT TCGATGGTCC GGACGTGCGG TCCATCCCCG TCACGCTGAC CTCAGTTCCC TGGTACCGGC GCGCATGA
|
Protein sequence | MTSAEEPAGR TMPLLDGELW WGGAVADGTV MPFGVGSRHH RDLSTNAGFV GDPAAGANQS APLLVSSRGR YVWSAQAFAF GFADGQLAVS GTDVVVGEGD TPTLAGAFRA ARVNFPALGR APAAPLFAGP QYNTWMELPY RPTQDGVLAY VRGLLDAGFP PGVVMIDDRW SVDYGVWRFD PAAFPDPSGM ISTLHDWGCP VMLWVVPFIS PDSATFRDLA GRGLLIRRPH GEIAVRQWWN GYSAMLDLTT PDAIAWFTGE LDELRERYGV DGFKFDAGDL RDYRLDDVTA KSATPTELCE AWARVGLRYS FNEYRAGWKM GGSPLAQRLH DKPPTWDGHG LASLIPESIA QGLIGHPFVC PDMIGGGDLA AAAAGVDQEL FVRYAQLAAL HPMMQFSLAP HRVLDADHLM AVRQAVDLRQ TLLAELTAMV HDAARTGEPI LRSLAYDDPD DPGTTDQYTL GGDILVAPVL EPGATTRRVR FPAGCWVAPD RARFDGPDVR SIPVTLTSVP WYRRA
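The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned up and checked is given below; the helper names are hypothetical, and the start-codon test is a simplification that accepts only ATG, although translation table 11 also allows alternative start codons. The fragment used at the end is only the first two blocks and the last block of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces between blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(raw: str) -> bool:
    """True if the sequence starts with ATG and ends with TAA, TAG, or TGA."""
    seq = clean_sequence(raw)
    return seq.startswith("ATG") and seq[-3:] in {"TAA", "TAG", "TGA"}


# Illustrative fragment only: first two blocks plus the final block.
fragment = "ATGACCTCCG CCGAGGAGCC GCGCATGA"
print(len(clean_sequence(fragment)), has_start_and_stop(fragment))
```

Passing the full Gene sequence field to `gc_content` would give a value to compare with the GC content field. Because the printed sequence begins with ATG and ends with TGA, it appears to be shown in the coding orientation even though the gene lies on the minus strand.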