Gene Information |
Locus tag | Namu_2035 |
Symbol | |
ID | 8447644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2244639 |
End bp | 2246111 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 645041161 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003201407 |
Protein GI | 258652251 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
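The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon (the function names are hypothetical and not part of the database):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming one terminal stop codon in the CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2244639, 2246111)
print(length)                     # 1473
print(protein_length_aa(length))  # 490
```

Both results agree with the Gene Length and Protein Length fields of this record.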
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.0658634 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00997148 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGCGATCCC GACGCGGCTG CGGTCTGCCC GGTTCGAACC GCGCCCGGCT GGTCGGCGCG CTGGCCGCCG GTGGCCTGGT GCTGGCCGCG TGCTCGTCCG CACCCGACCC GTCCGCCGGT TCCGCCTCCG GCGCGGCGGT GTCCAGCAGC CCGGTGTCCA GCCCCGTGTC CAGCCCGGCG TTGAGTGCTG CGTTGAGTTC GGCGGCGCCG ACCCCATCGA GCACCGAACC GACACCGTCG GCCACCGCAC CGGCCAGTCC GAGTGCCCCG CCGCCGGAGG CCGCCGCGCC GGCGCCGGCC GTCCCGGCCG GTGGGTTGGT GACCGGCGCC GACATGACGG CCGCGTCCGC CGCGGTGGCC GCGATGAGCA CCGCCGACCG CGCCGGCCTG GTCGTCATGG CCAGCTCGGC CGACGCCGTG GACACCGATC TGGTCGCGCA GCTGCACCTG GGCGGGGTGA TCCTGATGGG GTCGCAGGGG TCGATCGACG GCACCTCGAC CGGTACGCCC GAGCAGGTCG CCGCGGTCAC CGCCCAGCTG CAGAGTCAGG TGCCGGCCGC CCAGGCGGGC GCGCCGCTGC TGATCGCCAC CGACCAGGAA TCCGGCCTGG TCACCCGGCT GGTCAACGGT TTCAACGACT TCCCCGGCAA CCAGGAACTG TCCGGCATCG CCGACACCGC CGCGGCGGCC GCGGCCACTG AGGCGGTCAC CGCGGCCAGC GGGGCCGAGA TGCGCGCGGT CGGCATCAAC GTCGACTTCG CGCCGGACGC CGACGTGCTG CCGCAGTCCG GGGATTCCGG GGTCGACGGC CGGACCTTCG GCGCCGATCC CGACCGGTCG GCGACCCTGG TCGCCGCCGC CGTCCGCGGG TACCAGAGCG GCGGGGTGGC CGCGACGGTC AAGCACTTCC CGGGGATCGG CCGGCTGGCT ACCGACACGC ACAAGGCGCT GCCGTCGCTG GACGTGGACT GCGCGGAGTG GAACGCGGTC GAGGCGGTGC CGATGCAGGC CGGAGTGGAC GCCGGCGCCG CGCTGGTGAT GACCGGACAC ATCGAATTGC CGGCGGTCGG CGCGGTGGGC GAGTCGTCGG CGCTGAGCTC CGCGGTGGTC ACCGACCTGC TCAAGGGCTC GGGGGCGGGC GGCTGCACCG GGCTGAACTT CGCCGGCGTC GCGGTCTCCG ACTCGTTCGA GATGGCGCCG GTGGTGGACA ACTTCTCGCC GAGCGAGGCC GCCTGGCGGG GCATCGCGGC CGGTCAGGAC CTGGTGCTGA TGCCGGTCGA CCCGACGGCC GCGGTGACCG GCATCGCCGC CGCGGCCGAC AGCGGGCAGC TGCCGGCGAC GCGGCTGGCC GAGGCGGCCA CCCGGGTCTA CGCGCTGCGG CTGGCGCTGG GTCGGATCCC GGCCCCCGGC CTGGAGGTGG TCGGCTCGGC CGAGCACGAG GCGGTCGCGG CGAACGCACG GGCCCAGGGC TGA
Protein sequence | MRSRRGCGLP GSNRARLVGA LAAGGLVLAA CSSAPDPSAG SASGAAVSSS PVSSPVSSPA LSAALSSAAP TPSSTEPTPS ATAPASPSAP PPEAAAPAPA VPAGGLVTGA DMTAASAAVA AMSTADRAGL VVMASSADAV DTDLVAQLHL GGVILMGSQG SIDGTSTGTP EQVAAVTAQL QSQVPAAQAG APLLIATDQE SGLVTRLVNG FNDFPGNQEL SGIADTAAAA AATEAVTAAS GAEMRAVGIN VDFAPDADVL PQSGDSGVDG RTFGADPDRS ATLVAAAVRG YQSGGVAATV KHFPGIGRLA TDTHKALPSL DVDCAEWNAV EAVPMQAGVD AGAALVMTGH IELPAVGAVG ESSALSSAVV TDLLKGSGAG GCTGLNFAGV AVSDSFEMAP VVDNFSPSEA AWRGIAAGQD LVLMPVDPTA AVTGIAAAAD SGQLPATRLA EAATRVYALR LALGRIPAPG LEVVGSAEHE AVAANARAQG
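The two sequence fields can be cross-checked against each other and against the GC content field. The following is a sketch only, assuming the sequences are pasted as shown (blocks of ten characters separated by spaces). The helper names are hypothetical. The codon table is the standard one, whose amino acid assignments are the same as those of translation table 11.

```python
# Standard codon table built from the usual TCAG ordering.
# Table 11 uses the same amino acid assignments (assumption stated above).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spacing between ten-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(1 for base in s if base in "GC") / len(s)


def translate_cds(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing incomplete codon is ignored; unknown codons become 'X'.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate_cds("ATGCGATCCC GACGCGGCTG"))  # MRSRRG
```

Passing the full Gene sequence value to `translate_cds` and comparing the result with the cleaned Protein sequence value, or passing it to `gc_fraction` and comparing with the GC content field, would be one way to validate the record.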