Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2109 |
Symbol | |
ID | 8447720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2326876 |
End bp | 2327874 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645041232 |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_003201476 |
Protein GI | 258652320 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.0292423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0717969 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCCA CCCGTGTCGC TGTCGCGACC CTGTCCCTGA TCGCCGCCGC CGCCCTGGTT GCCGGTTGCT CCTCCGGCTC GTCCAGCGCG TCGGGCAGCA GCAGTGGTGC CGCAGAGGGC AAGAAGGTCT ACGCCCTGCT GCCGCAGGGC ACCGACCAGC CCTACGGCAC CGAGTACCTC AAGGCGATGC AGGCCGAGGC CGACAAGGAC GGCATCGACC TGACCATCAC CAACTCGCAG TACGACGCCG ACAAGCAGGC CAGCGACTGC CAGGTCGCGG TGGCGGCCAA ACCGAATCTG ATCATCCTGT GGCCCGCGGT GGCCGATGCG GTCCGGCCCT GCCTGGAGCG GGCCAAGGCG GCCGGAATCC CGGTGACGGT CACCAACTCC GACGTCGAGG CCGACGACAA GTCCCTGGTC GTCGCCTATT CCGGCCCCGA CACGATCGGT CAGGGAGCCG CGTCGGCCGA GATCATGTGC GATCTGGCCA AGGGGCAGGC CCTGAACATC CTGGAGATCG ACGGGCTCAC CGGCAACACC ACCGCCATCA ACCGAGCCAA GGGCTTCGCC GACACCATCG CCAGCACGTG CCCGAACGTC AAGGTGCTGG CCGCCCAACC CGGCGACTGG AACAAGGACG ATGCGCAGAC CGTGACCTCG GAAATGCTGA CCTCGGTCGG CGCGGCCAAC GTCCAGGGCA TCTACGCCGC GGACGACACC ATGGTGGCCG GCGCGATCGA CGCGCTCAAG GCGCAGAACA TCGACCCGAA GTCGTTGATC ATCACCTCCA TCGGCAACAC CAAACTGGGT AATCCGCTGG TGATCTCGGG TGAGCTGGAC GGCACCGTCT TCCAGTCCTC CTCGTGGGAC GGGCAGAACG CGATCGTGGT CGCCAACAAG GTGCTCTCGG GGGAGCAGGT CTCCGGCGAT CTGTTCATGC CCTCGGTCAA GGTGACCTCG GCCAACGCGA CGGACCCCTC CGTCACCCCG GAGTGGTAA
|
Protein sequence | MRATRVAVAT LSLIAAAALV AGCSSGSSSA SGSSSGAAEG KKVYALLPQG TDQPYGTEYL KAMQAEADKD GIDLTITNSQ YDADKQASDC QVAVAAKPNL IILWPAVADA VRPCLERAKA AGIPVTVTNS DVEADDKSLV VAYSGPDTIG QGAASAEIMC DLAKGQALNI LEIDGLTGNT TAINRAKGFA DTIASTCPNV KVLAAQPGDW NKDDAQTVTS EMLTSVGAAN VQGIYAADDT MVAGAIDALK AQNIDPKSLI ITSIGNTKLG NPLVISGELD GTVFQSSSWD GQNAIVVANK VLSGEQVSGD LFMPSVKVTS ANATDPSVTP EW
|
| |