Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3847 |
Symbol | |
ID | 8449466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4218328 |
End bp | 4219593 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645042896 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003203132 |
Protein GI | 258653976 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.00585532 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.205792 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGTTGG CGTTCTCCCT GGCCGCCTGC GGCAAGGGAG CGAGCAGCAG CAGCGACGCG CAGACCGACG CCAGCGGCAC GACCACGTTG AAGATGTGGA CGCACAACGC GGGCAACGAC ACGGAACTCG CCGCGATCAA CCAGGTCGTC GCGGACTACA ACGCCAGCCA GAGCAACTAC AAGGTCGAGG TCCAGGCGTT CCCGCAGGAC TCCTACAACA CCTCGGTGAC GGCGGCGGCC GCCTCCAAGA GCCTGCCCTG CATCCTGGAC GTGGACGGCC CGAACGTGCC GAACTGGGCC TGGGCCGGGT ACCTGGCCCC GCTGGACGGC CTGGACGAGC GGATCGCCCA GTTCCTGCCC AGCGTGGTCG GCAGCTTCGA CGGCAAGAAT TACGCCGTCG GCTACTACGA CGTGGCGCTG ATCATGCAGG CCCGGACGTC GGCCTTGCAG GAGAACGGGA TCCGCATCCC GACCATCGAC CAGCCCTGGA CCGAGGACGA GTTCGCGGCC GCGCTGGCCG CGATCAAGGC CAGCGGCAAG TACGAGAACA CGCTGGACCT GCAGACCGGC AACACCGGTG AGTGGTGGCC GTACGCGTAC TCGCCGATGC TGCAGAGCTT CGGGGGCGAC CTGATCAACC GGGACGGCTA CACCAGCGCC GACGGGGTGC TCAACGGTCC GGCCGCGGTG CAGTGGGCCA CCTGGTTCCG CTCGCTGGCC ACCGACGGCT ACATGCCGCT CAAGTCGGGC GCCGATCCGG CCCAGGACTT CCTCAACGGC AAGACCGCGA TCCTGTACAA CGGCTCGTGG GGCGCCGAAC CCGCGCGGGC GTCGGCCATC GCCGACGACG TCTCCTTCCT GCCGGCGGTC AATCTCGGCC AGGGAGCCAA GATCGGCGGC GGATCCTGGC AGTGGGCGGT CAGTTCCGGC TGCCCGTCGA CCGAGGGCGC GCTGGACTAC ATGAAGTTCG CGCTGCAGGA CAAGTACGTC GCCGCGGTGT CCAAGGCGAC CGGGACGATC CCGGCCACCG ACGCCGCCGC GGCCATGGTG CCCGGCTACG AACCGGGTGG GGACAACGAC ATCTTCCGTC AGTACTCCAA GGAGTTCGCC CTGATCCGGC CGGCGACCCC GGGCTACCCG TTCATCGCGA CCACCTTCAC CAAGACCGCC CAGGACATCC TCAACGGCGC CGACCCGCAG GAAGCGCTGA ACCAGGCGGT CGCCGACATC GACGCGAACC AGCAGTCCAA CAACAACTTC CAGTAG
|
Protein sequence | MALAFSLAAC GKGASSSSDA QTDASGTTTL KMWTHNAGND TELAAINQVV ADYNASQSNY KVEVQAFPQD SYNTSVTAAA ASKSLPCILD VDGPNVPNWA WAGYLAPLDG LDERIAQFLP SVVGSFDGKN YAVGYYDVAL IMQARTSALQ ENGIRIPTID QPWTEDEFAA ALAAIKASGK YENTLDLQTG NTGEWWPYAY SPMLQSFGGD LINRDGYTSA DGVLNGPAAV QWATWFRSLA TDGYMPLKSG ADPAQDFLNG KTAILYNGSW GAEPARASAI ADDVSFLPAV NLGQGAKIGG GSWQWAVSSG CPSTEGALDY MKFALQDKYV AAVSKATGTI PATDAAAAMV PGYEPGGDND IFRQYSKEFA LIRPATPGYP FIATTFTKTA QDILNGADPQ EALNQAVADI DANQQSNNNF Q
|
| |