Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4686 |
Symbol | |
ID | 8450316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 5209251 |
End bp | 5210585 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645043727 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003203952 |
Protein GI | 258654796 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.749978 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCGAA TGCTCAAGAT CGGCCTCGCG TTGGCCGCCG TTCCACTCGT GATCTCCGCG TGTAGCAGCG GGAGCAGTTC ATCAAGCACC ACCAGCGCGG CCTCCTCGGC CGCGACCGGC GCCAGCTCGG CGGCCTCCGG TTCGGCCGCC GCGACCGGTG GCGGCGCGGC GGGCGGGGCC TCCACCCTCA CTGTCTGGGC CGACAACTCG GCCAACACCG CCAAGGCCAT CCAGCCGCTG TGCGAGAAGT GGGCGGCCGA GAGCGGCGTC ACCTGCGTCG TGCGCATGTT CAACGGCGGG ACCGAGCTGC AGAACGCGCT GATCCAGGGC AACAGCACCG GTGACGTCCC GGACGTCTTC GAGGGCCCGC ACGACCAGAT CGGCCGGTTC CAGCAGAACG GCATTCTGGC CCCGGTCGAC CTGGGCGCCA ACACCGACAA GTTCATCCAG GCCGCCGTGC AGGGCGTCAC CTACAACGGC GCCACCGTGG GCGTGCCGTG GGCGATGGAG AACGTGGCCC TGCTGACCAA CAAGGCCCTC TCGCCGACCT GCCCGGCCAC CCTGGACGAG GCGGTGAGCA ACGCCGAGGC GCTGATTGCC GCGGGCACCG TGCAGTCCGG CCTGGGCATC GGTATGCAGA TCGGGGCGAC CGGTGACGGT TACCACTGGC AGCCGCTGTT CAGCGCCGAC GGTGGGTACG CGTTCAAGCA GAACGCCGAC GGCACCTTCG ACGCCAAGGA CATGGGCATC GGCAGCGAAG GCGCGATCGC CGCGGCCAAG CGCCTGCAGG ACCTGACCGC CAAGGGCATC TTCGGCGCCA ACGTCAGCTT CGACATCGCC AAGGAAGCAT TCACCACCGG CAAGTCGCCC TACTGGATCA CCGGTCCCTG GGCGATTCCG GACGCCAAGG CGGCGCTGGG CGACAACCTG ATGGTCTGCG CGATCCCGAA CTGGGAAGGC AGCCAGTACA AGTCGCAGCC GTTCATCGGC GTGCGGGCCT TCTTCCAGCC GAACAACGCC AAGAACCCGG TGTTGGCCTC GACCTTCCTG TCCGACGAGG TCCAGACCAC CGAGTTCATG GACGCCATGT TCGCGGTCGA CCCGCGCCCG CCGGCATGGA AGGCCTCGTT CGAGACGGCT GCCGCCGACC CGATCATCAA GGCCTTCGGC GAGTACGGCC AGCAGGGCAT CCCGATGCCG TCCATCCCGC AGATGTCCAA CGTGTTCGAG GACTGGGGCC TGGCTGAGTT CCAGGTGGCC GGTGGCGCCG ATCCGACGAC CACCATGCAG AACGCCGCCA CGTCGATCAA TCAGCGCAAC GCCAGCCTGA ACTGA
|
Protein sequence | MRRMLKIGLA LAAVPLVISA CSSGSSSSST TSAASSAATG ASSAASGSAA ATGGGAAGGA STLTVWADNS ANTAKAIQPL CEKWAAESGV TCVVRMFNGG TELQNALIQG NSTGDVPDVF EGPHDQIGRF QQNGILAPVD LGANTDKFIQ AAVQGVTYNG ATVGVPWAME NVALLTNKAL SPTCPATLDE AVSNAEALIA AGTVQSGLGI GMQIGATGDG YHWQPLFSAD GGYAFKQNAD GTFDAKDMGI GSEGAIAAAK RLQDLTAKGI FGANVSFDIA KEAFTTGKSP YWITGPWAIP DAKAALGDNL MVCAIPNWEG SQYKSQPFIG VRAFFQPNNA KNPVLASTFL SDEVQTTEFM DAMFAVDPRP PAWKASFETA AADPIIKAFG EYGQQGIPMP SIPQMSNVFE DWGLAEFQVA GGADPTTTMQ NAATSINQRN ASLN
|
| |