Gene Namu_4686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4686 
Symbol 
ID8450316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5209251 
End bp5210585 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content68% 
IMG OID645043727 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003203952 
Protein GI258654796 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.749978 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGAA TGCTCAAGAT CGGCCTCGCG TTGGCCGCCG TTCCACTCGT GATCTCCGCG 
TGTAGCAGCG GGAGCAGTTC ATCAAGCACC ACCAGCGCGG CCTCCTCGGC CGCGACCGGC
GCCAGCTCGG CGGCCTCCGG TTCGGCCGCC GCGACCGGTG GCGGCGCGGC GGGCGGGGCC
TCCACCCTCA CTGTCTGGGC CGACAACTCG GCCAACACCG CCAAGGCCAT CCAGCCGCTG
TGCGAGAAGT GGGCGGCCGA GAGCGGCGTC ACCTGCGTCG TGCGCATGTT CAACGGCGGG
ACCGAGCTGC AGAACGCGCT GATCCAGGGC AACAGCACCG GTGACGTCCC GGACGTCTTC
GAGGGCCCGC ACGACCAGAT CGGCCGGTTC CAGCAGAACG GCATTCTGGC CCCGGTCGAC
CTGGGCGCCA ACACCGACAA GTTCATCCAG GCCGCCGTGC AGGGCGTCAC CTACAACGGC
GCCACCGTGG GCGTGCCGTG GGCGATGGAG AACGTGGCCC TGCTGACCAA CAAGGCCCTC
TCGCCGACCT GCCCGGCCAC CCTGGACGAG GCGGTGAGCA ACGCCGAGGC GCTGATTGCC
GCGGGCACCG TGCAGTCCGG CCTGGGCATC GGTATGCAGA TCGGGGCGAC CGGTGACGGT
TACCACTGGC AGCCGCTGTT CAGCGCCGAC GGTGGGTACG CGTTCAAGCA GAACGCCGAC
GGCACCTTCG ACGCCAAGGA CATGGGCATC GGCAGCGAAG GCGCGATCGC CGCGGCCAAG
CGCCTGCAGG ACCTGACCGC CAAGGGCATC TTCGGCGCCA ACGTCAGCTT CGACATCGCC
AAGGAAGCAT TCACCACCGG CAAGTCGCCC TACTGGATCA CCGGTCCCTG GGCGATTCCG
GACGCCAAGG CGGCGCTGGG CGACAACCTG ATGGTCTGCG CGATCCCGAA CTGGGAAGGC
AGCCAGTACA AGTCGCAGCC GTTCATCGGC GTGCGGGCCT TCTTCCAGCC GAACAACGCC
AAGAACCCGG TGTTGGCCTC GACCTTCCTG TCCGACGAGG TCCAGACCAC CGAGTTCATG
GACGCCATGT TCGCGGTCGA CCCGCGCCCG CCGGCATGGA AGGCCTCGTT CGAGACGGCT
GCCGCCGACC CGATCATCAA GGCCTTCGGC GAGTACGGCC AGCAGGGCAT CCCGATGCCG
TCCATCCCGC AGATGTCCAA CGTGTTCGAG GACTGGGGCC TGGCTGAGTT CCAGGTGGCC
GGTGGCGCCG ATCCGACGAC CACCATGCAG AACGCCGCCA CGTCGATCAA TCAGCGCAAC
GCCAGCCTGA ACTGA
 
Protein sequence
MRRMLKIGLA LAAVPLVISA CSSGSSSSST TSAASSAATG ASSAASGSAA ATGGGAAGGA 
STLTVWADNS ANTAKAIQPL CEKWAAESGV TCVVRMFNGG TELQNALIQG NSTGDVPDVF
EGPHDQIGRF QQNGILAPVD LGANTDKFIQ AAVQGVTYNG ATVGVPWAME NVALLTNKAL
SPTCPATLDE AVSNAEALIA AGTVQSGLGI GMQIGATGDG YHWQPLFSAD GGYAFKQNAD
GTFDAKDMGI GSEGAIAAAK RLQDLTAKGI FGANVSFDIA KEAFTTGKSP YWITGPWAIP
DAKAALGDNL MVCAIPNWEG SQYKSQPFIG VRAFFQPNNA KNPVLASTFL SDEVQTTEFM
DAMFAVDPRP PAWKASFETA AADPIIKAFG EYGQQGIPMP SIPQMSNVFE DWGLAEFQVA
GGADPTTTMQ NAATSINQRN ASLN