Gene Namu_1773 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1773 
Symbol 
ID8447375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1944081 
End bp1945478 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content64% 
IMG OID645040899 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003201152 
Protein GI258651996 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.504545 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.290135 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGAA CCCGTGCGGT GCGGTCGGCG TGGATCGCCG GTACCGCTGC CCTGGCCATG 
ATCCTGACGG CCTGCGGGAG CAGCAGTGAC AGCAGCAGCG CGAGCGTGTC GGTCGGCGGT
GGCGCCGCCG CGACCGCGGT GACCGGTGAC GTCTCGGGAA GCATCGACTA CTGGTTGTGG
GACTCCAACC AGCTGCCCGC CTACCAGGAG TGTGCCGACG CCTTCAGCAA GAAGTACCCG
AACGCCAAGG TCAACATCAC CCAGTACGGC TGGGACGACT ACTGGGCCAA GATCAACAAC
GGGTTCACCT CCGGCACCGG ACCGGACGTG TTCACCAACC ACCTGTCCAA GTACCCGGAG
TTCGTGAACA AGCAGTACAT CCTCAACCTG TCCGACGCGC TCAAGGCCGA CGGCGCCGAC
AAGGACATCT ACCAGAAGGG CCTGCTGTCG CTGTGGACCG CCCAGGACGG CGGAATCTAC
GGCCTGCCCA AGGACTTCGA CACCATCGGC TTGTTCTACA ACGAGGACAT GATCACCGCG
GCCGGCTACA CCGACGCGGA CCTGCAGAAC TTGACCTGGA ACACCACCGA CGGCGGCACG
TTCGAGAAGT TCATCGCCCA CATGACCATC GATGCCAACG GGGTCCGCGG TGACGAGCCC
GGGTTCGACA AGACCAACAT CAAGACCTAC GGATTCGGCC AGGAGAACCT GACCGACGGT
AACGGCCAGA CCCAGTGGAG CCCGTTCACC GGCAGTAACG GCTGGACCTA CACCGACAAG
AACCCGTGGG GCACCCAATT CAACTACGGC GACGACAAGT TCAACGAAAC GATGACCTTC
TACAAGTCGC TGTCCGAGAA GGGCTACTCC CCGACCATCG ACAAGACCGT CGGCGTGGAC
ACCGGAACTC AGCTCGCCGC CGGCACCTAC GCCACCATCT TCGAGGGGGA CTGGAACACC
AGCAGCTACC TGGGCAAGGG CGTGAACCTG AAGATCGCGC CGACCCCGAT CGGCCCGAGC
GGCGAGCGCG CCTCGATGTT CAACGGCCTG GCCGACTCGG TCAACGCCGG CACCAAGAAC
CAGGCCGCGG CGGTCAAGTG GGTCGAGTTC ACCGGGTCCC AGGAATGCCA GGACCTCGTC
GCCGCCAAGG CCGTCGTGTT CCCGGCCATC CCGGCCTCGA CCGACAAGGC GGAGGCCGCG
TTCAAGGCCA AGGGCGTGGA CATGTCCGGC TTCCTGGTGC AGGTCAAGGA CGGCACCACG
TTCCTGTTCC CGATCACCGA CCACGCCGCC GACGTCACCG CGATCATGGC CCCGGCCCTG
CAGGGCTTCA TGAGCGGCCA GGCCGACGTC AGCTCCTTCA AGGACGCCAA CGACCAGGTC
AACGCCCTGT TCCAGTAG
 
Protein sequence
MKRTRAVRSA WIAGTAALAM ILTACGSSSD SSSASVSVGG GAAATAVTGD VSGSIDYWLW 
DSNQLPAYQE CADAFSKKYP NAKVNITQYG WDDYWAKINN GFTSGTGPDV FTNHLSKYPE
FVNKQYILNL SDALKADGAD KDIYQKGLLS LWTAQDGGIY GLPKDFDTIG LFYNEDMITA
AGYTDADLQN LTWNTTDGGT FEKFIAHMTI DANGVRGDEP GFDKTNIKTY GFGQENLTDG
NGQTQWSPFT GSNGWTYTDK NPWGTQFNYG DDKFNETMTF YKSLSEKGYS PTIDKTVGVD
TGTQLAAGTY ATIFEGDWNT SSYLGKGVNL KIAPTPIGPS GERASMFNGL ADSVNAGTKN
QAAAVKWVEF TGSQECQDLV AAKAVVFPAI PASTDKAEAA FKAKGVDMSG FLVQVKDGTT
FLFPITDHAA DVTAIMAPAL QGFMSGQADV SSFKDANDQV NALFQ