Gene Noca_0059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0059 
Symbol 
ID4600110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp66676 
End bp67995 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content64% 
IMG OID639774673 
Productextracellular solute-binding protein 
Protein accessionYP_921295 
Protein GI119714330 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGGTC GACACCTCGT CAGTCGGCGC CGGCGCCGGA TCGCCGCCGC GGCGCTGGCA 
GTCTCGTCGC TCCTCGCCCT CGCGGCATGC AGCAGCGCGA GCGACTCCAA CAACGCGGAC
GGCAAGACGA CGATCACGTT GTCCATGCAG AACCCCGACG TGAAGACGGC CGACCCAGCT
ACGTGGGCCA TCGTCGAGGC GTTCGAGAAG GCCAATCCCG ACATCAAGGT CGAGGTCAGT
GGCCAGGCCG TAGCCGAGCA CCTCCAGAGC CTCTCCATCG CCGCGCAGAG TGACACGCTC
CCGGACGTGT TCTGGGTGTA CAAGGCGACC GCCGAAGACA TGCTCGAGGC CGGCAAGCTG
ATGGACCTCA AACCGACCTT GGACGAGCTG GGGATCACGG AGAACCTGCC GGAGAGCACC
GTCGCGAACT TCACGGCCGG CGACCAGATC TACGGCGTTC CCTACCAGGG GCTCCTCACC
GGTCTGTGGG TCAACAAGAA GATCCTGGCC GACAACGGGC TCGAGATGCC GAGCACATTC
GACGACCTCG TCAACGTCGC TCAGGAGCTG AGTAGCAAGG ACGTCGTCAC CATCTCTAAC
GGCGCGAACC AGTCGTCGTT CAGCGTCTGG TCCTTCCTGG TCTGGCTGGA CCGGTTCGGT
TTCGAGGACA AGATCAGCGG CATGCTCGAT GGATCGGAGA GCTACGACAA CCCTGACTTC
GTGCGTATGT ACAAGCACAT CGCCGAACTC CGCGACGCCG GTGCTTTCGC CTCCAACGTC
TCCACGCAGA CCTACCAGCA GGCTGTGGAC CAGTTCCTCC AGGGCAAGGC CGCGATGCTC
GACGCCGGGG TCTGGGCATC GAGCGCCATC CAGGACTCCC CTGTGGCAGC GGACACGGTG
TTCTGGAACG GCGCCACCTT CGACGACGGT GTCGGTGATC AGAACATCGT CATGAACGTG
GCATCCGCGC CGCTCGTCGT GGACCACAAG GTCGCAGATG ACCAGGACAA GCTCGACGCA
GTCAAGGCGT TCCTGGACTT CTACTACGGT GACGAGGCGC AGCAACTCCT CGTTGACAAC
GGCCAGCCGC CTGTGACGAA CTACGAGCCG CAGATCGACG CCACCAAGCA GAGTGTGCTG
AAGTCGGCCT TGGATGTCGC CACCGACCCG TCTCTGAGCA GCCCGCAGTC TCAGCCCGAC
CTGCTGGTGT CGACGGCCGT GGCGAGCGCG ATGTACGACA GCATCTACGG CGTGATCCAG
AACCAGCTCT CCCCCGAGGA GGCCGTGGCG CTGGTGCAGA AGGCGCTCGA CGCCGAGTGA
 
Protein sequence
MIGRHLVSRR RRRIAAAALA VSSLLALAAC SSASDSNNAD GKTTITLSMQ NPDVKTADPA 
TWAIVEAFEK ANPDIKVEVS GQAVAEHLQS LSIAAQSDTL PDVFWVYKAT AEDMLEAGKL
MDLKPTLDEL GITENLPEST VANFTAGDQI YGVPYQGLLT GLWVNKKILA DNGLEMPSTF
DDLVNVAQEL SSKDVVTISN GANQSSFSVW SFLVWLDRFG FEDKISGMLD GSESYDNPDF
VRMYKHIAEL RDAGAFASNV STQTYQQAVD QFLQGKAAML DAGVWASSAI QDSPVAADTV
FWNGATFDDG VGDQNIVMNV ASAPLVVDHK VADDQDKLDA VKAFLDFYYG DEAQQLLVDN
GQPPVTNYEP QIDATKQSVL KSALDVATDP SLSSPQSQPD LLVSTAVASA MYDSIYGVIQ
NQLSPEEAVA LVQKALDAE