Gene Smed_0343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0343 
Symbol 
ID5321176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp370334 
End bp371293 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content63% 
IMG OID640789278 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_001326036 
Protein GI150395569 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.158898 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.129182 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAAGG CTATTGCGAG ACTTTCTATC CTGCTTGCCG CCACGGCGAT TTCGGCAACC 
GCCGCCTCGG CCTCGGACGA TATCAGCGTG TCGCTTGTGC TCGGCCAGCG CAATAGCGGG
TTTCATGAGG CGATCGCCTG CGGCGCCCGC GCCGCCGCCA AGGAACTGGG CGTGAAGGTC
AACATCCAGG CCGCACCGAC CTACTCGGCC TCCGAGCAGA TCCCGGTGCT GAACGCCGTC
ATGGCGACGA GCCCTTCTGC CATCGTGCTC GATCCGACGA GCTCGACCGC GCTGATCGCG
CCACTGATGG AGGCAGCCGC CAACGGCGCC AAGATCGTCG CCGTCGACAC CACGCTCGAC
GACCCGTCGG TGCTCTCCGC CGTGGTCGGA ACCGACAATG AAAGCGTCGG CCGCGAAACC
GCCAAGGCGC TCGCGAAGGC ACTCGACGGA AAGTCCGGCA AGGTGGCGCA GATCAACAGC
ATTCCCGGCA TTTCCACCGT CGATGCCCGG ATCAAGGGTT TCGAAGAAGA GATCAAGAAG
TATCCGAATC TCACCTATAT CGGCAACCAG TTTGCAAGTG AGGATATCCC GAAGGCACAG
CAGGCCTACG TGTCGCTCAT GAGCGCCAAC CCCGACCTCA TCGGCGTCGT CTCCCAATCG
AACAATCCCG CGATCGGCGT TGCCGGCGGC ATCCGTTCCA CCGAGACGGC CGAGAGCGTC
GTCGCTATTG CGGTGGATGC CGATGAAGCC GAGATCGAGG CTCTGAATGA GGGGCTGCTC
GACGCTCTCG TCATCCAGCA GCCCTACGAA ATGGGCTATG TCGGCTTCAA GCAGGCCGTC
GCTGCCGTCA AGGGCGAGCC CGTCGAGACG CCGATCGGCA CCGGCACGGT AACCGCGACC
AAGGCGAATA TTGCCGACCC GGACGTGGCC AAATACCTTT ACGAAGGGAA TTGCATCTGA
 
Protein sequence
MTKAIARLSI LLAATAISAT AASASDDISV SLVLGQRNSG FHEAIACGAR AAAKELGVKV 
NIQAAPTYSA SEQIPVLNAV MATSPSAIVL DPTSSTALIA PLMEAAANGA KIVAVDTTLD
DPSVLSAVVG TDNESVGRET AKALAKALDG KSGKVAQINS IPGISTVDAR IKGFEEEIKK
YPNLTYIGNQ FASEDIPKAQ QAYVSLMSAN PDLIGVVSQS NNPAIGVAGG IRSTETAESV
VAIAVDADEA EIEALNEGLL DALVIQQPYE MGYVGFKQAV AAVKGEPVET PIGTGTVTAT
KANIADPDVA KYLYEGNCI