Gene Smed_5241 details

Gene Information

Locus tag: Smed_5241
Symbol:
ID: 5319543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 203666
End bp: 204616
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 61%
IMG OID: 640777018
Product: periplasmic binding protein/LacI transcriptional regulator
Protein accession: YP_001313950
Protein GI: 150377355
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
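
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end coordinates are inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 203666
end_bp = 204616
nt_per_codon = 3

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and does not add a residue
# (the record lists the gene as unspliced, so no introns are subtracted).
protein_length = gene_length // nt_per_codon - 1

print(gene_length, protein_length)  # 951 316
```

Both results agree with the Gene Length (951 bp) and Protein Length (316 aa) fields.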


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.635479
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTGA CCGGGCTCAG AACCGCAGCG CTGGCTGCGG GCCTTGCCAC CTTGTTGACG 
TCAAGCGCAC TCGCGGCCGA CGTCAAGGCA ACGATGATCA TCTACCTCGA TCCGAGTGTG
CAGTTCTTTA ATCCCGTGGT GAAGGGAGCG CAAGACGCCG CCGCCCAATT CGGCGTCGAC
CTCGACGTGC AGTATGCCAA CAACGATCCG GTGCGCCAGA ATGACCTGAT CGAGAGCGCG
ACGGTCAGCG GCGTTGACGG CATCGCCGTC GCGATCTCCA GTTCGGACGC ATTCGACGAG
AGCATCTGCG CTGCAGTGAA GGCCGGCATC ATCGTCATCG GCTTCAACAA CGATGACCTC
GACGGCGCCA AAGGGAACTG TCGCCAGGCC TATGTCGGCA TGGACGAGCT TGCCTCAGGC
TATGAGCTCG GCAACCGCAT GATCAAGGAA TTTGGCCTCA AGTCCGGCGA CGTCGTCTTC
AACCCGCGCG AAATTCCGGA AGCGAGCTTT GCAGTCGCCC GTGGTGGCGG CATCGAGAAG
GCGATGACGG AAAACGGCAT CAAGGTGGAG ACGGTTCGTG CCGGCCTCGA CCCCGCCGAA
GCGCAGAACA TCATCGCGCA ATTCCTCATC GCCAACCCGA ACGTGAAGGC GCTGTTCGGC
ACCGGCTCGG TCACCTCCAC GGTGGGCGCG GGCGCCATCA AGGATGCCGG AGTAAACATT
CCATTCGGCG GTTTCGACCT TGCGGTCGAG ATCGTAAACG CGGTGGATTC CGGCGCTATG
TACGCGACCA TGGACCAGCA GCCCTATCTG CAGGGCTACT ACCCGATCGC CCAGATCGCG
CTCGCCAAAA AATACGGACT GACACCGACC GACATCGACA CGGGTCAGGG CGCCTTCCTC
GACAAGTCGC GCATCGGTTC GGTCAAGCCG CTGATCGGCA GCTATCGCTA A
 
Protein sequence
MKLTGLRTAA LAAGLATLLT SSALAADVKA TMIIYLDPSV QFFNPVVKGA QDAAAQFGVD 
LDVQYANNDP VRQNDLIESA TVSGVDGIAV AISSSDAFDE SICAAVKAGI IVIGFNNDDL
DGAKGNCRQA YVGMDELASG YELGNRMIKE FGLKSGDVVF NPREIPEASF AVARGGGIEK
AMTENGIKVE TVRAGLDPAE AQNIIAQFLI ANPNVKALFG TGSVTSTVGA GAIKDAGVNI
PFGGFDLAVE IVNAVDSGAM YATMDQQPYL QGYYPIAQIA LAKKYGLTPT DIDTGQGAFL
DKSRIGSVKP LIGSYR
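
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It uses the codon assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in permitted start codons). To keep the example short it translates only the first and last lines of the gene sequence above, both of which begin on a codon boundary. The helper names (clean, translate, gc_content) are hypothetical, and the GC function is included only to show how a figure like the GC content field could be computed; this is not the database's own code.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order; amino acid assignments
# are those shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_content(dna):
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)

# First and last lines of the gene sequence, copied from the record.
first_line = ("ATGAAACTGA CCGGGCTCAG AACCGCAGCG "
              "CTGGCTGCGG GCCTTGCCAC CTTGTTGACG")
last_line = ("GACAAGTCGC GCATCGGTTC GGTCAAGCCG "
             "CTGATCGGCA GCTATCGCTA A")

print(translate(first_line))  # MKLTGLRTAALAAGLATLLT
print(translate(last_line))   # DKSRIGSVKPLIGSYR
```

The two outputs match the first 20 and the last 16 residues of the protein sequence above. Passing the full 951 bp gene sequence to the same functions should give the complete protein and the overall GC percentage.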