Gene Smed_4919 details


Gene Information

Locus tag: Smed_4919
Symbol:
ID: 5319131
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1429854
End bp: 1430903
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 63%
IMG OID: 640776703
Product: periplasmic binding protein/LacI transcriptional regulator
Protein accession: YP_001313635
Protein GI: 150377039
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
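
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1429854
end_bp = 1430903
gene_length_bp = 1050
protein_length_aa = 349

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted in the protein length.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span (bp):", span_bp)
print("expected protein length (aa):", expected_protein_aa)
```

Under these assumptions the computed span agrees with the listed gene length, and the codon count minus the stop codon agrees with the listed protein length.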


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.35548
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGGAA TTCGGCGCCT GGCGCAACAT CTCGATATTT CGATCGGAAC GGTTTCGCGT 
GCATTGAACG GCCGCCCCGA CGTTAACGAG GAGACGCGCA GGCGGGTCCT CGAAGCGGCC
GAGAAGCTCG GCTATGTGCC GAACCAGTCG GGCCGCAGCC TGAGGCAGGG CACCACCAAC
ATCATTGGCT TCATGATGCA GACCGGCACG GAGATCACCG GCCAAGGCGA CACCTTCTTC
ATGAGTGTCT TCGACGGCGT GCAGGCGGTC TTCGCCAGGC ATAAGCTCGA CCTCGTCGCC
CTGCTCTGCT CGTCCGAGGA GGATCAGAGC GATTACCTGC GCCGCGTCGT TGCGCGCGGT
TTCGCCGACG GCCTGATCCT CTCGGCCACG CAGCGGCACG ATCCGCGTAT CGAGTATCTG
GCCGAACGCA ACATCCCCTT CATTACCCTC GGCCGAAGCC TCACGGATGT CGGGCGGCCC
TGGCTGGATC TCGATTTCGA GGGAATGGCT CAGATCGCGA TCGACCGTCT CGTCGCCCGT
GGACATCGCC GTATCGCGGT CACCCGCCCC CATGACGACG CCAATCTCGG CTACATCTTC
GTCGACCGCT GCCGCGAAGC GCTCGCCGCG CATGGCCTCA CTCTGGAGGA GGAGCTGATC
TTCCGATCGA CGCCGAACGA AACCGGTGGC TATCAGATCG CACGCGAACT CCTGAAGCTC
GAGGACCGGC CGACGGCTGC CTTGCTCGTC AACGAGACGA TCGCCATCGG ATTTTACCAG
GGCCTTTCCG AAGCCGGCGT CAGGCCCGGC CGCGACATCG CGGTGATCGG GCGCTACAGC
CCGCATGCGC ATTTTCTTTC GCCGCCGCTC ACCTGTTTCC GTCTGTCGCT GCGCGACCTC
GGCATAGCGC TTGCGGAGAC GCTGCTTTCC ACTATGCCGA CTTTCCAGGA GCATTACCCG
CAGGCGTTGA CAAATGCGGT CTGGCCGATG GAGCTCATCG AAGGCGAAAG CGACGGCTTT
CGCGTCAATG GCGACGAAAG TCGCGGCTGA
 
Protein sequence
MKGIRRLAQH LDISIGTVSR ALNGRPDVNE ETRRRVLEAA EKLGYVPNQS GRSLRQGTTN 
IIGFMMQTGT EITGQGDTFF MSVFDGVQAV FARHKLDLVA LLCSSEEDQS DYLRRVVARG
FADGLILSAT QRHDPRIEYL AERNIPFITL GRSLTDVGRP WLDLDFEGMA QIAIDRLVAR
GHRRIAVTRP HDDANLGYIF VDRCREALAA HGLTLEEELI FRSTPNETGG YQIARELLKL
EDRPTAALLV NETIAIGFYQ GLSEAGVRPG RDIAVIGRYS PHAHFLSPPL TCFRLSLRDL
GIALAETLLS TMPTFQEHYP QALTNAVWPM ELIEGESDGF RVNGDESRG
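
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method used to produce the record. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in the permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table in the conventional TCAG ordering (standard amino acid assignments,
# assumed here to apply to translation table 11 as well).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, dropping one terminal stop if present."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last 30 nucleotides of the gene sequence, copied from the record.
first_block = "ATGAAGGGAA TTCGGCGCCT GGCGCAACAT"
last_block = "CGCGTCAATG GCGACGAAAG TCGCGGCTGA"

print(translate(first_block))  # MKGIRRLAQH
print(translate(last_block))   # RVNGDESRG
```

The two printed fragments correspond to the first ten and last nine residues of the protein sequence listed above. The full gene sequence can be passed to `translate` and `gc_percent` in the same way, since `clean` removes the display spacing.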