Gene Smed_3579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3579 
Symbol 
ID5318083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp6785 
End bp7819 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content64% 
IMG OID640775394 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_001312327 
Protein GI150375731 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAAA GACCCTCGAG AAGACTCCGC CAAGCCGACA TCGCCGCCCA TGCCGGCGTA 
TCGGTTTCTA CCGTCTCGCG CGTGCTCGCC AACGAACCCG GCATCAGCGA AGATGTCCGG
GTACAGATCT TCAAGGTCGC AAGCGAGCTC GGCTACCCAC TCAAAGCCGG CACTGCAGCT
GGTTCACGCG CACTGGCATT GATCGCAAGC AACGGCGTTA CCGGCGGCTT GAGCGCTTTT
TACCAGGGCA TCGTCGATGG CTTGCGCTCA GAGGCAGCCG CGCAGGGCAT GTCGTTCGAC
ATCCGCCTCA TCAACGAGAT GAAGGCGACG CCGCAAGTCG TTGGCGAACA TCTGGAATCA
GTCGGGGCGC AAGGGCTCTT TCTGGTCGGG ATCGACCCCA GCGAGGCACT TGGCGACTGG
CTCGTGGAAA GCCGGTTGCC CGTCGTCCTC GTCAATGGCG TCGATCCACA ATTGCGCTTC
GACGGCATCT CGCCGCCAAA CTTCTTCGGC GCCTTCGCTG CCACGCGGAT GCTGCTGGAT
GCCGGGCACA GGCGCATCCT CCACCTGACC GGATCGCATC GCCATACGAT CCGCGAGCGT
GTGCGCGGCT TCGAAGCGGC CATCGCCTGC GCTGAAGGCT GCGCGGCGCG CATCGTCCGC
CTGCCCTTCG AGACCAATTC GAGCGCGGAA GCCCATGCGG CAACGCTCGA TGCGCTCGCC
GTGGACGGGA ATTTCACCGC GGCCTTCTGC ATGAACGATT TCATCGCAGT GGGCGTTCTC
GAGGCGGTCA CCGAACTTGG CCGCCGCGTA CCGGATGATT TCGCCATTAT CGGGTTCGAC
GACCTGCCCT GCGCCGAGAT GGCCAATCCG CGCCTTTCGA CGATGCATGT CGACCGTGCA
GCGCTCGGCC GGGAAGCGGT CGGCATGATG CAGTTTCGCT TCGCTCATCC GGACGTGCCC
GCGCGGCATG TCTCTCACGC GGTCACCCCG GTGCCCGGCG GCACGATTGC ACGAAGGACG
ACGCATGACC TATGA
 
Protein sequence
MIERPSRRLR QADIAAHAGV SVSTVSRVLA NEPGISEDVR VQIFKVASEL GYPLKAGTAA 
GSRALALIAS NGVTGGLSAF YQGIVDGLRS EAAAQGMSFD IRLINEMKAT PQVVGEHLES
VGAQGLFLVG IDPSEALGDW LVESRLPVVL VNGVDPQLRF DGISPPNFFG AFAATRMLLD
AGHRRILHLT GSHRHTIRER VRGFEAAIAC AEGCAARIVR LPFETNSSAE AHAATLDALA
VDGNFTAAFC MNDFIAVGVL EAVTELGRRV PDDFAIIGFD DLPCAEMANP RLSTMHVDRA
ALGREAVGMM QFRFAHPDVP ARHVSHAVTP VPGGTIARRT THDL