Gene Smed_5554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5554 
Symbol 
ID5319856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp519007 
End bp520017 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content58% 
IMG OID640777303 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_001314235 
Protein GI150377640 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0347599 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGGA TCACATTGCA GGACATCGCT GATCACACAG GCCTGTCCAA ATTCGCTGTC 
TCCTGCTCCC TTTCGGGAAA GCCGGGGGTC AGCGACACCA CGCGAAAACG CGTGCAGGAT
GCCGCCGTAC AACTCGGCTA TCAACGCTTA AAACCCGCAG AAGAGCGGCG TGAAGTCACC
CTGATTTTCC ATGATCAAGT CGACAGTGTC AGCTATGAGC TGCGAACGAT GCTGCAAGAC
GGGATGCAGC GCGAAGCGCA TCGGCTCGGC CAGCCGGTCA GGCTTCAATG GACGCATGAT
GCCAATCGGG TGAAAGCCAT GGTCAAGGAT AGTGCCGGGA TCATCCTGGT CGGTCCTCAC
GAACAGAAAA CGCTCGACAT CCTGAGAGCC TCCGGCGTTC CCGTTGTGCG TCTCGGCTGG
GTCGCCCCCC TCGAACAGGC CGATCATGTC GGCGGCACCG ACCACGAGGC AGGGATTGCA
GTTGGCGAAT ACCTGATCGG CCTCGGCCAT CGGGACATCG CCTTTCTCCA AGGGGAGGAA
GGGTATCGCG GCCGCATGGA GCGATATCAC GGTCTGCGCG AAAGTATCGA ACAGTATCCC
GATGCGCGGC TGCACAATTT GCACTTCAAG GAGGACGGGG GCTTCATTCC GGCGCTTCAA
TCTCTCCAGA CGACGGGAAT TGCGCCAACG GCGCTGTTCT GCGCGCATGA CGGACTGGCT
CTCACCGCCG TTTCGGAGCT CCTAGCGCGG GGTTACCGCA TTCCGGAAGA CATGTCCGTT
GTCGGCTTCG GTGATTTTTC TGCCGCAACG CAAATATCGC CACAGCTAAC CACCATCAAA
GTGCAAGGAC TTGAAATGGG GGCGACAGCG TTGCGGCTTC TGCTAGAGCG CATTGAAACA
CAAGGCAACA CCGTGCCTGC CAGACGCATT CTGATTGCAT CTACCTTCGT TGAGCGTCGA
TCATCGGGGC CTGCTCCGAA GCACGGAAAG AGCCTTGACA GACGAAAATG A
 
Protein sequence
MSRITLQDIA DHTGLSKFAV SCSLSGKPGV SDTTRKRVQD AAVQLGYQRL KPAEERREVT 
LIFHDQVDSV SYELRTMLQD GMQREAHRLG QPVRLQWTHD ANRVKAMVKD SAGIILVGPH
EQKTLDILRA SGVPVVRLGW VAPLEQADHV GGTDHEAGIA VGEYLIGLGH RDIAFLQGEE
GYRGRMERYH GLRESIEQYP DARLHNLHFK EDGGFIPALQ SLQTTGIAPT ALFCAHDGLA
LTAVSELLAR GYRIPEDMSV VGFGDFSAAT QISPQLTTIK VQGLEMGATA LRLLLERIET
QGNTVPARRI LIASTFVERR SSGPAPKHGK SLDRRK