Gene Smed_1730 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1730 
Symbol 
ID5322588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1810668 
End bp1811729 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content62% 
IMG OID640790668 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_001327400 
Protein GI150396933 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0821566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGACA AGACGAAAAA CAGAGCGACG ATGGCGCCGC CGGCCGAAGG CGGCAGGCCG 
ACACTGAAGA CGATCGCTTT CATGACCGGG CTCGGTATTA CCACCGTCTC GCGCGCTCTG
AAGGACGCTC CCGATATAGG CGCAGAAACC AAGGAACGCG TGCGTCTCGT CGCAAAACAG
ATCGGCTATC AACCGAATCG CGCGGGCGTT CGCCTCAGAA CGGGCAAGAC GAATGTCATT
AGTCTGGTGC TGACTCTGGA GGAAGAGATC ATGGGCATCA CCAGCCCCAT GGTCATCGGC
ATCACCGAAA TCCTTGCCGG CACCCAATAT CACCTGGTTG TGACACCCTA TAGTTCAACC
AAGGATCCGC TCGGCCCCAT CCGCTATATT CTCGACACCG GCGCAGCCGA TGGCGTGATC
ATTTCGCGTA CCGAACCCAA CGACCCGCGG GTAACGCTTT TAACCGAGCG TCACCTGCCC
TTCGCCACCC ACGGGCGTAC CGAGATGGGC CTGATCCATC CCTATCACGA TTTCGACAAT
GAGCGCTTCG CCTACGAGGC CGTCCGCAAG CTCGTCGACC GCGGCAGGCG GCGGCTGGTG
CTCCTGGAGC CACCGCCGAA TCTAACTTTC CACACCCATA TGCGCACCGG CTTCGAGCGG
GGGCTGCGGG ATTTCGGAGC GGAATCGGTG AGCTTTCATC AGGTCAATAT CGACTACAGC
CTCGTCGCCA TTCGCGATGC GTTCGAGAGG CTTATGCACT CTTCGGATGC CCCGGACGGC
ATCGTTTCCG GCAGTGGATC CGGCGCCATC GCGCTGATCG CGGGCGTCGA GGCGGCCGGC
AAGAAGGTCG GCGATGACGC CGACATGGTC TCCAAAGTGC CGAGCGATTT CCTGCGCTGG
CTCCGGCCGG AGGTGATGAC GATGTATGAG GATATCCGCA TTGCCGGGCG CGAGCTCGCC
AAGGCAGTGA TCGGCCGCAT CGAAGGCCAC CCGCCGGATA AGCTTCAGAG CCTCAGCCAG
CCCGAATTCC AGCCGCCGGT GGCGGGGCCG GCGAGATTGT AG
 
Protein sequence
MDDKTKNRAT MAPPAEGGRP TLKTIAFMTG LGITTVSRAL KDAPDIGAET KERVRLVAKQ 
IGYQPNRAGV RLRTGKTNVI SLVLTLEEEI MGITSPMVIG ITEILAGTQY HLVVTPYSST
KDPLGPIRYI LDTGAADGVI ISRTEPNDPR VTLLTERHLP FATHGRTEMG LIHPYHDFDN
ERFAYEAVRK LVDRGRRRLV LLEPPPNLTF HTHMRTGFER GLRDFGAESV SFHQVNIDYS
LVAIRDAFER LMHSSDAPDG IVSGSGSGAI ALIAGVEAAG KKVGDDADMV SKVPSDFLRW
LRPEVMTMYE DIRIAGRELA KAVIGRIEGH PPDKLQSLSQ PEFQPPVAGP ARL