Gene Smed_5594 details

Gene Information

Locus tag: Smed_5594
Symbol:
ID: 5319896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 560644
End bp: 562251
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 61%
IMG OID: 640777339
Product: extracellular solute-binding protein
Protein accession: YP_001314271
Protein GI: 150377676
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
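
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(560644, 562251)
print(length, protein_length(length))  # prints: 1608 535
```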


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.459863
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.392289
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGACA TCACCAATTG GACCAGATCT GACGACGCTA TGATCGAAAC CGCCATTCGT 
CGCGGAGCGA CGCGCCGCGA ACTCCTGCAG ATGATGCTGG CCGGCGGTGC TGCCCTCTCT
GCCGGCAGTC TGATGCTCGG CCGAGCCGGC AATGCGGTTG CCGCAACGCC GGTGGCCGGC
GGTACGCTCA AAGCGGCCGG CTGGTCGGCT TCCACCGCCG ACACGCTGGA CCCGGCCAAG
GCATCGCTCT CCACCGACTA TGTCCGCTGC TGCTCCTTCT ACAACCGACT CACCTTCCTC
GATAAAGGCG GCACGCCGCA GATGGAACTG GCCGAAGCTA TCGAGACCAA GGATGCCAAG
ACCTGGACCG TCAAGCTGAG GAAGGGCGTT ACCTTCCATG ATGGCAAGCC GCTGACAGCC
GACGACGTGA TCTTCTCGCT GAAGCGCCAC CTCGACCCGG CGGTCGGTTC AAAGGTCGCC
AAGATCGCCG CGCAGATGAC GAGCTTCAAG GCGGTGGACA AGCAGACGGT CGAGATCACG
CTCGCGAGCC CGAACGCCGA CCTGCCGACG ATCCTCTCGA TGCACCACTT CATGATCGTC
GCCGACGGCA CCACCGACTT CTCGAAAGGG AACGGCACCG GCGCTTTTGT GAGAGAAGTC
TTCGAGCCCG GCGTGCGCTC CGTCGGCCTC AAGAACAAGA ACTACTGGAA GTCCGGACCG
AACGTCGATT CCTTCGAGTA CTTCGCCATC AGCGACGACA GTTCCCGGGT GAATGCGCTC
TTGTCCGGCG ACATCCACCT TGCCGCCACG ATCAATCCGC GCTCCATGCG CCTGGTCGAG
AGCCAAGGCG ACGGCTTCGT CCTCTCGAAG ACGACCTCCG GCAACTACAC CAACCTCAAC
ATGCGGCTGG ACATGGAACC CGGCAGCAAG CGCGACTTCG TCGAAGGCAT GAAGTACCTC
GTCAACCGCG AGCAGATCGT CAAGTCGGCG CTTCGGGGTC TAGGCGAGGT CGGCAACGAC
CAGCCGGTGT CCCCGGCCAA CTTCTATCAC AATCCCGACC TGAGGCCGCA CGCCTTCGAC
CCCGAGAAGG CGAAGTTCCA CTTCGAGAAG GCCGGCATGC TAGGCCAGTC GATCCCGGTG
GTTGCCTCCG ATGCAGCGAA CTCCGCGATC GATATGGCAA TGATCATTCA GGCCTCCGCT
GCCGAAATCG GATTGAAGCT CGATGTGCAG CGCGTTCCCG CAGACGGTTA CTGGGATAAT
TACTGGCTGA AGGCGCCGAT CCACTTCGGC AACATCAACC CGCGCCCCAC GCCGGATATC
CTGTTCTCTC TGCTCTACTC CTCGGAGGCT CCGTGGAACG AGAGCCAATA CAAGTCGGAG
AAATTCGATA AGATGCTGAT CGAAGCGCGT GGCTCGCTCG ACCAGGAGAA GCGCAAGGCG
ATCTACAATG AGATGCAGGT GATGGTCGCC AGTGAAGCCG GCACCATCAT TCCGGCCTAT
ATCTCCAACG TCGACGCGAT CACCGCCAAG CTCAAGGGCC TGGAAGCCAA TCCGCTTGGC
GGGCAGATGG GTTATGCTTT TGCGGAATAT GTCTGGCTCG AGGCCTGA
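
A GC content figure like the one in the Gene Information section can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as a string in the 10-base block layout shown above. `gc_content` is a hypothetical helper, not a function of any particular library, and it is demonstrated on the first block only.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters (e.g. block separators) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(b in "GC" for b in bases)
    return gc / len(bases)


first_block = "ATGAATGACA"  # first 10 bases of the gene sequence
print(f"{gc_content(first_block):.0%}")  # prints: 30%
```

Passing the full gene sequence instead of `first_block` gives the gene-wide value.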
 
Protein sequence
MNDITNWTRS DDAMIETAIR RGATRRELLQ MMLAGGAALS AGSLMLGRAG NAVAATPVAG 
GTLKAAGWSA STADTLDPAK ASLSTDYVRC CSFYNRLTFL DKGGTPQMEL AEAIETKDAK
TWTVKLRKGV TFHDGKPLTA DDVIFSLKRH LDPAVGSKVA KIAAQMTSFK AVDKQTVEIT
LASPNADLPT ILSMHHFMIV ADGTTDFSKG NGTGAFVREV FEPGVRSVGL KNKNYWKSGP
NVDSFEYFAI SDDSSRVNAL LSGDIHLAAT INPRSMRLVE SQGDGFVLSK TTSGNYTNLN
MRLDMEPGSK RDFVEGMKYL VNREQIVKSA LRGLGEVGND QPVSPANFYH NPDLRPHAFD
PEKAKFHFEK AGMLGQSIPV VASDAANSAI DMAMIIQASA AEIGLKLDVQ RVPADGYWDN
YWLKAPIHFG NINPRPTPDI LFSLLYSSEA PWNESQYKSE KFDKMLIEAR GSLDQEKRKA
IYNEMQVMVA SEAGTIIPAY ISNVDAITAK LKGLEANPLG GQMGYAFAEY VWLEA
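
The protein sequence follows from the gene sequence by codon translation. The sketch below builds the codon table from the amino-acid string that NCBI publishes for translation table 11, whose codon-to-residue assignments match the standard code. It assumes the gene starts with ATG, as this one does, so the alternative start codons permitted by table 11 are not handled. `translate` is an illustrative name, and an ambiguous base such as N would raise a `KeyError`.

```python
BASES = "TCAG"
# Amino acids for codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate DNA (whitespace allowed) up to the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAATGACA TCACCAATTG GACCAGATCT"))  # prints: MNDITNWTRS
```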