Gene Smed_2187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2187 
Symbol 
ID5323047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2262860 
End bp2264020 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content60% 
IMG OID640791125 
Productextracellular solute-binding protein 
Protein accessionYP_001327855 
Protein GI150397388 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.227535 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.141729 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAGA TCCTGAAATC CTGTACTGCG CTCACCCTGT CGATGGCGCT CGTCGCGCCT 
GCCTTTGCGC AGGAGCCGCC GAAGGAACTG GGGCCAGGCG AAGGCGCGCT CTCGATCGTC
GCCTGGGCGG GCTATATCGA GCGCGGCGAG ACTGACAAGA ATTACGATTG GGTGACCGAT
TTCGAGAGCA AGACGGGATG CAAGGTCAGC GTCAAGACGG CGGCAACCTC GGATGAGATG
GTCGCTCTGA TGAACGAAGG CGGCTTCGAC CTCGTCACCG CTTCCGGCGA CGCCTCCCTC
CGCCTTGTAG CCGGCAAACG CGTCCAGCCG ATAAACACCG ATCTCATCCC CAGTTGGAAG
ACGATCGACG AGCGCATGCA GAACGCCCCA TGGCACACGG TCGACGGTGT CCACTACGGC
ACACCCTATG TCTGGGGGCC GAATGTTCTG ATGTACAATA CCGAAGCCTT CAAGGATCAG
CCGCCGAAGA GCTGGAATGT CGTTTTCGAA GAGACGACAT TGCCCGACGG CAAGTCGAAC
AAGGGCCGCA TTCAGGCTTA TGACGGCCCC ATCCATGTGG CCGACGCTGC CAACTACCTG
ATGGCGCACA AGCCAGACCT CGGCATCAAA GACCCCTACG AGCTGAATGA GGACCAGTAC
AAGGCAGCAC TCGACCTGTT GCGGACCCAA CGCACACTGG TCGGCCGCTA CTGGCACGAC
GCGATGATCC AGATCGACGA TTTCAAGAAT GAAGGCGTCG TGGCCTCCGG CTCCTGGCCC
TTTCAGGTCA ATCTGATGCA GGCCGAAAAG CAGCCTGTAG CCTCGATCAT TCCGGAAGAG
GGAGTGACGG GCTGGGCCGA TACGACGATG CTGCATTCCG ACAGCGAACA TCCGAACTGC
GCCTATATGT GGATGGAGCA TTCGCTTTCG CCGAAGGTCC AGGGTGACGT CTCGGCCTGG
TTCGGCGCCA ACCCCTCGGT CGGCGCCGCC TGCAAAGGCA ACGCCCTTCT GACCGACGAG
GGTTGCAAGA CCAATGGCTA TGACGACTTC GAAAAGGTCA AGTTCTGGAA GACGCCGGTA
ACGAAATGCG AGAGCCAGGG CGAATGCGTG CCCTATCACC GCTGGGTCTC CGACTATATC
GGCGTCATCG GCGGGCGGTA A
 
Protein sequence
MKQILKSCTA LTLSMALVAP AFAQEPPKEL GPGEGALSIV AWAGYIERGE TDKNYDWVTD 
FESKTGCKVS VKTAATSDEM VALMNEGGFD LVTASGDASL RLVAGKRVQP INTDLIPSWK
TIDERMQNAP WHTVDGVHYG TPYVWGPNVL MYNTEAFKDQ PPKSWNVVFE ETTLPDGKSN
KGRIQAYDGP IHVADAANYL MAHKPDLGIK DPYELNEDQY KAALDLLRTQ RTLVGRYWHD
AMIQIDDFKN EGVVASGSWP FQVNLMQAEK QPVASIIPEE GVTGWADTTM LHSDSEHPNC
AYMWMEHSLS PKVQGDVSAW FGANPSVGAA CKGNALLTDE GCKTNGYDDF EKVKFWKTPV
TKCESQGECV PYHRWVSDYI GVIGGR