Gene Smed_5037 details


Gene Information

Locus tag: Smed_5037
Symbol:
ID: 5319086
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1556637
End bp: 1558241
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 63%
IMG OID: 640776818
Product: extracellular solute-binding protein
Protein accession: YP_001313750
Protein GI: 150377154
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
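
The coordinate and length fields above agree with one another under the usual 1-based, inclusive coordinate convention. A minimal Python sketch of that arithmetic follows. The function names are illustrative only and not part of any database API, and the protein-length rule assumes a single terminal stop codon that is not counted as a residue.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1556637, 1558241)
print(length_bp)                  # 1605
print(protein_length(length_bp))  # 534
```

Both values match the Gene Length and Protein Length fields listed for Smed_5037.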


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.4089
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.963466
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAATTC AAAGCAGGTT TTGTTCACAT CGGCTTGCCG CATTGCTGAT CTCGCTCACC 
TTCTGGAGCG GTCTTCCAGG GCCTGGCCAT GCGCAGGAGG AGCTCCGGAT CGCGACATCG
TACAAGCTGA TGACACTCGA TCCGCATTAC GCAAATCTCA ACGAAAACAC CTCGCTGCTC
TCCCATATCT ACGAACGTCT GGTCTACCAG GATGACCGTC TCGATCTGCA ACCGGGTCTG
GCCGTCTCCT GGCGCGCGGT GTCGGACAGG CAGTGGGAGT TCAAGCTTCG CGAGAATGTC
CGCTTCCATG ACGGCTCGCC CTTGACGGCG GACGACGTCG TCTACACGAT CGAGCGCATA
CGCGATTTCC TCAAATCCCC GAGCGGCGGC TTCCGGTCCT ATCTGACCGG CATCGAATCG
GTGTCGGCAC CCGATCCCCT CACCGTCGTC ATCAACACCA AGGGCAATAT CCCCAACCTG
CCGCTGTCGT TCTCGTCGAT CTTTGTGATG AACCGGCCGG CACAGGGGTT CCAGACTACC
GAAGAGCTCA ATGCCGGCAG GGCCGGCCAC CCTCCGGTCG GCACAGGGCC GTATACATTC
GAAAGCTGGA GTTCCGGCGA GGTGCTGAAG CTCGCCAGGA ACGATGATTA CTGGGGCGGC
AGGCCTGCAT GGCCGCAGGT AACGTTTCGG GTCATCGAAA GCCCGGCTGC CCGCGTGGCG
GCACTCAGCA CCGGCGAAGT CGACCTGGCG GATGCTATTC CCGCACGCGA CGTTGCCTCT
TTGAAGCAGC GCGGCGCCAG GATAGCCAGC GTCGGCGCGG CGCGGATCAA CTTCCTGCAG
TTCGACGTGG AGCGAGACAG GCTTCCCGGC GTGACCGATA AGTCCGGCGA GCCGATCGCC
AATCCGTTCA AGGACGCCTT GGTCCGTCGT GCGCTCGCCA TGGCCACCGA TCGCGGAATT
CTGGTCGACA AGATCCTCTC GGGCTATGGC ACGGCCGCAG CCCAGCTCTT TCCCGGCGGC
TTGCCGGGTA CCTCGGAAAC CTTGCAGCCG GAGGCTCCGA AGTATGACGA AGCCAAGGCG
CTTCTCGCAA AGGCTGGTTT CCCTGACGGT TTCAACCTCA TTCTCGCCGG ACCTGCCGGG
CGTTATCCCG GCGACGGCGA GAGCCTTCAG GCGATTGCGC AAAGCTGGGC CCGCATCGGA
GTAAAGGTGC AGCCGGCGGC GGCGCCGTTT TCGGTTTTCA ATACAAAGCG TGCCGCCGGC
GACTATGCCG TCTGGTACGG CGGCGCTTCC GGCGAAGCGG TGGACATCAT CCTCCACGCT
CTGCTGGCCT CACCGGACCC TGAAAGCGGG AACGGCGCCT TGAACTTCGG GCATTATCGC
AACCAGGCTT TCGACGCGAT GCTCGCAAGG GCGGAAAGCA TCCAGGAGGG CCCTGAGCGC
AACAAGGCGC TCGCCGAAGC GACCGAGTTC GTGATGGCCG ATCAGCCGAT CATACCGCTT
TACCACTTCC ATCACATCGT CGGCTACGGC CCGCGCGTTG CCTCCTATGC GATGCATCCC
CGCGGCTGGA CCACGGCGAT GCAGACGCTT GCCGCGACGG AGTAA
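
A GC content figure such as the 63% listed in the Gene Information fields can be recomputed from a sequence laid out like the one above. The sketch below is one possible way to do that in Python. The helper name `gc_percent` is hypothetical, and it assumes the sequence contains only A, C, G and T once the spaces and line breaks of the 10-base layout are removed.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for layout."""
    seq = "".join(sequence.split()).upper()  # drop spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustration on a toy input, not on the gene itself.
print(gc_percent("ATGC ATGC"))  # 50.0
```

To check the record, paste the full gene sequence into a string and pass it to `gc_percent`, then round the result to the nearest percent.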
 
Protein sequence
MQIQSRFCSH RLAALLISLT FWSGLPGPGH AQEELRIATS YKLMTLDPHY ANLNENTSLL 
SHIYERLVYQ DDRLDLQPGL AVSWRAVSDR QWEFKLRENV RFHDGSPLTA DDVVYTIERI
RDFLKSPSGG FRSYLTGIES VSAPDPLTVV INTKGNIPNL PLSFSSIFVM NRPAQGFQTT
EELNAGRAGH PPVGTGPYTF ESWSSGEVLK LARNDDYWGG RPAWPQVTFR VIESPAARVA
ALSTGEVDLA DAIPARDVAS LKQRGARIAS VGAARINFLQ FDVERDRLPG VTDKSGEPIA
NPFKDALVRR ALAMATDRGI LVDKILSGYG TAAAQLFPGG LPGTSETLQP EAPKYDEAKA
LLAKAGFPDG FNLILAGPAG RYPGDGESLQ AIAQSWARIG VKVQPAAAPF SVFNTKRAAG
DYAVWYGGAS GEAVDIILHA LLASPDPESG NGALNFGHYR NQAFDAMLAR AESIQEGPER
NKALAEATEF VMADQPIIPL YHFHHIVGYG PRVASYAMHP RGWTTAMQTL AATE
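
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. As a hedged illustration, the Python sketch below builds the codon table from its compact 64-letter form and translates a DNA string up to the first stop codon. The codon assignments of translation table 11 are the same as those of the standard table; table 11 differs in its set of permitted start codons, which this sketch does not handle (this gene starts with ATG, so it is not affected). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove layout whitespace
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence shown in this record.
first_line = "ATGCAAATTC AAAGCAGGTT TTGTTCACAT CGGCTTGCCG CATTGCTGAT CTCGCTCACC"
print(translate(first_line))  # MQIQSRFCSHRLAALLISLT
```

The printed residues match the first 20 residues of the protein sequence listed above.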