Gene Smed_5416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5416 
Symbol 
ID5319718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp380168 
End bp381772 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content63% 
IMG OID640777182 
Productextracellular solute-binding protein 
Protein accessionYP_001314114 
Protein GI150377519 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.604179 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAATAC GAAGCAGGTT TTGTTCACAT TGGCTTGCCG CATTGCTGCT CTCGCTCACC 
TTCTGGAGCG GTCTTCCAGG GTCTGGTCAT GCGCAGGAGG AGCTTCGGAT CGCAACATCG
TACAAGCTGA TGACACTCGA TCCGCATTAC GCAAATCTCA ACGAAAACAC CTCGCTGCTC
TCCCATATCT ACGAGCGTCT GGTCTACCAG GATGACCGTC TCGATCTGAA ACCGGGTCTG
GCCGTCTCCT GGCGCGCGGT GTCGGACAGG CAGTGGGAGT TCAAGCTTCG CGAGAATGTC
CGCTTCCATG ACGGCTCGCC CTTGACGGCG GACGACGTCG TCTACACGAT CGAGCGCATA
CGGGATTTCC TCAAATCCCC GAGCGGCGGC TTCCGGTCCT ATGTGACCGG CATCGAATCG
GTCTCGGCAC CCGATCCTCT CACCGTCGTC ATCGACACCA AGGGCAATAT CCCCAACCTG
CCGCTGTCGT TCTCGTCGAT CTTTGTGATG AACCGGCCGG CACAGGGGTT CCAGACCACC
GAAGAGCTCA ATGCCGGCAG GGCCGGCCAC CCTCCGGTCG GCACAGGGCC GTATACATTC
GAAAGCTGGA GTTCCGGCGA GGTGCTGAAG CTCGCCAGGA ACGATGATTA CTGGGGCGGC
AGGCCTGGCT GGCCGCAGGT CACGTTTCGG GTCATCGAAA GCCCGGCTGC CCGCGTGGCG
GCACTCAGCA CCGGCGAAGT CGACCTGGCG GATGCTATTC CCGCACGCGA CGTTGCCTCT
TTGAAGCAGC GCGGCGCCAG GATAGCCAGC GTCGGCGCGG CGCGGATCAA CTTCCTGCAG
TTCGACGTGG AGCGAGACAG GCTTCCCGGC GTGACCGATA AGTCCGGCGA GCCGATCGCC
AATCCGTTCA AGGACGCCTT GGTCCGTCGT GCGCTCGCCA TGGCCACCGA TCGCGGAATT
CTGGTCGACA AGATCCTCTC GGGCTATGGC ACGGCCGCAG CCCAGCTCTT TCCCGGCGGC
TTGCCGGGTA CCTCGGAAAC CTTGCAGCCG GAGGCTCCGA AGTATGACGA AGCCAAGGCG
CTTCTCGCAA AGGCTGGTTT CCCTGACGGT TTCAACCTCA TTCTCGCCGG ACCTGCCGGG
CGTTATCCCG GCGACGGCGA GAGCCTTCAG GCGATTGCGC AAAGCTGGGC CCGCATCGGA
GTAAAGGTGC AGCCGGCGGC GGCGCCGTTT TCGGTTTTCA ATACAAAGCG TGCCGCCGGC
GACTATGCCG TCTGGTACGG CGGCGCTTCC GGCGAAGCGG TGGACATCAT CCTCCACGCT
CTGCTGGCCT CACCGGACCC TGAAAGCGGG AACGGCGCCT TGAACTTCGG GCATTATCGC
AACCAGGCTT TCGACGCGAT GCTCGCAAGG GCGGAAAGCA TCCAGGAGGG CCCTGAGCGC
AACAAGGCGC TCGCCGAAGC GACCGAGTTC GTGATGGCCG ATCAGCCGAT CATACCGCTT
TACCACTTCC ATCACATCGT CGGCTACGGC CCGCGCGTTG CCTCCTATGC GATGCATCCC
CGCGGCTGGA CCACGGCGAT GCAGACGCTT GCCGCGACGG AGTAA
 
Protein sequence
MQIRSRFCSH WLAALLLSLT FWSGLPGSGH AQEELRIATS YKLMTLDPHY ANLNENTSLL 
SHIYERLVYQ DDRLDLKPGL AVSWRAVSDR QWEFKLRENV RFHDGSPLTA DDVVYTIERI
RDFLKSPSGG FRSYVTGIES VSAPDPLTVV IDTKGNIPNL PLSFSSIFVM NRPAQGFQTT
EELNAGRAGH PPVGTGPYTF ESWSSGEVLK LARNDDYWGG RPGWPQVTFR VIESPAARVA
ALSTGEVDLA DAIPARDVAS LKQRGARIAS VGAARINFLQ FDVERDRLPG VTDKSGEPIA
NPFKDALVRR ALAMATDRGI LVDKILSGYG TAAAQLFPGG LPGTSETLQP EAPKYDEAKA
LLAKAGFPDG FNLILAGPAG RYPGDGESLQ AIAQSWARIG VKVQPAAAPF SVFNTKRAAG
DYAVWYGGAS GEAVDIILHA LLASPDPESG NGALNFGHYR NQAFDAMLAR AESIQEGPER
NKALAEATEF VMADQPIIPL YHFHHIVGYG PRVASYAMHP RGWTTAMQTL AATE