Gene Smed_5796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5796 
Symbol 
ID5320098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp769200 
End bp770780 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content59% 
IMG OID640777500 
Productextracellular solute-binding protein 
Protein accessionYP_001314432 
Protein GI150377837 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00815808 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00391201 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGCGTT TAAACAGGTT TCTCATTTCG GCGCTGACGG TAGCGGCGAT TACCGCGCCG 
GCGCTGTCCA CCTCCGCAAG TGCCGCCACG CTTCGCTGGG GCAGCCGCGC AGACATCTAT
TCGCTCGATC CGGATTCCGT TCCCTCGACA TCCAACCTTG CGTTCCTGAA CCACATCTAT
GAAGGTCTGA TCCGGTATGG ACCGAACTTC GAGATCGAGC CGGCGCTCGC CACCGAGTGG
AAGCTGATCG ACGACAAGCA CTGGCGTTTC ACGCTGCGCA AAGGCGTGAA GTTCCACAAC
GGCGCAGACT TCACCGCAGA CGACGTTGTC GCCTCCATGA ACCGCGTGTC GGACCCGGCC
TCGCCTCTGC GCGGCAACAT CCCGCTCTAT GTCGGCGTAA AGAAGGTGGA CGATTTCACG
GTCGACATCG AGGTTTCGGC GCCGACTGCG CTGTTCCTGA ACGACATGAC CAATATCTTC
ATGTTCAACG CGAAATGGCT GACGGATAAC AAGGCAGAAA AACCGACCGA TATCGCGTCC
AATACCGAGA ACTACGCGAC GCACAACACG AACGGTACGG GCCCGTTCAA GCTTGAGAGC
CGCGTTCCGG ACAGCAAGAC CGTTCTCATC GTCAACGACC TGTGGTGGGA TCAGAAGAAG
CACAATCTGG ACCGGATCGA GTATGTTCCG ATCGCATCGG CGGCGACGCG TGTCGCAGCG
CTTCTTTCCA ACGAAATCGA TCTGGTCGAT TCCGCACCCA TTCAGGACCT TCCTCGCCTG
GAATCCTCGC CTGGTATCAA AGTAAGCAAG CGCACGGAGC TGCGCACCGT GTTCATCGGC
TTCAACGGCA AGGCGAAGCT TGAAGATGGG CGTGCGAACC CGTTCCTCGA CGTTCGCGTT
CGTCAGGCCG TTGACGCAAG CATTGATCGC GATCTCATCA ACAAGAAGAT CATGCGCGGT
CTGGCGCGGC CCTCCGGCTC GCTCATTGCA CCGGAAATCG CCGGCTATGC GAAATCGCTT
GATACCTATC AGCCCGTCGA CACCGAGCTT GCCCAGAAGC TGCTCGCGGA AGCCGGCCAG
GAGGGTCTTG CCTTCACCTA TCTTTGCATG AACGACGAGA GCATCAACGA GGAAGACTTC
TGCTCGGGCA TCGCGAACAT GCTGAAGCGT GCCGGCTTTC AGCCCACAAT CGACATGGGA
CCGCGCGCCG TGCAGCAGCC TAAGCGCACC AATGGCAAGG CCGACATTTT CAACCTGAGC
TGGGCAAACG AACCGACGCT CGATGCCTAT TCGCTGCTCT CTCAAGTCCT CTCCACGCGC
AGCGGTTCGA CGGGCGTTTC CAACTATGGC GGCTGGTCCT ACCCGGAGCT CGACGAGCTG
GTGAAGAAGG CGGCACAGGA ACCTGACACC GCAAAGCGGC TCGCACTCGA AGAACAGGCC
CTGAAAATCG CCAAGGACAA GGCGATCCTG ATCCCGCTCC ACCAGCAGCC GATTGCCTGG
GGCATGCTGG ACACGGTCAA GAGCGTGGAT TTCCGCGCCG ACAACAAGCC GCGCCACTGG
CACACGCAAA TGGCCGAATA A
 
Protein sequence
MSRLNRFLIS ALTVAAITAP ALSTSASAAT LRWGSRADIY SLDPDSVPST SNLAFLNHIY 
EGLIRYGPNF EIEPALATEW KLIDDKHWRF TLRKGVKFHN GADFTADDVV ASMNRVSDPA
SPLRGNIPLY VGVKKVDDFT VDIEVSAPTA LFLNDMTNIF MFNAKWLTDN KAEKPTDIAS
NTENYATHNT NGTGPFKLES RVPDSKTVLI VNDLWWDQKK HNLDRIEYVP IASAATRVAA
LLSNEIDLVD SAPIQDLPRL ESSPGIKVSK RTELRTVFIG FNGKAKLEDG RANPFLDVRV
RQAVDASIDR DLINKKIMRG LARPSGSLIA PEIAGYAKSL DTYQPVDTEL AQKLLAEAGQ
EGLAFTYLCM NDESINEEDF CSGIANMLKR AGFQPTIDMG PRAVQQPKRT NGKADIFNLS
WANEPTLDAY SLLSQVLSTR SGSTGVSNYG GWSYPELDEL VKKAAQEPDT AKRLALEEQA
LKIAKDKAIL IPLHQQPIAW GMLDTVKSVD FRADNKPRHW HTQMAE