Gene Smed_4685 details


Gene Information

Locus tag: Smed_4685
Symbol:
ID: 5319327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1200706
End bp: 1202304
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 59%
IMG OID: 640776483
Product: extracellular solute-binding protein
Protein accession: YP_001313415
Protein GI: 150376819
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
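
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 1200706, End bp 1202304.
length = gene_length_bp(1200706, 1202304)
print(length, protein_length_aa(length))  # prints "1599 532"
```

Under these assumptions the result agrees with the stated gene length of 1599 bp and protein length of 532 aa.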


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.192999
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.292705
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTTAC TCAGACATAA TCTCGCCGGC GCGGCGCTCA TCTGTTCGCT GCTCATGGGG 
GCAAGCCCCG CGCTCGCCCA GGCGGTACTT CACCGCGGCA ATGCCGGCGA GCCGCAGACG
CTCGACCAGG CTCACACCTC CATCAATATC GAGGAGTTCA TTCTCAAGGA CCTCTATGAA
GGCCTGACCA TCTATGATGC CGCGGGAAAG ATCGTACCGG GTGCCGCCGA AACCTGGGAG
CTTTCGGATG ATGGTACCGT CTATACCTTC AAACTGCGTG CCGATGCCAA GTGGTCTGAC
GGCTCACCGG TGACGGCAGA AGATTTCGCC TTCTCCCTCC GTCGGGTGGA AGATCCGAAG
ACGGCGGCCG AGTACGCCAA TATCCTGTTC CCGATCAAGA ACGCGGAAAA GGTCAACAAG
GGCGAACTGC CGGTCGACCA GCTCGGCGTG AAGGCCGTCG ACGAGAAGAC GCTCGAAGTC
ACCCTCGAAC GTCCGACGCC TTTCTTCCTG GAACTGCTCG CACATCAGAC GGCTCTTCCG
GTCAGCAAGG CCAACGTCGA GAAGAACGGT GCCGACTTCG TGAAGCCGGG CGTGATGGTT
TCGAACGGGG CCTTCAAACT GGCGTCACAT GTGCCGAACG ACAGTCTGAC CGTGGAGAAG
AACACGAACT ACTGGGATGC CGCCAACGTC AAGCTCGACA AGGTGATCTT CTATCCGATC
GATGATCAGG CCGCCTCGGT GCGCCGTTTC GAAGCGAAGG AAATGGACCT CGCCTATAAC
TTCTCGGCCG ACCAGATCGA CCGCCTGCGT AAATCCTATG GTGAACAGGT GCACGTTTCT
CCGACGCTTG CGACCTACTA CTACGCTTTC GACACGCGCC AGGAGCCCTA CAACGATGTC
CGGGTCCGCC GGGCACTCTC TATGGCGGTT GACCGCGACT TCCTTGCCAA GGAAATCTAC
AGCGGCTCGC AGCTGCCGTC CTATTCGATG GTGCCGCCAG GCATCGAGAG CTACGGAGAT
CCCGCCAAGG CCGATTTCGG GGACATGTCG CAACTCGACC GCGAGGACAA GGCGATCGAG
TTGATGAAGG AAGCCGGCTA CGGCGAAGGC GGCAAGCCGC TCAACATCGA AATCCGCTAC
AACACCAACC CCAACCATGA GCGTGTCGCG ACCGCGGTTG CCGACATGTG GAAGAACACC
TTCGGTGCCA AGGTCTCGCT GGTGAATCTC GATGTGTCGT CCCACTACGC CTATCTGCAG
GAAGGCGGCA AGTTCAACGT CGCGCGCGCA GGCTGGGTCG CCGATTACGC CGATGCCGAG
AACTTTCTGG CGCTGAGCCT CAGCACCAAC AAGACGTTCA ATTACGGCCA CTTCGAAAAT
GCGGAATTCG ACGCGTTGAT GAAGAAATCC TATGAAGAAC AGGATCCTGC AGCACGTTCG
AAAATCCTGC ATGAGGCCGA AACGCTGCTG ATGAAGGAAC AGCCGATAGC GCCCCTCCTG
ACCCAGGCCG ACCTCTGGCT CGTTTCGGAA CGGGTCCAGG GTTGGGTGGA CAATGCGCCG
AACGCTCACC TGAGCAAGTT CCTGAGCATC GCCGAGTAA
 
Protein sequence
MALLRHNLAG AALICSLLMG ASPALAQAVL HRGNAGEPQT LDQAHTSINI EEFILKDLYE 
GLTIYDAAGK IVPGAAETWE LSDDGTVYTF KLRADAKWSD GSPVTAEDFA FSLRRVEDPK
TAAEYANILF PIKNAEKVNK GELPVDQLGV KAVDEKTLEV TLERPTPFFL ELLAHQTALP
VSKANVEKNG ADFVKPGVMV SNGAFKLASH VPNDSLTVEK NTNYWDAANV KLDKVIFYPI
DDQAASVRRF EAKEMDLAYN FSADQIDRLR KSYGEQVHVS PTLATYYYAF DTRQEPYNDV
RVRRALSMAV DRDFLAKEIY SGSQLPSYSM VPPGIESYGD PAKADFGDMS QLDREDKAIE
LMKEAGYGEG GKPLNIEIRY NTNPNHERVA TAVADMWKNT FGAKVSLVNL DVSSHYAYLQ
EGGKFNVARA GWVADYADAE NFLALSLSTN KTFNYGHFEN AEFDALMKKS YEEQDPAARS
KILHEAETLL MKEQPIAPLL TQADLWLVSE RVQGWVDNAP NAHLSKFLSI AE
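
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that, along with a GC-fraction calculation and a basic coding-sequence sanity check. The helper names are hypothetical. The check accepts only ATG as a start codon, which is a simplification, because translation table 11 also permits alternative start codons.

```python
# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: ATG start, length divisible by 3, stop codon at the end."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Usage with the first and last rows of the gene sequence shown above.
first_row = clean_sequence("ATGGCTTTAC TCAGACATAA TCTCGCCGGC")
last_row = clean_sequence("AACGCTCACC TGAGCAAGTT CCTGAGCATC GCCGAGTAA")
print(first_row.startswith("ATG"), last_row[-3:] in STOP_CODONS)
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content reported in the record.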