Gene Smed_2857 details


Gene Information

Locus tag: Smed_2857
Symbol:
ID: 5323727
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2983926
End bp: 2985554
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 62%
IMG OID: 640791802
Product: extracellular solute-binding protein
Protein accession: YP_001328522
Protein GI: 150398055
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
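
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 2983926
END_BP = 2985554
GENE_LENGTH_BP = 1629
PROTEIN_LENGTH_AA = 542


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2985554 - 2983926 + 1 gives 1629 bp, and 1629 / 3 - 1 gives 542 aa, matching the listed values.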


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.791512
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.700488
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTCTCA TCAACAGGCG CGGCGCGCTG GGGCTCATCG GCGCAACTGC GGGCAGCATG 
ATCCTGCCGC GCTTCGCCGT GGGTCAGGGC ACGCGCCCAT CCGTCACCAT CGCCGTGCAG
AAGATCACCA TCAACAACAC GCTCGACGTC TGGAACGAGC AGTCGAATGT CGGCGAGCGT
GTGTTCTTCC CCAACCTCTG GGAAGGCCTG ATCCTGCGCA ACTGGATGGG CGATCAGGGT
CCGGTTCCCG GCCTTGCGAC GGAATGGAAG CGCATTGACG ACAAAACGCT CGAGCTGAAG
CTGCGCCAGG GCGTAAAGTT CCATAATGGC GACGAACTCA CGGCCGACGA CGTCGTCTTC
AGCTTCTCGG CGCAGCGCGT GTTCGGGGAC ACCCAGCCTG CCGGCGGCAA GACGGTCTTC
GAGGATGAGC ACAAGCCGGC GACTGCGAAG GAACTGCCTG CGGTCGTGCC GGGTACCGGT
CGCCGCCTGT GGCCGGCGCT GGCCGGTGTC GAGGCAGTCG ACAAGTATAC CGTGCGTTTC
CACAACGCGA CGCCCGACGT CACCATCGAG GGCCGCCTTT ATGCATTCGG CAGCCAGATC
GCAAATCGTC GCGCCTGGGA TGAGGCGCCG ACCTACATGG ATTGGGCACG CAAGCCGATC
ACCACCGGGC CCTACAGGGT CGGCGAACAT AAGCCGGACG TGTCCCTGAC GCTGGTTGCC
TTCGACGACT ATTGGGGAGG CCGCCCGCCA CTCGAGCAAA TCCGTTTCGT TGAGGTTCCG
GAGGTCTCGT CTCGCGTGAA CGGGCTCCTC TCCGGCGAAT ATGATTTCGC CTGCGACCTG
CCGCCGGACC AGATCGCTGC CGTTCAGGCC GCACCGGGCT ACGAGGTTCA GGGCTCGACG
ATCCACAACC ACCGCATCTC GGTCTTCAAT GTCCAGAACC CGACCCTTCA GGATCCGCTC
GTCCGCCGCG CCATGACCCA TTCGGTCGAT CGCCAGGCGA TCGTCGATGC GCTCTGGGCC
GGACAGACGA CCGTCCCCGC TGGCCTGCAA TTTCCGTTCT ACGGGGATAT GTTCGTCGAA
GGCTGGGCGG TCCCTGAATA CGATCCGCAA CTGGCCAAGG ACCTCTTGAA GCAGGCCAAT
TACAAAGGGG ATGCGATCCC GTTCCGCCTT CTCAACAACT ACTATACGAA CCAGACCGCA
AACGGCCAGA TCATGGTCGA GATGTGGAAG CAGGTCGGGC TGAACGTCGA GATTGAAATG
AAGGAGAACT GGGGCCAGAT CCACGACCCG TCGGGCGTCA AGGGCGTGCG CGACTGGTCA
GCCGGTGCAG CCTTCAGCGA CCCCGTCTCC TCGATCGTTG CCCAGTTCGG ACCCAATGGC
GAGGTCCAGC AGAAGAAGGA CTGGTCGAAC GCCGAGGCCA ACCAGATGTC CCAGATCCTC
GAAACGGAAA CCGATCAGGC AAAGCGCAAG AAGGCATTTG CCCGCATGCT CGAAATCTGC
GAGCGCGAAG ACCCGGTCTA TCAGGTGCTT CATCAGAATG CGGTCTTCAC CGGAATGAAA
TCCTCTTTGA AGTGGAAGGC GGCACCCGCC TTCGCCATGG ATTTCCGCGC CGCCAACTGG
TCCAGCTGA
 
Protein sequence
MLLINRRGAL GLIGATAGSM ILPRFAVGQG TRPSVTIAVQ KITINNTLDV WNEQSNVGER 
VFFPNLWEGL ILRNWMGDQG PVPGLATEWK RIDDKTLELK LRQGVKFHNG DELTADDVVF
SFSAQRVFGD TQPAGGKTVF EDEHKPATAK ELPAVVPGTG RRLWPALAGV EAVDKYTVRF
HNATPDVTIE GRLYAFGSQI ANRRAWDEAP TYMDWARKPI TTGPYRVGEH KPDVSLTLVA
FDDYWGGRPP LEQIRFVEVP EVSSRVNGLL SGEYDFACDL PPDQIAAVQA APGYEVQGST
IHNHRISVFN VQNPTLQDPL VRRAMTHSVD RQAIVDALWA GQTTVPAGLQ FPFYGDMFVE
GWAVPEYDPQ LAKDLLKQAN YKGDAIPFRL LNNYYTNQTA NGQIMVEMWK QVGLNVEIEM
KENWGQIHDP SGVKGVRDWS AGAAFSDPVS SIVAQFGPNG EVQQKKDWSN AEANQMSQIL
ETETDQAKRK KAFARMLEIC EREDPVYQVL HQNAVFTGMK SSLKWKAAPA FAMDFRAANW
SS
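
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the method used to produce the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and only the first three and last sequence blocks from above are used.

```python
# Compact codon table: bases in T, C, A, G order, amino acids in the
# matching order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    first_blocks = "ATGCTTCTCA TCAACAGGCG CGGCGCGCTG"  # start of the gene sequence
    last_block = "TCCAGCTGA"                           # final line of the gene sequence
    print(translate(first_blocks))  # MLLINRRGAL
    print(translate(last_block))    # SS*
```

The first ten residues and the final two residues agree with the protein sequence listed above, and the gene ends with the stop codon TGA.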