Gene Smed_2146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2146 
Symbol 
ID5323006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2215395 
End bp2216639 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content58% 
IMG OID640791084 
Productextracellular solute-binding protein 
Protein accessionYP_001327814 
Protein GI150397347 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAAA GAAGTATCAT GCTAGCTGCA CTTATCCTGG CAGCTACATC GTCACTCGCC 
AAAGCCGAGG AGGTTACCCT TTGGGTGCGA ACTTCGTCGG GCGCGGTGCT TCAGGGGCTT
GCGGACAAGT ATAACGCCTC ACACGACGAC AAAGTTGTCG TCACGCAGAT TACGGCCGAG
CAGATGGTTC CGAAACTCGG TGCAGCGATC GCCGGTGGTT CGCCACCGGA CGGTGCAGTG
CTGGACCTGA TTTATCTTCC GACCTTTGCA GCCAGCGACA GCCTGGAGGA CGTCACCGAT
TTCGTCAAAG GACTGCCTTA TGCGGAGGCA CTCAGTCCGT CTCACATCCG TTTGGCGACC
TATGAAGACA GAATCTACGG CCTGCCTGCT CTGCCTGACG CATCCATCAT CGCTTATAAC
ACGGAACTCT TCGAAAAGGC TGGCCTCGAT CCCAAGAAAG CCCCGGCCTC CCTTGCCGAG
ATCGTCGACT ACGCGAAGAA GATACGCGGC ATCGGCGAGG ATACCTACGG TTTCTATTTC
GTCGCGAACT CGGGCAGCTG GCTGATCTAC GATTTTCTGC CTCACATCTG GGCGGCGAAC
GCGGACGTCC TGACGGATGA TGGACGAGAG GCGACCATCG ATACGCCAGC CATGCGCGAA
ACGATAGCTG CCTACCGGGA CATGTGGAGT GCCGGCGCCG TGCATCCGAC GTCCAGATCG
GGCAATGGCA ACAATGCAGT CGAGGCCTTC GCCTCCGGCA AGGTCGGAAT CCTGATGACG
GGTTCGTACA TCGTGAACCT GCTCACCAAC AAATATCCTG ACGTCAAATT TGATGTTGCA
CCGATCCCAG GCCCGAGCGG TGGCGTCTCC AGTTTCGCTG GTGGCGACAC TCTTTCTCTG
ATGAAGGGCA TCAGCGAGGA GAAGAAGAAG GTCCTGCTCG ACTTCGTCGA GTTCTATATG
CAGCCCGAGC AGCAGGTCTA TATCACGAAG GAGTCGGGCA TGCCCTCGCG AACCGACCTC
GCCGGCGAGG CATATGCGCA GTTCGACAAG CGCAATCTGG TCGCCTACGA CATACTGGCC
AACGCCCGCA CGCCTTACAC CTTCTCGTCC GACGAACTGT TCGTCAGCCG CACGGGTCCT
TTCCTTAACC TCATCCAGGG TTCGATCTTT GGCGATGACG TCGACGGCGC GATCGCCAAG
GCTCAGGATG GGTTCAGCAA GATCCTCGAA CGGACCAACC CCTGA
 
Protein sequence
MRKRSIMLAA LILAATSSLA KAEEVTLWVR TSSGAVLQGL ADKYNASHDD KVVVTQITAE 
QMVPKLGAAI AGGSPPDGAV LDLIYLPTFA ASDSLEDVTD FVKGLPYAEA LSPSHIRLAT
YEDRIYGLPA LPDASIIAYN TELFEKAGLD PKKAPASLAE IVDYAKKIRG IGEDTYGFYF
VANSGSWLIY DFLPHIWAAN ADVLTDDGRE ATIDTPAMRE TIAAYRDMWS AGAVHPTSRS
GNGNNAVEAF ASGKVGILMT GSYIVNLLTN KYPDVKFDVA PIPGPSGGVS SFAGGDTLSL
MKGISEEKKK VLLDFVEFYM QPEQQVYITK ESGMPSRTDL AGEAYAQFDK RNLVAYDILA
NARTPYTFSS DELFVSRTGP FLNLIQGSIF GDDVDGAIAK AQDGFSKILE RTNP