Gene Smed_0186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0186 
Symbol 
ID5321016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp205222 
End bp206415 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content62% 
IMG OID640789119 
Productextracellular solute-binding protein 
Protein accessionYP_001325880 
Protein GI150395413 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCT GGAAAATAGG CACCGCTCTC GCGGCATCGC TTCTGGCAAG CACCGCATCG 
GCGGAAACCG TTCGTTTCTG GTATCACTTC GACAATCCGG AAAACCCGAT GGCCGATCTG
ATCGCGAAAT TCGAAGCGGC CAATCCGGGT ATCGAGATCG AAGCGGAAAA CGTTCCGTGG
AACAGCTACT ACGATAATCT CTACACCGCG CTCGTCGGCG GCAACGCGCC GGACGCCGCG
ATGGTCAAGC TCTTCGCCCA GCCGCGCCTC ATCGAAATGG GCGCGCTGGA GCCGCTCGGC
GAGCGCATTG ACGGCTGGGC CGGCAAGGCG GACCTGCTCG ACAACCTCCT CGACCTCAAC
AAGGGGTCGG ACGGTCAGCA GTACTACCTG CCGATCCAGT ATGTCGTGCT TTATCTTTAC
TACCGCGCCG ACCTGTTCGA CGCCGCCGGC CTGAAGCCGC CGGCGACCTG CGACGCGTTC
CGCGACGCGG CGATCAAGCT CACCAAGCAG CCGGCGACCT ACGGCTTCGG CCTGCGCGGC
GGCAAGGGTG GCTGGGACCA GTGGGGGGCC TTCGTGCTGT CGCAGGGCGC GAAGCTTGAG
CCGGGCGGTC TGACGACGCC GCAGGCGATC GCTGCCAACC AGTGGCTGAT CGATCTGTTC
CAGAAGGACA AGGTCATTCC GCCCTCGGCA CCGAATGACG GCTTCCAGGA AATCACCGCC
GCCTTCAAGA AAGGGACCAC GGCCATGACC ATTCATCATG TCGGCTCGTC GAACGACATG
GTCAAGGCAC TCGGTGACAA GGTCTCGGCG GTGCCGTTGC CGGAATGCGG CGGCGGCCGC
TGGACGTCCT ATGGCGACGA GTCGTTGGCT ATCTTCTCCT CCTCGGAGGT GAAGGATTCC
GCGTGGAAGT GGATCTCGTT TCTTGCCGAG GGCGAGAACA ACGTCGCCTT CAACAAGGCG
ACCGGGCAAA TGACGGTGAC CAAGAGCGGT TCGGAAAATT GGACGCTGCA TGAGCGCCGC
TTTGTCGATG CGACGGTACA ATCGCTGCCC TTCGCCCATG TGCTGCCGCA GAACACCGCG
ACGTCCGAGT TCGTCAACAC GGCCTGGCAA ACGGCCATGC AACAGGCGCT GACGGGCCAG
ATCACCTCCG AAGAGATGAT GAAGCAGCTC GAAGCTCTTT TCGTGCAGCA ATGA
 
Protein sequence
MKIWKIGTAL AASLLASTAS AETVRFWYHF DNPENPMADL IAKFEAANPG IEIEAENVPW 
NSYYDNLYTA LVGGNAPDAA MVKLFAQPRL IEMGALEPLG ERIDGWAGKA DLLDNLLDLN
KGSDGQQYYL PIQYVVLYLY YRADLFDAAG LKPPATCDAF RDAAIKLTKQ PATYGFGLRG
GKGGWDQWGA FVLSQGAKLE PGGLTTPQAI AANQWLIDLF QKDKVIPPSA PNDGFQEITA
AFKKGTTAMT IHHVGSSNDM VKALGDKVSA VPLPECGGGR WTSYGDESLA IFSSSEVKDS
AWKWISFLAE GENNVAFNKA TGQMTVTKSG SENWTLHERR FVDATVQSLP FAHVLPQNTA
TSEFVNTAWQ TAMQQALTGQ ITSEEMMKQL EALFVQQ