Gene Smed_2950 details


Gene Information

Locus tag: Smed_2950
Symbol:
ID: 5323827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3098606
End bp: 3099913
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 62%
IMG OID: 640791901
Product: extracellular solute-binding protein
Protein accession: YP_001328614
Protein GI: 150398147
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
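
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. Both assumptions fit the values in this record but are not stated by it. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(3098606, 3099913)
print(length)                     # 1308
print(protein_length_aa(length))  # 435
```

Both results match the Gene Length and Protein Length fields of the record.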


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.110397
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGATGT TTGAACGCGG GTTGGCGTCC ATGCACGCCG CTAAACTAAC GGCGCTTGCC 
GCAGCCGCCG GCATGACGCT TCTCTTCGGT CCCGAAACGG CATCGGCCGA AACCGTCGTA
AAATGGCTGC ATCTGGAGAC GGTTCCCGCC TATCTGAAGC AATGGGAGGA CATCGCCGCC
AAGTACGAAA CCGAACATCC CGGCGTGGAT GTTCAGCTCC AATTCCTGGA AAACGAGGCT
TTCAAGGCGA AGTTGCCGAC GCTGCTGCAA TCGGACGACG CTCCCCATTT CTTCTACAGC
TGGGGCGGCG GAGTGCTGAA GCAGCAGGCC GAGACCGGCG CACTCAAGGA CCTGACGGAA
GCAATGCGTG CCGATGGCGG CGCCTGGGAG AAGAGCTACA ACCCGGCGGC AGTCAAGGGC
TTCACCTTTG AGGATCGAAT TTATGCGGTT CCCTTCAAAA TGGGAACGAT CAGCTTCTTC
TACAATAAGG AGCTGTTCCA GAAGGGCGGC GTCAAGGCCG AGGACATCAA GAGCTGGGAT
GATTTCCTCA CAGCGGTGAA AACGCTGAAG GAGGCCGGGA TCACGCCGAT CGCCGGCGGC
GGCGGGGACA AATGGCCGCT GCACTTCTAC TGGAGCTATC TCGTGATGCG CAATGGCGGC
CAGCAGGTAT TCGAGGATGC CAAGAACAAC GAGGGCGAGG GTTTCTTGCA CCCGGCGATC
CTAAAGGCCG GTGAACAACT CGCCGAACTC GGCAAGCTCG AGCCGTTCCA GGGCGGCTAT
CTCGGGGCGA ACTGGCCGCA AACGCTCGGC CTGTTCGGTG ACGGCAAGGC GGCGATGATC
CTGAGCTTCG AAACCACCGA AGCCACCCAG CGCGCCAATT CCGGCGACGG CAAGGGCCTG
GCACCCGAGA ACATCGGTCG CTTCCCCTTC CCGGCCGTCG AGGGCGGGGC AGGTGCGGCG
ACCGATACGC TCGGAGGCCT CAACGGCTGG GCGGTAACCA AGAATGCCCC GCCTGAGGCG
CTCGATTTCC TGCGCTATCT CACCAATGCT GAGAACGAGA GGCTTATGGC AAGTACCGGC
ATGATCGTAC CGGTGGCCGT GGGCGCGGAA GAGGGCATCA CCAACCCGCT GGTGCGTGCT
TCGGCCGACC AGCTTGCGGC CTCGACATGG CACCAGAACT ATTTCGACCA GGATCTCGGC
CCCTCGGTCG GCCGCGTCGT GAACGACGCA TCCGTCGAGA TCCTCTCCGG GCAGATGTCC
TCCGAGGAGG GAGCCCGGAT GATCCAGGAC GCACGCGAGC TGGAATGA
 
Protein sequence
MMMFERGLAS MHAAKLTALA AAAGMTLLFG PETASAETVV KWLHLETVPA YLKQWEDIAA 
KYETEHPGVD VQLQFLENEA FKAKLPTLLQ SDDAPHFFYS WGGGVLKQQA ETGALKDLTE
AMRADGGAWE KSYNPAAVKG FTFEDRIYAV PFKMGTISFF YNKELFQKGG VKAEDIKSWD
DFLTAVKTLK EAGITPIAGG GGDKWPLHFY WSYLVMRNGG QQVFEDAKNN EGEGFLHPAI
LKAGEQLAEL GKLEPFQGGY LGANWPQTLG LFGDGKAAMI LSFETTEATQ RANSGDGKGL
APENIGRFPF PAVEGGAGAA TDTLGGLNGW AVTKNAPPEA LDFLRYLTNA ENERLMASTG
MIVPVAVGAE EGITNPLVRA SADQLAASTW HQNYFDQDLG PSVGRVVNDA SVEILSGQMS
SEEGARMIQD ARELE