Gene Smed_3405 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3405 
Symbol 
ID5324289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3612265 
End bp3613524 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content60% 
IMG OID640792356 
Productextracellular solute-binding protein 
Protein accessionYP_001329061 
Protein GI150398594 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGCA CGACAATGAA AGGCCTACTG CTTGCATCGA GCATTCTCGG ATCGGCCGGG 
CTAGTACAAG CCCAGGACGC AACGCTGACG ATCGAAAGCT GGCGCAATGA CGACCTGGCG
ATCTGGCAGG AAAAGCTGAT CCCGGCCTTC GAGGCGAAGA ACCCCGGCAT CAAGGTGGTG
TTCGCTCCCT CGGCCCCGAC TGAATACAAT GCCGCGCTCA ACGCCAAGCT CGACGCCGGT
TCCGCGGGCG ACCTCATCAC ATGTCGGCCC TTCGATGCCT CACTCGAACT CTACAATAAG
AAACATCTCG CCGACCTGAC CGGTCTTTCC GGCATGGAGA ACTTCTCGGA CGTCGCCAAG
TCCGCCTGGA CGACAGACGA CGGCAAGGCG ACTTTCTGCG TACCGATGGC CTCCGTCATC
CACGGCTTCA TCTACAACAA GGATGCCTTC GACCAGCTCG GGCTCGCCAT TCCGGCGACG
GAAGAGGAAT TCTTCGCGGT TCTCGAAAAG ATCAAGGCGG ACGGCAACTA CATTCCAATG
GCGATGGGCA CCAAGGATCT CTGGGAAGCC GCGACGATGG GCTACCAGAA CATCGGCCCG
ACCTACTGGA AGGGCGAGGA AGGCCGTCTG GCTCTTCTGA AAGGCCAGCA GAAGCTCACC
GACGAACCAT GGGTGGAACC TTTCCGCGTA CTGGCAAAGT GGAAGGATTA TCTCGGCGAC
GGTTTCGAGG CACAGACCTA TCCGGACAGC CAGAACCTCT TCACGCTCGG CCGTGCGGCC
ATCTATCCGG CCGGCTCCTG GGAAATCTCG GGCTTCAACA CGCAGGCCGA ATTCAAGATG
GGGGCCTTTC CGCCGCCGGT GAAGAAGGCC GGCGACACCT GCTACATTTC CGACCATAAC
GACATCGGCA TCGGGCTCAA TGCCAAGAGC AAGAATGCCG ATGCCGCCAA AACCTTCCTC
ACCTGGGTCG CCTCGCCGGA ATTCGCGGAA ATCTATGCGA ACGCCCTGCC CGGCTTCTTC
AGCCTGAATT CGACGGCGGT AAAGATGTCC GATCCGCTCG CCCAGGAATT CGTCTCCTGG
CGGGAGAAAT GCAAGCCGAC CATCCGCTCG ACCTATCAGA TCCTGTCGCG CGGAACCCCG
AACCTCGAGA ATGAGACCTG GGTCATGTCG GCCAACGTCA TCAACGGCAC AGACACGCCG
GAGGCGGCCG CCAAGAAGCT CCAGGACGGG CTCGACAGCT GGTTCAAGCC GGTAAAATAA
 
Protein sequence
MTRTTMKGLL LASSILGSAG LVQAQDATLT IESWRNDDLA IWQEKLIPAF EAKNPGIKVV 
FAPSAPTEYN AALNAKLDAG SAGDLITCRP FDASLELYNK KHLADLTGLS GMENFSDVAK
SAWTTDDGKA TFCVPMASVI HGFIYNKDAF DQLGLAIPAT EEEFFAVLEK IKADGNYIPM
AMGTKDLWEA ATMGYQNIGP TYWKGEEGRL ALLKGQQKLT DEPWVEPFRV LAKWKDYLGD
GFEAQTYPDS QNLFTLGRAA IYPAGSWEIS GFNTQAEFKM GAFPPPVKKA GDTCYISDHN
DIGIGLNAKS KNADAAKTFL TWVASPEFAE IYANALPGFF SLNSTAVKMS DPLAQEFVSW
REKCKPTIRS TYQILSRGTP NLENETWVMS ANVINGTDTP EAAAKKLQDG LDSWFKPVK