Gene Smed_3591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3591 
Symbol 
ID5318971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp19283 
End bp20548 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content59% 
IMG OID640775406 
Productextracellular solute-binding protein 
Protein accessionYP_001312339 
Protein GI150375743 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.317951 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCA TCCGGAAATA TGCCGTCGCA ACGACGGTCG CAGCGATGCT GGCCTGCACG 
GCCCTGCCGG TCACCGCCAA GGCGGAAGTG CTCAAATTCG TCTCGTGGCA GAAGGACGAG
AAGGGCATCG GCGACTGGTG GGGAACGGTG GTCAAGGAGT GGGAGGCAAA GCACCCCGGT
AATACGATCG AATGGACCAA GGTGGAACGC AGCGCCTATG CCGACACCAT GACGACGCTC
TTTGCAGGCG GCACGCCGCC GGACATCGTC CACCTGGCCT CCTTCGAATT CCAGACCTTT
GCGAACAATG GCTGGCTCGA GGACCTTGGT CCCTGGGTCG AGAAGTCGGG GCTCAATCTC
GATGGATGGA GCGGGCAGGA CATCTGCAAT TTCCAGGACA CGACCGTGTG CATCATGATG
CTCTATTACG GCACGATCTT CGGCTATAAC GAGGAGATGC TGAAGCAGGC GGGCGTCGCG
GTCCCCACCA ATTACGAGGA GTTCCTCGCA GCGGCCCGCG CGACCACCAA GGACCTGAAT
GGCGACGGCA TCGTCGACCA ATTCGGAACC GGCCACGAGA CCAAAGGTGG CGGCGGGCAG
TATATCGCCG AGATGGCGAG CTATCTTTTC GATGCCGGCG CACGCTTCAC CAATGCAGAA
GGCAAGGTGA CGATCGACAC CCCCGAAATG GTCGAGGGCC TGACCCGCTG GAAAACCGTG
GTCAAGGAAA GCCTGACCCC GCGCGACCTC TCGGCGGGCG AGGTCCGGAA ACTCTTCGCC
GATGGAAAGA TCGCTTTAAA GGTCGACGGT CCCTGGATCT ATTCCATCAT GCAGCAGGGA
GCGGCAAAGG ATAAGCTGAA GCTTGCCAGC GTTCCCTTCG ACCCGCCGCT GGGAGGGTCA
TCCAACATTC TCGCGATGCC GAGCGAGATT TCCGATGAGA AGAAGCAGCT TGTCTGGGAT
TTCATCGCAA TTGCGACCTC CGACAAATTC CAGACCAGCT TCGCGACGCT TGCCGCCTCG
ACTCCGCCGA GCCCGCGCGC CGATCTCACC GAAGCCAAGG CGCAGATTCC ACATTTCGAT
CTGATGGCGA AGTCGCAGAA GGCTGCGGCA GAGCACAAGA TCGACCGCAT TCCGACCGGA
CTCGAGATCC AGTTCAACGA GTTCTCGAAA ATGATTCAGG AGGAGGCGCA GAGAATGATC
ATAGAAGATC TCGATCCTGC AGCCGTCGCC AAGACGATGC ACGAGAAGGC CGAGGCGCTT
CAGTAG
 
Protein sequence
MKIIRKYAVA TTVAAMLACT ALPVTAKAEV LKFVSWQKDE KGIGDWWGTV VKEWEAKHPG 
NTIEWTKVER SAYADTMTTL FAGGTPPDIV HLASFEFQTF ANNGWLEDLG PWVEKSGLNL
DGWSGQDICN FQDTTVCIMM LYYGTIFGYN EEMLKQAGVA VPTNYEEFLA AARATTKDLN
GDGIVDQFGT GHETKGGGGQ YIAEMASYLF DAGARFTNAE GKVTIDTPEM VEGLTRWKTV
VKESLTPRDL SAGEVRKLFA DGKIALKVDG PWIYSIMQQG AAKDKLKLAS VPFDPPLGGS
SNILAMPSEI SDEKKQLVWD FIAIATSDKF QTSFATLAAS TPPSPRADLT EAKAQIPHFD
LMAKSQKAAA EHKIDRIPTG LEIQFNEFSK MIQEEAQRMI IEDLDPAAVA KTMHEKAEAL
Q