Gene Smed_5835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5835 
Symbol 
ID5320137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp800292 
End bp801266 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content63% 
IMG OID640777531 
Productputative ABC transporter, periplasmic solute-binding protein 
Protein accessionYP_001314463 
Protein GI150377868 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGGA AATTGGTAGC AGGAACGGCC CTCGGGCTGA CGCTGATGGC CCCGGCCGCC 
AGCGCGGAGG GACTCAATAT CGCCTTCATC AGCCATTCCT CGGCCTCGAA TACATTTTGG
CAGGCGGTGA AGAAGGGCTA TGATGACGCT TGCGAAAAGG TCGGCGCCTC GTGCCAGCTG
ATCCTCACGC AGACGGAAGG CGCAGTCGAA CAGGCCGTGG CCAACCTGCA GGCGGCGATC
GCCTCCAGGC CGGACGCGAT TTTCGTCGCG ATCGTAGACA ACAATGCCTA TGACAACCTG
ATCAAGGAAG CCGTCGATTC CGGCATCCTT GTGCTCGCGG TCAATGGCGA CGACAGCGAA
GGCGCCAAGG GCAACGCCCG TAAGGCCTTC ATCGGCCAGG GTTTCTCCGC TGCCGGCTAC
TCGCTCGCCA AAGCCCAATC GGAAAACTTC CCGAAGGAGG GCCCGCTCAA CCTTGTGGTC
GGCGTCAATG CGCCGGGCCA GACCTGGTCG GAGCAACGCG CCGGCGGGGT TACGAAGTTC
CTCGAGGAGT ATAAGGCCGA GCACTCCGAT CGCGAGATCA ACATTACCCG TGTCGACTCT
GCGACGGACC TCGCGCTGAC GGCCGACCGT ATCGGCGCCT ACCTCAACGC CAATCCGGAT
ACGACCGCCT ATTTCGATAC GGGCTACTGG CATGCCGGTG TTGCAAAAGT ACTGAAGGAT
CGCGGCATCG AGCCCGGCAA GGTGCTGCTC GGCGGATTCG ACCTCGTGCC CGAGGTGCTG
CAGCAGATGC AGGCGGGCTA TGTCCAGGTG CAGGTCGACC AGCAGCCATA CATGCAGGGC
TTCATCCCCG TCATGCAGGC CTATCTCTGG AAGACCGCCG GGCTGACGCC CTCCGACGTC
GATACGGGGC AGGGCATCGT CACTCCTGAG GACGTGCCGA CGATCCTCGA ACTGGCGAAG
CAGGGTCTGC GCTGA
 
Protein sequence
MIRKLVAGTA LGLTLMAPAA SAEGLNIAFI SHSSASNTFW QAVKKGYDDA CEKVGASCQL 
ILTQTEGAVE QAVANLQAAI ASRPDAIFVA IVDNNAYDNL IKEAVDSGIL VLAVNGDDSE
GAKGNARKAF IGQGFSAAGY SLAKAQSENF PKEGPLNLVV GVNAPGQTWS EQRAGGVTKF
LEEYKAEHSD REINITRVDS ATDLALTADR IGAYLNANPD TTAYFDTGYW HAGVAKVLKD
RGIEPGKVLL GGFDLVPEVL QQMQAGYVQV QVDQQPYMQG FIPVMQAYLW KTAGLTPSDV
DTGQGIVTPE DVPTILELAK QGLR