Gene Smed_4136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4136 
Symbol 
ID5319278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp605247 
End bp606521 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content62% 
IMG OID640775941 
Productextracellular solute-binding protein 
Protein accessionYP_001312874 
Protein GI150376278 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.518326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATTC AGGCGCGTGC CTATCTGCCG CGTATCGCTG CTCTCGCTCT CGCAGGCGCG 
AGCTTTCTGG GCGTGTCTGC GGCGCAGGCC AAGGAAATCA CCATCTGGTG CTGGGACCCG
AACTTCAACG TCGCGATCAT GAAGGAGGCG GGCGACCGCT ACACGAAGAC GCATCCAGAC
GTCACCTTCA ACATCGTCGA CTTCGCCAAG CTCGACGTCG AGCAGAAGCT GCAGACCGGC
CTTTCCTCCG GCACCGCCGA CGCGCTTCCC GACATCGTTC TCATCGAGGA TTACGGCGCG
CAGAAATACC TGCAATCCTT TCCGGGCGCC TTTGCGCCGC TCTCCGGCAC CGTCGATTAC
TCCGGTTTCG CCCCCTACAA GGTCGAGCTG ATGACCCTCG ATGGTGAAGT CTACGGAATG
CCCTTCGATT CCGGCGTCAC CGGGCTCTAT TACCGCAAGG ATTATCTCGA AGCCGCAGGC
TTCAAGCCGG AGGACATGCA GGATCTCACC TGGGATCGTT TCATCGAGAT CGGCAAGCAG
GTCGAGGCAA AGACCGGCAA GAAGATGATG GGCCTCGATC CCAACGACGC CGGCCTCGTC
CGCATCATCA TGCAGTCGGC CGGGCAATGG TATTTCGACA AGGAAGGCAA GCCGAACATC
ACCGGCAACG CGGCGCTGAA GGCAGCCCTC GAAACCATCG GCAAGATCAT GCAGGCCAAT
ATCTACAAGC CTGCCAACGG CTGGTCCGAC TGGGTCGGTA CCTTCACCTC CGGCGATGTC
GCGACCGTCG TCACCGGCGT CTGGATCACC GGCACCGTCA AGGCGCAACC GGACCAGTCC
GGCAACTGGG GCGTCGCCCC CATACCGGCG CTCTCTATCG AAGGCGCCAC GCATGCCTCC
AATCTCGGCG GCTCCAGCTG GTACGTGCTC GAAAGCTCCG AGGAGAAGGC AGAAGCGATC
GATTTCCTGA ACGAGATCTA TGCCAAGGAC ATCGATTTCT ATCAGAAGAT ACTCCAGGAT
CGCGGCGCGG TCGGCTCGCT GCTCGCTGCC CGCGGCGGCG CGGCCTACGA GGCCGCAGAC
CCCTTCTTCG GCGGCGAGAA GGTCTGGCAG AACTTCTCCG AATGGCTGGC GAAGGTTCCC
TCGGTCAATT ACGGCATCTT CACCAATGAG GCGGATCTCG CCGTTACCGC GCAGCTCCCA
GCCGTCACCC AGGGAACGCC CGTCGACGAA GTGCTGAAGG CGATCGAGGC CGAGATCGCC
GGCCAGATCC AGTAA
 
Protein sequence
MDIQARAYLP RIAALALAGA SFLGVSAAQA KEITIWCWDP NFNVAIMKEA GDRYTKTHPD 
VTFNIVDFAK LDVEQKLQTG LSSGTADALP DIVLIEDYGA QKYLQSFPGA FAPLSGTVDY
SGFAPYKVEL MTLDGEVYGM PFDSGVTGLY YRKDYLEAAG FKPEDMQDLT WDRFIEIGKQ
VEAKTGKKMM GLDPNDAGLV RIIMQSAGQW YFDKEGKPNI TGNAALKAAL ETIGKIMQAN
IYKPANGWSD WVGTFTSGDV ATVVTGVWIT GTVKAQPDQS GNWGVAPIPA LSIEGATHAS
NLGGSSWYVL ESSEEKAEAI DFLNEIYAKD IDFYQKILQD RGAVGSLLAA RGGAAYEAAD
PFFGGEKVWQ NFSEWLAKVP SVNYGIFTNE ADLAVTAQLP AVTQGTPVDE VLKAIEAEIA
GQIQ