Gene Smed_4739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4739 
Symbol 
ID5319107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1260715 
End bp1262208 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content63% 
IMG OID640776537 
ProductABC transporter related 
Protein accessionYP_001313469 
Protein GI150376873 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.383875 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00325994 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTCTGT CCATGCACGG CATCTGCAAA TCCTTCAACG GCATTCCCGC GCTTCGCTCG 
GCTTCACTTG AAGTCGGCGA AGCCGAGGTG ATGGCGCTCG TTGGTCAGAA CGGCGCAGGA
AAGTCGACGC TTATCAAGAT ACTGACCGGC GCGTACCGGC GCGACGAGGG TTCGATCGCC
TTTGCGGGAG AGGACGTTTC CTTCAACATG CCGGCCGAGA GCCAGGCCAG GGGGATTGCA
ACGATCTACC AGGAGATCAA CCTCGCCCCG CAACGCTCCG TTGCCGAGAA CATCTATCTT
TCGCGCGAAC CCCGGCGCTT CGGTTTGATC GATAGGCGTG CAATGCGCGA AGGCGCGGTC
GCCGTCCTGC GGACCTTCAA CCTGGAGATC GACGTCGATC AGCCGGTCGC CCACTTCAGC
GCCGCGACCC GGCAGATGGT CGCGATTGCG CGCGCCGTCA CTCAGAAGGC ACGTCTCGTT
ATCATGGACG AGCCTACCTC TTCCCTTGAC GAGCGCGAGG TCGCCATCCT TTTCGAAACG
ATCAGAACGC TCAAACGCGG CGGCGTATCC GTCGTCTTCA TCGGCCATCG TCTGGACGAG
CTCTATCGGA TTTGCGACAG CGTTACGATC ATGCGGGACG GGAAGACGGT GGCGACGGGC
GCGATGGCGG AGATGCCGAA GCTCGCACTG GTGCGCCATA TGCTGGGGAA GGAGCTCGCT
GCCTTCGAAG CGATTGCCAA GGATGCGGAT GAGGGCGCGC AGCGGCCGGT GCGCCTATCG
GTCGAGAATG CCGGGGCCGG CGTTCGGGTG CGGAATGTCA GCCTGACGGT GCGCGAAGGC
GAGATCTCGG GGCTTGCCGG TCTGCTCGGC TCCGGCCGAA CCGAAACGGC CAATCTGATT
TTCGGCGCCG ACCGGCTTGA GCGCGGGGAA ATTCGCTACA AAGGCCAAGC GCGATCCTAT
CGTCAGCCCG CGGAAGCCAT TGCGGACGGC ATCGGTCTCG TTTCCGAGGA TCGGAAGGTC
GACGGCATCA TTCCGGATAT GAGCATCCGG GAAAACATGA CGCTCGCGCT TCTGCCGAAG
CTCGCCCGTA GCGGCATTGT CGATCGCGCG CGCCAGGATG AGATCGTCGC GAGCTACATC
GCCGCGCTTG GGATCAGATG CACTTCGCCC GACCAGCCTA TCAAGGAACT CTCTGGGGGC
AACCAGCAGA AGGTGCTGCT CGGACGATGG CTCTGCACCG ATCCGAAACT CCTGATCGTC
GACGAACCGA CTCGCGGCAT CGATATCGGC GCCAAAGCCG AAATTCTCCG CCTTTTGCGC
AGGTTAGCGG ACGAGGGTTT GGGCGTGCTG ATGATTTCGT CGGAGCTCGA GGAATTGCTC
GCGGCAGCCG ACCGGGTAAC CGTCCTCAGC GATGGAACCT CGGTGGCGGT TCTGCCGCGC
AGGGAGTTGA GCGAGGCAGC GCTCTTTGCC GCCATGGCGC ATCAGGTGGA GTAG
 
Protein sequence
MLLSMHGICK SFNGIPALRS ASLEVGEAEV MALVGQNGAG KSTLIKILTG AYRRDEGSIA 
FAGEDVSFNM PAESQARGIA TIYQEINLAP QRSVAENIYL SREPRRFGLI DRRAMREGAV
AVLRTFNLEI DVDQPVAHFS AATRQMVAIA RAVTQKARLV IMDEPTSSLD EREVAILFET
IRTLKRGGVS VVFIGHRLDE LYRICDSVTI MRDGKTVATG AMAEMPKLAL VRHMLGKELA
AFEAIAKDAD EGAQRPVRLS VENAGAGVRV RNVSLTVREG EISGLAGLLG SGRTETANLI
FGADRLERGE IRYKGQARSY RQPAEAIADG IGLVSEDRKV DGIIPDMSIR ENMTLALLPK
LARSGIVDRA RQDEIVASYI AALGIRCTSP DQPIKELSGG NQQKVLLGRW LCTDPKLLIV
DEPTRGIDIG AKAEILRLLR RLADEGLGVL MISSELEELL AAADRVTVLS DGTSVAVLPR
RELSEAALFA AMAHQVE