Gene Smed_1547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1547 
Symbol 
ID5322405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1639842 
End bp1641026 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content63% 
IMG OID640790492 
Productmajor facilitator transporter 
Protein accessionYP_001327224 
Protein GI150396757 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.197507 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTTT TCCGGTCCGT CCTCGTCCTC TCCATCACTC AAATCATCGC CTGGGGAGCG 
ATGTTCATGT TCGTCTCGGT GACGGCGGCC GGGATGGCCG ATGATCTTCT ACTGCGACCT
TCGACCATCT ATCTCGGCCC GACGGTCATG CTGGTCGCGA TGGCCCTCTG TTCGCCGGTG
ATGGCGCCGA TCTACGCTCG CTGGGGTGCA AGGCTCGTGC TCGCATTCGG TTCGGCGGCT
GCAGCTCCCG GGCTGTGGTT GCTTGCCGGC GCCGAAGGCC CGGTTTCCTA TTTTTCAGCC
TGGGCCATCC TGGGTCTGGC CGGAGCGGCG GCACTTACGA CTTCGGCTCA GGTGTTTTTG
ACGGAGATAG CGGGGGAGCG TGCCCGCCGG GCAATCGGCG CACAGATGCT CGCCATGGCG
CTTGCACCGA CGATCGCATG GCCAGTCACA ACCATCTGCG AAGCGACTTT CGGCTGGCGC
GGCACATTCG TTCTCTATGG CGCTGTGATG CTGCTCGTCT GCACGCCTTT TCACCTCTTC
GGATTGCCGA GAACCGAGCC GGTAAAGCGC AACTCTTCGG TATCGAATTT CAAGCGCTTT
AGCGCGTCGG AACTGGCGCG CCGCTGGCGC ATCGTTGCCC TGATCACGGC GGCAGTCGCG
CTCAACGGCT TTGTCACCTG GGGCTTCCAG CTCGTCGTGA TAGACCTTTT TCGCAGCTTC
GCCGTGCCTG GTACCCTGGC AGTCGGCTTC GGATCTGCCA TAGGCTTCCT CCAGCTTTCG
GCGCGCCTGT TCGATTTTCT CGGCGGCAAT CGCTGGGACG GATTGACGAC GGGATTGGTA
GCCGCGGCGA TGATGCCGCC GGCATTGCTG GTGCTGGCGC TGGGCGAGGG GGCGGAATGG
TCCATCGTGC TTTTCCTCGT GCTCTACGGC CTTTCAAGCG GAGCAATGTC TGTCAGCCGG
GCGACGATGC CGTTGGTCTT CTTCTCGTCG GCGGAATACG GGACGGTCGT GGCGCGCCTC
GGCCTGCCGC TCAACCTCGC TTTCGCGGCG GCCCCGCCGT TCTTCTCCTT CCTGCTTGGC
GAGGCAGGCA ACAGGTGGGC GCTGACCTTC GCGCTTCTCT GCTCCCTCGG TGCATTGGCC
AGCATGGCTC TGCTCGCGCG CATGAGGCCT GCGAAGTCGG GTTAG
 
Protein sequence
MPVFRSVLVL SITQIIAWGA MFMFVSVTAA GMADDLLLRP STIYLGPTVM LVAMALCSPV 
MAPIYARWGA RLVLAFGSAA AAPGLWLLAG AEGPVSYFSA WAILGLAGAA ALTTSAQVFL
TEIAGERARR AIGAQMLAMA LAPTIAWPVT TICEATFGWR GTFVLYGAVM LLVCTPFHLF
GLPRTEPVKR NSSVSNFKRF SASELARRWR IVALITAAVA LNGFVTWGFQ LVVIDLFRSF
AVPGTLAVGF GSAIGFLQLS ARLFDFLGGN RWDGLTTGLV AAAMMPPALL VLALGEGAEW
SIVLFLVLYG LSSGAMSVSR ATMPLVFFSS AEYGTVVARL GLPLNLAFAA APPFFSFLLG
EAGNRWALTF ALLCSLGALA SMALLARMRP AKSG