Gene Smed_1262 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1262 
Symbol 
ID5322109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1355444 
End bp1356709 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content60% 
IMG OID640790203 
Productmajor facilitator transporter 
Protein accessionYP_001326947 
Protein GI150396480 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0631224 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACC AGAAATTTGT GAAGCGCGGT CTTTTCCTCG TCTTTATGAT CCTCTTCCTC 
GATATTATGG GTATCGCCAT CATTGTCCCT GTATTGCCGA GCTATCTCGA GGAATTGACG
GGGGCGGATG TTGGTGAAGC GGCAATCGAC GGCGGCTGGC TGCTGCTCGT TTATTCGGCG
ATGCAGTTCT TCTTCGCGCC GCTCATCGGG AATCTGAGCG ACCGATTCGG GCGCCGGCCG
ATCCTTTTGG CCTCCGTGCT GACCTTCGCT ATCGACAACC TGATCTGCGC GCTTGCGACG
AGCTACTGGA TGCTTTTCAT CGGGCGCAGC CTTGCCGGGA TCAGCGGCGC GAGTTTCGGC
ACCGCGTCCG CTTACATCGC CGATGTCAGC AATGACGAAA ACCGCGCCAA GAACTTCGGG
CTGATCGGCA TTGCCTTCGG AACCGGTTTT GCACTGGGCC CGGTCATAGG AGGTGTCCTT
GGGGAACTGG GACCAAGGGT GCCCTTTTAT GGTGCCGCAG CGCTGTCGTT CCTCAATTTC
GTCATGGGTG CCTTTCTCCT GCCGGAAACG CTCGATCCGG CCAATCGGCG TCGCTTCGAA
TGGCGCCGGG CCAATCCGTT CGGCGCGCTC AAGCAGATGC GTCATTACCC GGGTATCGGC
TGGGTGGGCC TCGTCTTCTT CCTCTACTGG CTGGCGCACG CCGTCTACCC CGCCGTCTGG
TCCTTCGTCG CTTCCTATCG CTATGGGTGG AGCGAAGGGC AGATCGGGCT TTCATTGGGG
ATTTTCGGCG TCGGCGGCGC AATCGTCATG GCGCTGGTAC TGCCGCGCGT CGTCCCGGCA
TTGGGAGAAA GGCGAACGGC CGCGCTCGGG CTTACATTCA CGGCTCTCGG AATGGCCGGC
TATGCTGCTG CCTGGGAAGG CTGGATGGTA TATGCTGTGA TCGTCGCGAC CGCGCTTGAA
AGCCTTGCCG ATCCGCCGCT CCGCAGCATA GCGTCGGTCC ATGTCCCGCC CTCCGCACAG
GGCGAATTGC AAGGCGCGCT CACCAGTATA TCGAGCATGA CGACGATCAT CGGACCTCTG
ATGTTTACGC AGATTTTTGC CTATTTCACT AACCCCGCCG CCACCTATGC CTTTTCCGGG
GCTCCCTATG CGGTGGCCGG CTGCCTGATC ATCGCCGCAT TGCTGATATT CCTTGCCAAG
GTCCGCACCG GACGGGGTGG CACCTATCGG GATGGGGTGC CCGGCATAGC GGAGGCATCA
TCCTGA
 
Protein sequence
MIDQKFVKRG LFLVFMILFL DIMGIAIIVP VLPSYLEELT GADVGEAAID GGWLLLVYSA 
MQFFFAPLIG NLSDRFGRRP ILLASVLTFA IDNLICALAT SYWMLFIGRS LAGISGASFG
TASAYIADVS NDENRAKNFG LIGIAFGTGF ALGPVIGGVL GELGPRVPFY GAAALSFLNF
VMGAFLLPET LDPANRRRFE WRRANPFGAL KQMRHYPGIG WVGLVFFLYW LAHAVYPAVW
SFVASYRYGW SEGQIGLSLG IFGVGGAIVM ALVLPRVVPA LGERRTAALG LTFTALGMAG
YAAAWEGWMV YAVIVATALE SLADPPLRSI ASVHVPPSAQ GELQGALTSI SSMTTIIGPL
MFTQIFAYFT NPAATYAFSG APYAVAGCLI IAALLIFLAK VRTGRGGTYR DGVPGIAEAS
S