Gene Smed_4892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4892 
Symbol 
ID5318230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1402319 
End bp1403518 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content61% 
IMG OID640776677 
Productmajor facilitator transporter 
Protein accessionYP_001313609 
Protein GI150377013 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.563893 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTCCG GAATCCCCGC AGGCATCGTC GCCGCACTCG GGCTCACCCA GATCGTCGGC 
TACGGAACGC TTTACTACAG CTTCAGCATC CTTGCGCCCG GAATGGCGCG CGACCTTGGC
CAGACTGTCG AGCAAGTGTT TGCCGTGTTC TCCGCTTCGT TGCTGGTGGG CGGCCTTTCG
GCTCCGATCA TGGGCGGTTG GATGGATCGG TTCGGTGCCG CGACGATCAT GACTTTTGGT
TCGGCCGTCT CCGCCGTGGC GCTTTTATTG TGCGCATGGT CTCCTTCGAT GCCGGTGTTC
GCCATAGCCA TCGTTCTTTT GGAGGTGGCG TCCGGCATGG TGCAATATCA GGCAGCCTTT
GCGACGCTCG TGGAAGTCCG CCCCAAGATG GCCTCGCGCA GTATCACCTA TCTGACGCTG
ATCGGGGGTT TCGCCTCGAC GATCTTCTGG CCGATTGCCG TGAACCTGAG TGAGCTTCTC
TCATGGCGTG AGATCTACGT CGCCTATGCG GGGTTGAACC TGCTCATCTG TCTTCCACTG
CATTACTGGA TTCTGCGAAG CAGGAAGCAC GGGGCGGTCG AACGTTCTCG CATCGATGGA
GAGGCTATCG CCGGAGCGCT TCCAACACGG GTGAGGCGGC GGGGCATGCT CCTTGTTTCA
TGGGCCTTTG CCCTCCAGGG GTTCACCCTT TCAGCAATAT TGACACATAT GGTACCGATG
CTCGGAGCAA TCGGGTTCGG GCCGGCAGCC GTCGTGATAG GGTCGCTGTT CGGACCGTCT
CAAGTTCTGA GCAGGCTGAT CAACATGACG CTAGGCGCTA ACCTGCTGCC GCCCATGCTC
GCGACGCTAT CGGCCGTGCT GATCGTCGCC GGTGTCGTCG TACTCGGTCT TTCCGGGACA
TGGCTACCCG GGGCCGTGGC TTTCGCGATT TGCCTCGGTC TTGGATCGGG CATCAACAGC
ATTGCGCAGG GCTCGCTTCC GCTCTGGCTG TTCGGATCCT CCGGCTATGG AGCCATTACG
GGCCGGATGG CCGCCGCTCG GCTGGCTGCA GCCGCCATGG CCCCCTTTGT CTTTTCCGTC
CTGATGGAAC GGTTCGGCAC CAATGTCGCC CTGATGGCCA ATGCCTGCCT GGGTGCGATC
GGTATAGCGG CATTCGTCGC GGTAGCCTCG GCGGCCAAAC GTCAGCCGGG TGCGCACTAG
 
Protein sequence
MKSGIPAGIV AALGLTQIVG YGTLYYSFSI LAPGMARDLG QTVEQVFAVF SASLLVGGLS 
APIMGGWMDR FGAATIMTFG SAVSAVALLL CAWSPSMPVF AIAIVLLEVA SGMVQYQAAF
ATLVEVRPKM ASRSITYLTL IGGFASTIFW PIAVNLSELL SWREIYVAYA GLNLLICLPL
HYWILRSRKH GAVERSRIDG EAIAGALPTR VRRRGMLLVS WAFALQGFTL SAILTHMVPM
LGAIGFGPAA VVIGSLFGPS QVLSRLINMT LGANLLPPML ATLSAVLIVA GVVVLGLSGT
WLPGAVAFAI CLGLGSGINS IAQGSLPLWL FGSSGYGAIT GRMAAARLAA AAMAPFVFSV
LMERFGTNVA LMANACLGAI GIAAFVAVAS AAKRQPGAH