Gene Smed_5083 details

Gene Information

Locus tag: Smed_5083
Symbol:
ID: 5319385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 30856
End bp: 32169
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 65%
IMG OID: 640776863
Product: major facilitator transporter
Protein accession: YP_001313795
Protein GI: 150377200
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
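
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive positions, and that the final codon of the gene is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 30856
end_bp = 32169
gene_length_bp = 1314
protein_length_aa = 437

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with gene length
print(remainder == 0)                            # gene length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop agree with protein length
```

Under those assumptions all three checks agree with the listed values: 32169 - 30856 + 1 = 1314 bp, and 1314 / 3 = 438 codons, one of which is the stop codon.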


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.418338
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGA TCACAGTGGA CGATGCGCTC GACCGCGCCG GGACAGGCGC GTATCAGCGC 
CGGCTCATGG CCATATTCGG GCTGGTGTGG GCGGCCGATG CGATGCAGGT GCTGGCGGTC
GGCTTCACCG CCGCATCGAT CGCCGCAACT TTCGGTCTTA CGGTCCCGCA GGCGCTGCAG
ACCGGTACCC TGTTCTTCCT TGGCATGCTG CTTGGCGCCG TCTCCTTCGG CAAGCTCGCC
GACCGGATCG GCCGTCGTCA TGTCCTGATC GTCACCGTCT CCCTGGATGC CCTCTTCGGC
CTGCTTTCGG TTTTTGCGCC GAATTTTGCC TTCTTGCTTC TGCTGCGCTT TCTGACAGGC
GCTGCAGTCG GCGGTACATT GCCGGTCGAC TACGCGATGA TGGCGGAGTT CTTGCCGGCC
CGCAATCGCG GCCGTTGGCT CGTCATGCTG GAGGGATTCT GGGCCGTCGG CACGCTCGTC
GTCGCGCTCG CCGCATGGGC TGCCAGTCTT GCCGGTGTCG CCGATGCCTG GCGCTACATC
TTTGCCGTGA CGGCGATCCC GGCGCTGATC GGAGTCGGCC TGCGTTTTCT GGTGCCGGAA
TCGCCGCTCT ATCTCCTGCG CCGCGGAAAA GCCCATGAAG CCAAGACCAT TGTCGAGCGT
ATCCTCCTCG TAAACGGCAA GAGTAAACTG GGCGCTGACG TGTCGCTGGT TTCGCCGCCG
CCGGTTGCGA GCGAAGGCAT CTTCTCCGCG GATATGCGCA GGCGCAGCCT GTTGATCCTG
GCGATCTGGT TCCTCGTCTC GGTATCCTAT TACGGCGTTT TTACCTGGAT GCCTCCGCGA
CTGGCGGGCG AGGGCTTCGG GTTCGTTCGC GGCTATGGCT TCCTCGTCTT CCTGGCCTTG
GCGCAGATAC CCGGCTATGC CCTCGCGGCC TATGGCGTGG AGAAGTGGGG CCGCCGGCCG
ACGCTGATCG GCTTCTGCCT GCTGTCGGCG CTTGGCTGCC TGCTCTTTGT GGCGGCCGAA
TCGGGCACGC TGATCGGTGC CTCTCTCCTG ACCATGAGCT TCGCCCTGCT TGGCACCTGG
GGTGCGCTTT ATGCCTATAC GCCGGAACTC TATCCTACCG CATCGCGGGC AACGGGCATG
GGCGCGGCGG GCGGCATGGC GCGGCTCGGG GGTCTTCTCG CGCCCTCGCT GATGGGGCTC
GTCGTGGCGC AGAGCTTCAC TCTTGCAGTC GGTATTTTCT CGGCCTTCCT GCTCGCGGCA
GCCCTGGCCG CCTTTCTCAT CGACACCGAG ACGCGTCGCG CGTCTCTCGC TTGA
 
Protein sequence
MTTITVDDAL DRAGTGAYQR RLMAIFGLVW AADAMQVLAV GFTAASIAAT FGLTVPQALQ 
TGTLFFLGML LGAVSFGKLA DRIGRRHVLI VTVSLDALFG LLSVFAPNFA FLLLLRFLTG
AAVGGTLPVD YAMMAEFLPA RNRGRWLVML EGFWAVGTLV VALAAWAASL AGVADAWRYI
FAVTAIPALI GVGLRFLVPE SPLYLLRRGK AHEAKTIVER ILLVNGKSKL GADVSLVSPP
PVASEGIFSA DMRRRSLLIL AIWFLVSVSY YGVFTWMPPR LAGEGFGFVR GYGFLVFLAL
AQIPGYALAA YGVEKWGRRP TLIGFCLLSA LGCLLFVAAE SGTLIGASLL TMSFALLGTW
GALYAYTPEL YPTASRATGM GAAGGMARLG GLLAPSLMGL VVAQSFTLAV GIFSAFLLAA
ALAAFLIDTE TRRASLA
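
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before either can be processed. The following is a minimal sketch, not the database's own method, of how the blocked gene sequence could be cleaned, its GC fraction computed, and its codons translated. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code; table 11 differs in the set of permitted start codons, which this sketch ignores because the gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the blocked layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGACGACGA TCACAGTGGA CGATGCGCTC"
print(translate(first_blocks))  # MTTITVDDAL
```

The ten residues produced from the first three blocks match the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be compared in the same way.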