Gene Smed_3859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3859 
Symbol 
ID5318296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp315667 
End bp316875 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content63% 
IMG OID640775671 
Productmajor facilitator transporter 
Protein accessionYP_001312604 
Protein GI150376008 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.526422 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAGTT CGCATTATCG CTGGGTGATC GTTGCTGCAG GCGGCCTGCT GGGGTGTATT 
GCAATCGGTG CCATGTTTTC CTTGCCGGTT TTCCTCGTCC CCATCTCGCG CGATACCGGA
TGGTCGGTGA CCGGCATCTC GAGCGCCATG ACCGTGGGTT TCCTTGCCAT GGCGCTCGCA
AGCATGGTCT GGGGTAGCGC CTCGGATCGC TGGGGGCCGC GCCCGGTCGT GCTCATCGGA
TCGGCGCTCC TTGCTTCCAG CCTGGCTCTT TCGAGTTTCG TGACCTCGCT CATCGCATTT
CAGCTCATCT TCGGCGTTTT CGTCGGCGGT GCCTGCGCGG CGATATTCGC GCCGATGATG
GCTTGCGTTA CGGGCTGGTT CGATACGCAT CGGAGCCTTG CCGTATCGCT GGTATCGGCC
GGCATGGGGA TGGCGCCCAT GACCATGTCT CCGCTGGCCG GCTGGCTGAT AACGATCTAC
GACTGGCGCA CATCGCTGCA GATCATAGCC GCCATTGCCG CCGTCACGAT GATTCCAGCC
GCGATGCTGC TGCGTCGCCC GCCGGTCCTG GAAGATCCGA ATGCCGGCCC TGCAAGCGAG
GGACAACCGG ACATGTCGCT TGGCCAGGCC TTGCGATCGC CGCAATTCGT CATCTTGCTG
CTGACGAACT TCTTCTGCTG CGCCACCCAT TCGGGCCCGA TTTTCCACAC CGTGAGCTAT
GCCGTGAGCT GCGGTATCCC GATGATGGCC GCGGTTTCCA TCTACAGCCT CGAGGGGCTG
GCGGGGATGG GCGGCCGTGT TGCCTTCGGC ATCCTCGGAG ACCGCTACGG CGCGAAGCGT
ATTCTCGTAT CGGGTCTGCT GCTGCAGGCT TTCGGCGCGC TCGCCTATTT CTTCGTGCGC
GACCTCGGCG CTTTTTATGC AGTGGCTGCC TTGTTCGGCT TTATCTATGC AGGCGTCATG
CCGCTTTACG CGGTGATCGC CCGAGAAAAC TTCCCGCTGC GTATGATGGG CACCGTAATC
GGCGGCACGG CAATGGCCGG CAGCCTCGGC ATGGCGATCG GCCCGGTTGC CGGAGGCGTG
ATCTACGATG TTTTCGCCAG CTACGGTTGG CTCTATATCG GCGCCTGGGG CATCGGCATC
GGTGCTTTCC TGATCGCGCT GACCTTCAAG CCTTTCCCCA AACGACGGCC GGCAGCGGCG
GCCGCTTGA
 
Protein sequence
MISSHYRWVI VAAGGLLGCI AIGAMFSLPV FLVPISRDTG WSVTGISSAM TVGFLAMALA 
SMVWGSASDR WGPRPVVLIG SALLASSLAL SSFVTSLIAF QLIFGVFVGG ACAAIFAPMM
ACVTGWFDTH RSLAVSLVSA GMGMAPMTMS PLAGWLITIY DWRTSLQIIA AIAAVTMIPA
AMLLRRPPVL EDPNAGPASE GQPDMSLGQA LRSPQFVILL LTNFFCCATH SGPIFHTVSY
AVSCGIPMMA AVSIYSLEGL AGMGGRVAFG ILGDRYGAKR ILVSGLLLQA FGALAYFFVR
DLGAFYAVAA LFGFIYAGVM PLYAVIAREN FPLRMMGTVI GGTAMAGSLG MAIGPVAGGV
IYDVFASYGW LYIGAWGIGI GAFLIALTFK PFPKRRPAAA AA