Gene Smed_3470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3470 
Symbol 
ID5324358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3678667 
End bp3679932 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content65% 
IMG OID640792422 
Productmajor facilitator transporter 
Protein accessionYP_001329123 
Protein GI150398656 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATAGAT CGGGTTCGTC GCTGCTGTCG ATCGCGAGCA TCGTCGCCTC GATGACGTCT 
GTCGCGGTCG GAAACGGCAT GATGCTCGCC TATGTACCCT TCGTTTTGAC GCGTTCGACG
GCGCCGGACT GGGTTGCGGG CGCTGCGGTG ACGGCGATTG CCTTCGGCAG CCTGATCGGC
TGTCTCATGG GTGGCTCGCT CATTCGCCGC GTCGGCCATG CCCGGGCGTT TTCCTGCTCC
ATGGCGCTCG TCATACTGGC GGCGCTGACG ATCAGCCTCG GCGTTCATCC GCTGCTCTGG
GTGGTTGCAC GCGGGCTCTA CGGCATTGCC GCCAGCATGA ATTTCATCAT CACACAAAGC
TGGCTGAACC ATGTCAGCGA GAACCACCGG CGCGGCCGGG CGATGGCGCT CTTCTACATG
GCCTACGTCA TCGGCCTCGG CGCCGGCGCC TGGCTCTTCG GTCAGATCCC GTCCGAGGGC
AATCTCGCGC CCGTCATCAC GATCTTCTTC ACCGCGCTCG CCATTCTGCC GATTGGCCTG
ACGCGGCTGC CGACCCCGCC CGCCCCCGCC AGGGTCAGCA TTGATATTCC GATGGTCTGG
CGCATTTCCC CGGTCGCCTT CGTCGGCGTG CTTGCGTCCG GCGGCCTCTC CATGCTGGTA
CAGGGCTTCA CGCCCATCTA TGCGGCCGCC AACTCGGTCA GCCAGAAAGA CGTGGCGGCC
TTGATGTTCG TGATGCAGTT CGGCCTGCTC TTCATTCAGT ATCCGCTGGG CGCCCTCTCC
GACCGCACCG ACCGGCGGAT CGTCATGGTC GTCACCTGTG CGCTCGTCAT CGCGGCGGGC
TTTGCTGCGC TCGCCGTCTC CTTCGACAAC CTCATCCTGC TCATGCTGGT TTTCGCGATC
TTCGCCGGGG CAGTCGAGAC GGTCTATTCG ATTGCCAATG CGCATGCCAA CGATCGCACC
GAGCCGGCGG ATTTCGTGCC GCTCGCCAGC ACCATGCTGA TCGCCTGGTC GGCGTCGGCG
ACGCTGGTGC CGATGCTCGT GACACTGCTG ACGCCGGCCT TCGGTGAGCG CACTTTCATC
TATGCGACGA TGACGGTGGC GCTCCTCTAT GCGCTGTTCG TGCTCGTTCG CCTCCTGTCG
CGCGAACGCG TCCCGCCCGA ACTGTGCGAA TCCTTTGAGT TCAAAAGCGC GCAGGTGCCG
AACGCCGCAG CTCTTGGCGA GACCGAGGCA CACGCGGAAT GCCCGCCCCG GCGCCCGGAT
TTATAG
 
Protein sequence
MDRSGSSLLS IASIVASMTS VAVGNGMMLA YVPFVLTRST APDWVAGAAV TAIAFGSLIG 
CLMGGSLIRR VGHARAFSCS MALVILAALT ISLGVHPLLW VVARGLYGIA ASMNFIITQS
WLNHVSENHR RGRAMALFYM AYVIGLGAGA WLFGQIPSEG NLAPVITIFF TALAILPIGL
TRLPTPPAPA RVSIDIPMVW RISPVAFVGV LASGGLSMLV QGFTPIYAAA NSVSQKDVAA
LMFVMQFGLL FIQYPLGALS DRTDRRIVMV VTCALVIAAG FAALAVSFDN LILLMLVFAI
FAGAVETVYS IANAHANDRT EPADFVPLAS TMLIAWSASA TLVPMLVTLL TPAFGERTFI
YATMTVALLY ALFVLVRLLS RERVPPELCE SFEFKSAQVP NAAALGETEA HAECPPRRPD
L