Gene Smed_2034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2034 
Symbol 
ID5322893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2084076 
End bp2085287 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content65% 
IMG OID640790971 
ProductRND family efflux transporter MFP subunit 
Protein accessionYP_001327702 
Protein GI150397235 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTTT GGAATCAGCT CGCGATCAGC GTCGTCGTCC TCGCCGTCGG GGGCGCGGCC 
TGGGTGCGTT TTGCACCCGG AGCGGGCGAG ACCTTGGCCG CGATCGGCGT TTCGCAGCCG
TTGATCGATG CCCTGTCAGG GCCGCAGGAC GGCCAAGCGG GACGCGGCGG TTCCGGTAAT
GCCGGCCGCG GGCAGGGCGG ACGCGGGCAG GGGGGGCTTG GCGGCTTTGC GGACGTTCCG
CTGGTCGTCG TCCGGCCGGC CGCGAGCTCA CTCGTCAACG ACAGACTGAA TGCCATCGGC
AACGGCGAAG CGATCCGTTC GGTTACGGTT ACGCCGACCG CAACCGGAAA CCTCACGGAA
ATACTGGTAA AATCGGGTGA CAGGATCGCG GAAGGCCAGG TAATCGCCCG TCTCGACAGC
GACGATCAGA TGATTGCTGC CGAGCAGGCA CGGTTGACCC GCGACAGCGC CCGGGAAAAA
GTCGAGCGCT ACCGCAATCT CAGCACCGCG CGCGCAGTGA CGGCGGTCGA AGTGCGTGAC
GCCGAATTTG CGCTGCAGGC GGCCGAACTG GCGCTGAAAA CGGCCGAACT CGACCTGAAG
CGGCGCGATA TCGCAGCGCC TTCAAAGGGC GTCGTGGGCA TCATCACCGT CAATATAGGA
GATTACGTCA CGACATCGAC GCCGATCGCG GTGGTTGACG ACCGTTCGCA AATCCTGGTC
GATTTCTGGG TTCCAGAGCG CTTCGCGGGC AAGATCTTCG TCGATCAGCC GGTGACCGCG
AACGCGATCG CGCGGCCAGC CCGCGCACTC CAGGGCGTTG TTCATGCGAT AGACAACCGC
CTGGACCCGG AGAGCCGAAC GCTCAGGGTC CGGGCAAGAC TCGAGAATCC GGACGACATG
CTGCGCGCCG GCATGTCCTT CTCGGTCACA GTGGCATTCG AAGGTGATCG TTATCCCACC
GTCGACCCGC TGGCGATCCA GTGGAGCTCC GAAGGATCCT TTGTCTGGCG CGTCAATGGC
GACAAGAGCG AGCGTGTGCC GATCAAAATT ATCCAGCGCA ACCCCGACAA GGTGCTCGTG
GAAGCGGAAC TCGCCGAGGG CGACCGAGTC GTCACCGAAG GCGTGCAGCG GCTGCGCGAC
GGCGGCGCCG TGCGCATTGC CGGCGAGCCT GCGGCCGAGG CCGGGCAGAA GGTTGCGGGA
GACGCGCAAT GA
 
Protein sequence
MRFWNQLAIS VVVLAVGGAA WVRFAPGAGE TLAAIGVSQP LIDALSGPQD GQAGRGGSGN 
AGRGQGGRGQ GGLGGFADVP LVVVRPAASS LVNDRLNAIG NGEAIRSVTV TPTATGNLTE
ILVKSGDRIA EGQVIARLDS DDQMIAAEQA RLTRDSAREK VERYRNLSTA RAVTAVEVRD
AEFALQAAEL ALKTAELDLK RRDIAAPSKG VVGIITVNIG DYVTTSTPIA VVDDRSQILV
DFWVPERFAG KIFVDQPVTA NAIARPARAL QGVVHAIDNR LDPESRTLRV RARLENPDDM
LRAGMSFSVT VAFEGDRYPT VDPLAIQWSS EGSFVWRVNG DKSERVPIKI IQRNPDKVLV
EAELAEGDRV VTEGVQRLRD GGAVRIAGEP AAEAGQKVAG DAQ