Gene Smed_3991 details


Gene Information

Locus tag: Smed_3991
Symbol:
ID: 5317917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 443809
End bp: 444924
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 64%
IMG OID: 640775799
Product: secretion protein HlyD family protein
Protein accession: YP_001312732
Protein GI: 150376136
COG category: [V] Defense mechanisms
COG ID: [COG1566] Multidrug resistance efflux pump
TIGRFAM ID:
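
The coordinates and lengths listed above are mutually consistent, and a short check makes the arithmetic explicit. This is an illustrative sketch only: it assumes the start and end positions are 1-based and inclusive (an assumption that agrees with the stated gene length) and that the final codon is a stop codon that does not count toward the protein length. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 443809
end_bp = 444924

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1116 371
```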


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.713524
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCGAA AGATCAAAGG CGTAGCGTCC GCGGCGACCG CGCTCCTGGT GTTTGGAGTG 
GGGGCGTATT TTGGACGGGC ATGGTGGCAA GAGTTCCGCA TGCACGAACG GACGGACAAC
GCCTATGTCC GCGCCGACAT CACCGCCATC AGTCCCAAAG TCGCCGGATA CGTATCCACG
GTGCTGGTCG ACGATAACCA GGTCGTCGAA GCCGGCGCCA TTCTTTTGCG AATAGACGAC
GATGACTATC TCGCTCAGCG AGATCGTGCG GCTGCGAGCG TCGCACAGGC CGAAGCAGCC
GTAGGAAATC TGACACGCCG TAAGAGCCTG CAGCTCGCCA ATATCCGCGA GGCCGAAGCG
ATGATCGATG TCGCCCGCGC GGATCTCGAG CTTTCGCGCC GGGAGCTGTC ACGAGCGACG
CGCCTGGTCG ATCAGGGATG GACGGCGCAG CGGAACCACG ATACGGCGAC GGCGAAAGCA
CAGAGTGCTC GCGCCACGCT CGTTCGCGCC GAAGCGGCGG CAGCGGCTGC CCGGGCGCAG
CTGGCAGTGC TTGATTCGGA ATCTCCGCAG ATCTCGGCGC GCCTCGCCGA GGCGCGCGCA
AACCTCCGAC TCGCGGAAAT TGCCCTTTCT GAAACGGTCA TCAGAGCGCC GGTGTCCGGA
GTTGTCGGGA ACCGAAAAGT GCGCGAGGGC GAATATGTCC GGCCCGGCAG CGTTCTTCTG
TCCGTGGTTC CCCTTGACGG CATCTGGGTC GTCGCCAATC TCAAGGAGAC ACAGCTTGCC
CGCGTCATGC CTGGCCAACG TGCCGAAATC CGCGTGGACG GTTATTCGAC GACGGTGATC
GAAGGACGCG TAGACAGCCT GGCACCCGCA AGCGGCGCCG CCTTCAGTCT TTTACCACCG
GACAATGCGA CCGGGAATTT CATCAAGGTC GTTCAGCGCG TGCCCGTCAA GATCCGGCTT
GAGCCCGATC ACGCTTTTCA GGGCCGGCTC GTCCCGGGTC TGTCGGTGGA CGTCGCCATA
CACCTGGCTC CCGAGCCCGA GCGGCCACCC TCCCAAAGCA ATCCGGTCGC CGCTGGCCGC
CCGACTTCAT CCCTAACTGC CAGGAGAGAG CCATGA
 
Protein sequence
MKRKIKGVAS AATALLVFGV GAYFGRAWWQ EFRMHERTDN AYVRADITAI SPKVAGYVST 
VLVDDNQVVE AGAILLRIDD DDYLAQRDRA AASVAQAEAA VGNLTRRKSL QLANIREAEA
MIDVARADLE LSRRELSRAT RLVDQGWTAQ RNHDTATAKA QSARATLVRA EAAAAAARAQ
LAVLDSESPQ ISARLAEARA NLRLAEIALS ETVIRAPVSG VVGNRKVREG EYVRPGSVLL
SVVPLDGIWV VANLKETQLA RVMPGQRAEI RVDGYSTTVI EGRVDSLAPA SGAAFSLLPP
DNATGNFIKV VQRVPVKIRL EPDHAFQGRL VPGLSVDVAI HLAPEPERPP SQSNPVAAGR
PTSSLTARRE P
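
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is a minimal illustration, not the method used to produce this record: it builds the codon-to-amino-acid mapping in the usual TCAG ordering (translation table 11 uses the same amino-acid assignments as the standard code and differs in its permitted start codons), strips the spacing used in the listing, and stops at the first stop codon. The function and variable names are hypothetical.

```python
# Codon table in NCBI "TCAG" order; amino-acid assignments are shared by
# translation tables 1 and 11 (table 11 differs only in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence listed above.
first_row = "ATGAAGCGAA AGATCAAAGG CGTAGCGTCC GCGGCGACCG CGCTCCTGGT GTTTGGAGTG"
last_row = "CCGACTTCAT CCCTAACTGC CAGGAGAGAG CCATGA"

print(translate(first_row))  # prints: MKRKIKGVASAATALLVFGV
print(translate(last_row))   # prints: PTSSLTARREP
```

The two outputs match the start and end of the protein sequence listed above; the trailing TGA codon is the stop and contributes no residue.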