Gene TM1040_1551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1551 
Symbol 
ID4075849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1658100 
End bp1659422 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content63% 
IMG OID638006864 
ProductMATE efflux family protein 
Protein accessionYP_613546 
Protein GI99081392 
COG category[V] Defense mechanisms 
COG ID[COG0534] Na+-driven multidrug efflux pump 
TIGRFAM ID[TIGR00797] putative efflux protein, MATE family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.959023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGAGG CGTCAGGCCC CATCACCCAT TCCCGGGTGC TCAAGATCGC GTTGCCGATT 
GTTTTGTCCA ATGCCACCGT GCCGATCCTC GGCGCCGTCG ACACCGGCGT GGTGGGACAA
CTTGGCGAAG CGGCCCCGAT CGGAGCGGTG GGCGTTGGCA CGGTCATTCT GTCGACGATC
TACTGGGTCT TTGGCTTTTT GCGGATGGGC ACCACCGGGC TGGCCTCGCA GGCGCGTGGC
GCCGGCGATC TGGCTGAGAC GGGCGCGCTG CTGATGCGGG GGCTGCTCTT GGCCTTTGGG
GCGGGTGCGT TTTTCATTGT TGCTCAGGCG CTTGTGTTCT GGGGCGCATT TACGATTGCC
CCTGCCAGCG CCGAGGTCGA GGAACTGGCG CGGCGCTATC TCGAGATCCG AATCTGGGGC
GCGCCCGCAA CCATCGGGCT TTATGCGGTG ACAGGCTGGC TGATCGCCAT CGAGCGCACC
CGCGCCGTGT TCCTGCTGCA GATCTGGATG AATGGGCTCA ATATCCTGCT GGATCTGTGG
TTTGTGCTCG GGCTCGACTG GGGGGTCGAG GGCGTGGCCA TTGCGACCCT GATCGCAGAG
TGTTCGGGGC TGGTGCTTGG ACTTTGGTAT TGCCGCTCAG CCTTTGCGGG CGATCAATGG
CATGACTGGG GCCGGATCTT TGACCGTGCG CGCCTCAAGC GGATGGTTCA GGTCAATGGC
GACATCATGG TGCGTTCGGT GCTGCTCACA CTGTCGTTCA CCACATTCTT GTTTCTCAGC
GCGGATATGG GCGATGTGCG GCTCGCCTCC AATCAGGTGT TGATTCAGTT TCTGCACATC
ACGGCCTTTG CGCTCGACGG GTTTGCCTTC AGCGCCGAGG CGCTGGTGGG TGGGGCGGTC
GGGGCGCGGG ATCGTGGTCG GCTGCGTCGC GCCGCATTGG TATCGAGCTA TTGGGGCATC
GGCTTTGCCG TGGCGCTCGG GGTGGGGTTC TGGCTGTGGG GGCCGTGGAT TGTTGACCTG
ATGACAACCG CACCCGACGT GCGCGAGACC GCGCGCGCCT ACCTGATCTG GCTGGCGTTT
GCGCCACTTC TGAGTGTCGC AAGCTATATG TTTGACGGGA TCTTTATCGG CGCGACCTGG
ACACGCGACA TGAGGATTGC CGCTTTGCAG TCGGTGGCTG TTTACGGCGT TGCGCTTGCA
ATTTGCGTGC CGATGTTTGG AAACCATGGA TTGTGGATGG CGCTGATGGT GCTCAATGTC
ACACGCGCGC TGACTCTGGC GCTGCGCTAC CCAAGACTGG AGGCCGGCAT CACCGCGCCC
TGA
 
Protein sequence
MSEASGPITH SRVLKIALPI VLSNATVPIL GAVDTGVVGQ LGEAAPIGAV GVGTVILSTI 
YWVFGFLRMG TTGLASQARG AGDLAETGAL LMRGLLLAFG AGAFFIVAQA LVFWGAFTIA
PASAEVEELA RRYLEIRIWG APATIGLYAV TGWLIAIERT RAVFLLQIWM NGLNILLDLW
FVLGLDWGVE GVAIATLIAE CSGLVLGLWY CRSAFAGDQW HDWGRIFDRA RLKRMVQVNG
DIMVRSVLLT LSFTTFLFLS ADMGDVRLAS NQVLIQFLHI TAFALDGFAF SAEALVGGAV
GARDRGRLRR AALVSSYWGI GFAVALGVGF WLWGPWIVDL MTTAPDVRET ARAYLIWLAF
APLLSVASYM FDGIFIGATW TRDMRIAALQ SVAVYGVALA ICVPMFGNHG LWMALMVLNV
TRALTLALRY PRLEAGITAP