Gene GM21_4033 details


Gene Information

Locus tag: GM21_4033
Symbol:
ID: 8139407
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4616438
End bp: 4617814
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 64%
IMG OID: 644871649
Product: MATE efflux family protein
Protein accession: YP_003023807
Protein GI: 253702618
COG category: [V] Defense mechanisms
COG ID: [COG0534] Na+-driven multidrug efflux pump
TIGRFAM ID: [TIGR00797] putative efflux protein, MATE family
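
The Start bp, End bp, Gene Length and Protein Length fields can be checked against one another. The sketch below is a minimal consistency check, assuming the coordinates are 1-based and inclusive (an assumption about this record's convention, not something it states) and that the CDS ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4616438
END_BP = 4617814
GENE_LENGTH_BP = 1377
PROTEIN_LENGTH_AA = 458


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for a CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span is 1377 bp, which is 459 codons, or 458 residues plus a stop codon.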


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.00000078929
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACCACG GCCACGAGCT GCTGCACCAA CCGATCCCGG GGTTGATCAG GAAACTGGCC 
GTTCCCACCA GCGTCGGCTA TTTCTTCAAC ACCATGTTCA ACGTGGTCGA CACCTTCTAC
GGCGGAAGGG TCTCCACCGA GGCGCTCGCC GCACTTTCCC TTTCCTTTCC CATTTTCTTC
CTCATCATCG CCATCGGCGC CGGGATCTCC ACCGGGGCGA CCGCGCTCAT AGGCCACGAG
CTCGGCGCGG GCAACGCCGA AGAGGCGCGG CACCTGGCGG GACAGACCAT CTCCTTCGGC
ATCGTGCACG GCGTACTGGT CGCGGCGGTC GGCTTCTCGG CCGCTCCGCT GCTCTTCCAG
CTCCTGGGCG CCAAAGGGGC TGTCTTGCAG CTCGCGCTGC AGTACATGGA CACCATCTTC
ATCGGGAGCA TCTTCTTCCT GATCAACTAC GTGTTGAACT CCATCCTGAA CGCGACCGGC
GACAGCCGCA GCTTCCGTAA CTTCCTGGTC GTCGGGTTCT TCCTCAACCT CGTCTTCGAC
CCCTGGTTTC TCTACGGCGG GCTCGGGGTG CCGGCGCTGG GGATATCCGG CATAGCCTGG
GCCACCATAC TGATCCAGGC CATCGGTAAC TGTTATCTGG CGGCGAGGGT CAGGCAGTCG
GGGATGCTGG AGGGCTTCCG CTGGAGGGAG CTTATTCCCA GCCGGCACGC TTACTCCCAA
CTGGCCCGGC AGGGGTTTCC CTCCAGCCTC AACATGATGA CCGTCGCCAG CGGCATCTTC
CTGATCACCT GGTTCGTCGG GCGCTTCGGG AGCGAGGCGG TGGCGGCTTA CGGTATCGGC
TCCCGGATCG AGCAGATCGC GCTCCTTCCG GTGATGGGGA TGAACGTGGC GACGCTCGCG
CTTGTGGCGC AAAACAGCGG GGCCAGGCAG TTGGAGCGGG TGGTGCAGAC CATCAAGACC
GCGCTGCGGG TAGGGGTGGC GCTGATGGGC GCCGGAACGG TGGTCGTGTT CCTTGCGGCC
CGGCCGCTGA TGGGGCTATT CAGCAACGAC CCCAAGGTGG TGGAGATAGG GGTCGGCTAT
CTCAGGATCG AGTCCTTCGT CTTCATGGCC TACGTCATCC TCTACACCTG CGTCGCCGTG
CTTCAGGGGT TGAAGAGGCC AGGGTTTGCC CTGATGATCG GGTTGATGAG GCAGATCGTT
TTCCCCCTTC CGGTCTTCTA CCTCCTGGCG GTGTTCTTGG GGTTCGGTCT CACCGGGATC
TGGTGGGGAA TACTGCTGGT GACCTGGGGA GCCGCCTGCG TCACCGTCGT GTACGTGCTG
CGGCTGGCGG CAGGCATGAG CCCCGCCGGC GCTGGGCTGG AGAGGGCTGC CGACTGA
 
Protein sequence
MNHGHELLHQ PIPGLIRKLA VPTSVGYFFN TMFNVVDTFY GGRVSTEALA ALSLSFPIFF 
LIIAIGAGIS TGATALIGHE LGAGNAEEAR HLAGQTISFG IVHGVLVAAV GFSAAPLLFQ
LLGAKGAVLQ LALQYMDTIF IGSIFFLINY VLNSILNATG DSRSFRNFLV VGFFLNLVFD
PWFLYGGLGV PALGISGIAW ATILIQAIGN CYLAARVRQS GMLEGFRWRE LIPSRHAYSQ
LARQGFPSSL NMMTVASGIF LITWFVGRFG SEAVAAYGIG SRIEQIALLP VMGMNVATLA
LVAQNSGARQ LERVVQTIKT ALRVGVALMG AGTVVVFLAA RPLMGLFSND PKVVEIGVGY
LRIESFVFMA YVILYTCVAV LQGLKRPGFA LMIGLMRQIV FPLPVFYLLA VFLGFGLTGI
WWGILLVTWG AACVTVVYVL RLAAGMSPAG AGLERAAD
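
The gene and protein sequences are displayed in space-separated blocks of 10 characters. The following sketch shows one way to strip that formatting and translate the nucleotide sequence so it can be compared with the listed protein. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons). The names `clean` and `translate` are hypothetical helpers, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, dropping one terminal stop codon if present."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGAACCACG GCCACGAGCT GCTGCACCAA"))  # MNHGHELLHQ
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would extend the same check to the whole record.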