Gene EcSMS35_2077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2077 
SymbolmdtG 
ID6145402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2091807 
End bp2093033 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content52% 
IMG OID641616953 
Productdrug efflux system protein MdtG 
Protein accessionYP_001744129 
Protein GI170679802 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.894184 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACCCT GTGAAAATGA CACCCCTATA AACTGGAAAC GAAACCTGAT CGTCGCCTGG 
CTAGGCTGTT TTCTTACCGG TGCCGCCTTC AGTCTGGTAA TGCCCTTCTT ACCCCTCTAC
GTTGAGCAGC TTGGCGTTAC CGGTCACTCC GCCCTGAATA TGTGGTCCGG TATTGTCTTC
AGCATTACAT TTTTATTTTC GGCCATCGCC TCACCGTTTT GGGGTGGACT CGCCGACCGT
AAAGGCCGAA AACTCATGCT ATTACGCTCT GCTCTCGGCA TGGGCATCGT GATGGTGTTG
ATGGGACTGG CACAAAATAT CTGGCAGTTT TTGATCCTGC GGGCGCTTCT TGGTTTACTT
GGCGGATTTG TCCCCAACGC TAATGCTCTT ATCGCCACAC AAGTACCGCG TAATAAAAGC
GGCTGGGCGC TGGGTACGCT CTCCACAGGC GGCGTTAGCG GCGCGTTGCT CGGCCCAATG
GCTGGCGGTT TGCTCGCTGA TAGCTACGGC TTACGTCCGG TATTCTTTAT TACCGCCAGT
GTGCTCATAC TCTGCTTTTT CGTCACCCTG TTTTGCATCA GAGAAAAATT CCAGCCGGTC
AGCAAAAAAG AGATGCTGCA TATGCGGGAA GTGGTGACAT CACTTAAAAA CCCGAAACTG
ATACTCAGCC TGTTTGTCAC CACGTTAATC ATCCAGGTGG CGACGGGCTC AATTGCCCCC
ATTCTGACGC TGTATGTCCG CGAACTGGCG GGTAACGTCA GTAACGTCGC TTTTATCAGT
GGCATGATCG CCTCGGTGCC AGGCGTGGCG GCTCTGCTGA GTGCACCACG ACTCGGCAAA
CTTGGCGATC GAATCGGCCC CGAAAAGATC CTGATTACGG CGCTGATCTT TTCTGTACTG
CTGTTGATCC CAATGTCATA CGTTCAGACG CCATTACAAC TTGGGATTTT ACGTTTTTTG
CTCGGTGCCG CCGATGGTGC ACTACTCCCC GCCGTACAGA CACTGTTGGT TTACAACTCG
AGCAACCAAA TCGCCGGGCG TATCTTCAGC TATAACCAGT CGTTTCGTGA TATTGGAAAC
GTTACCGGAC CATTGATGGG AGCCGCGATT TCAGCGAACT ACGGTTTCAG AGCGGTATTT
CTCGTCACCG CTGGCGTAGT GTTATTCAAC GCAGTCTATT CATGGAACAG TCTACGTCGT
CGTCGAATAC CCCAGGTATC GAACTGA
 
Protein sequence
MSPCENDTPI NWKRNLIVAW LGCFLTGAAF SLVMPFLPLY VEQLGVTGHS ALNMWSGIVF 
SITFLFSAIA SPFWGGLADR KGRKLMLLRS ALGMGIVMVL MGLAQNIWQF LILRALLGLL
GGFVPNANAL IATQVPRNKS GWALGTLSTG GVSGALLGPM AGGLLADSYG LRPVFFITAS
VLILCFFVTL FCIREKFQPV SKKEMLHMRE VVTSLKNPKL ILSLFVTTLI IQVATGSIAP
ILTLYVRELA GNVSNVAFIS GMIASVPGVA ALLSAPRLGK LGDRIGPEKI LITALIFSVL
LLIPMSYVQT PLQLGILRFL LGAADGALLP AVQTLLVYNS SNQIAGRIFS YNQSFRDIGN
VTGPLMGAAI SANYGFRAVF LVTAGVVLFN AVYSWNSLRR RRIPQVSN