Gene ECH74115_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1039 
SymbolmacA 
ID6968079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1053666 
End bp1054781 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content53% 
IMG OID643385052 
Productmacrolide transporter subunit MacA 
Protein accessionYP_002269552 
Protein GI209396080 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.184306 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGC GGAAAACCGT GAAGAAGCGT TACGTTATTG CGCTGGTGAT AGTCATCGCC 
GGACTGATTA CGTTATGGAG AATTCTTAAC GCACCCGTGC CGACTTATCA GACACTGATT
GTGCGCCCCG GTGATTTACA GCAAAGCGTG CTGGCGACCG GAAAGCTGGA CGCGCTGCGT
AAGGTTGACG TGGGCGCGCA GGTCAGCGGT CAGTTGAAAA CGCTGTCGGT GGCGATTGGC
GATAAAGTAA AAAAAGACCA GCTTTTAGGG GTTATTGATC CTGAACAGGC TGAAAACCAG
ATCAAGGAGG TCGAAGCAAC GCTGATGGAG CTACGCGCGC AGCGGCAGCA GGCGGAAGCG
GAGCTGAAAC TGGCGCGGGT GACGTATTCC CGTCAGCAAC GTCTGGCACA AACGCAGGCG
GTTTCACTGC AGGATCTCGA CACCGCCGCG ACGGAGATGG CTGTGAAACA GGCGCAAATT
GGCACCATTG ACGCGCAAAT CAAGCGCAAT CAGGCTTCTC TCGATACGGC TAAAACCAAT
CTCGATTACA CTCGCATCGT TGCCCCGATG GCCGGGGAAG TCACGCAAAT CACCACTCTG
CAAGGCCAGA CGGTGATTGC CGCACAACAA GCACCGAACA TTCTGACGCT GGCAGATATG
AGCACCATGC TGGTAAAAGC GCAGGTTTCT GAAGCGGATG TAATCCACCT GAAGCCGGGG
CAAAAAGCCT GGTTTACGGT ACTTGGCGAT CCACTGACGC GCTACGAGGG GCAAATCAAG
GATGTACTAC CGACGCCGGA AAAGGTTAAC GACGCTATTT TCTATTACGC CCGTTTTGAA
GTCCCCAACC CCAATGGTTT GCTGCGGCTG GATATGACTG CGCAAGTGCA TATTCAGCTC
ACCGATGTGA AAAATGTGCT GACGATCCCT CTGTCGGCGT TAGGCGATCC GGTTGGCGAT
AATCGTTATA AAGTCAAATT GTTGCGTAAT GGTGAAACAC GCGAGCGTGA AGTGACGATT
GGCGCACGTA ACGATACCGA TGTTGAGATT GTCAAAGGGC TGGAAGCGGG CGATGAAGTG
GTGATTGGTG AGGCCAAACC AGGAGCTGCA CAATGA
 
Protein sequence
MKKRKTVKKR YVIALVIVIA GLITLWRILN APVPTYQTLI VRPGDLQQSV LATGKLDALR 
KVDVGAQVSG QLKTLSVAIG DKVKKDQLLG VIDPEQAENQ IKEVEATLME LRAQRQQAEA
ELKLARVTYS RQQRLAQTQA VSLQDLDTAA TEMAVKQAQI GTIDAQIKRN QASLDTAKTN
LDYTRIVAPM AGEVTQITTL QGQTVIAAQQ APNILTLADM STMLVKAQVS EADVIHLKPG
QKAWFTVLGD PLTRYEGQIK DVLPTPEKVN DAIFYYARFE VPNPNGLLRL DMTAQVHIQL
TDVKNVLTIP LSALGDPVGD NRYKVKLLRN GETREREVTI GARNDTDVEI VKGLEAGDEV
VIGEAKPGAA Q