Gene EcSMS35_0506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0506 
SymbolacrA 
ID6144292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp512739 
End bp513968 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content54% 
IMG OID641615400 
Productacriflavine resistance protein A 
Protein accessionYP_001742607 
Protein GI170681835 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.951175 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCAATT TGAAATCGGA CACTCGAGGT TTACATATGA ACAAAAACAG AGGGTTTACG 
CCTCTGGCGG TCGTTCTGAT GCTCTCAGGC AGCTTAGCCC TAACAGGATG TGACGACAAA
CAGGCCCAAC AAGGTGGCCA GCAGATGCCC GCCGTTGGCG TAGTAACAGT CAAAACTGAA
CCTCTGCAGA TCACAACCGA GCTTCCGGGT CGCACCAGTG CCTACCGGAT CGCAGAAGTT
CGTCCTCAAG TTAGCGGGAT TATCCTGAAG CGTAATTTCA AAGAAGGTAG CGACATCGAA
GCAGGTGTCT CTCTCTATCA GATTGATCCT GCGACCTATC AGGCGGCATA CGACAGTGCG
AAAGGTGATC TGGCGAAAGC CCAGGCTGCA GCCAATATCG CGCAATTGAC GGTGAATCGT
TATCAGAAAT TGCTCGGTAC TCAGTACATC AGTAAGCAAG AGTACGATCA GGCTCTGGCT
GATGCGCAAC AGGCGAATGC TGCGGTCACT GCGGCGAAAG CTGCCGTTGA AACTGCGCGG
ATCAATCTGG CTTACACCAA AGTCACCTCG CCGATTAGCG GTCGCATTGG TAAGTCGAAC
GTGACAGAAG GCGCATTGGT ACAGAACGGT CAGGCGACTG CGCTGGCAAC CGTGCAGCAA
CTTGATCCGA TCTACGTTGA TGTGACCCAG TCCAGCAACG ACTTCCTGCG CCTGAAACAG
GAACTGGCAA ATGGCACGCT GAAACAAGAG AACGGCAAAG CCAAAGTGTC GCTGATCACC
AGTGACGGCA TTAAGTTCCC GCAGGACGGT ACGCTGGAAT TCTCTGACGT TACCGTTGAT
CAGACCACCG GGTCTATCAC CCTACGCGCT ATCTTCCCGA ACCCGGATCA CACTCTGCTG
CCGGGTATGT TCGTACGTGC GCGTCTGGAA GAAGGGCTTA ATCCAAACGC TATTTTAGTC
CCGCAACAGG GCGTAACCCG TACGCCGCGT GGCGATGCCA CCGTACTGGT GGTTGGCGCG
GATGACAAAG TGGAAACCCG TCCGATCGTT GCAAGCCAGG CTATCGGCGA TAAGTGGCTG
GTGACAGAAG GTCTGAAAGC AGGCGATCGC GTAGTAATAA GTGGGCTGCA GAAAGTGCGT
CCTGGTGTCC AGGTAAAAGC ACAAGAAGTT ACCGCTGATA ATAACCAGCA AGCCGCAAGC
GGTGCTCAGC CTGAACAGTC CAAGTCTTAA
 
Protein sequence
MTNLKSDTRG LHMNKNRGFT PLAVVLMLSG SLALTGCDDK QAQQGGQQMP AVGVVTVKTE 
PLQITTELPG RTSAYRIAEV RPQVSGIILK RNFKEGSDIE AGVSLYQIDP ATYQAAYDSA
KGDLAKAQAA ANIAQLTVNR YQKLLGTQYI SKQEYDQALA DAQQANAAVT AAKAAVETAR
INLAYTKVTS PISGRIGKSN VTEGALVQNG QATALATVQQ LDPIYVDVTQ SSNDFLRLKQ
ELANGTLKQE NGKAKVSLIT SDGIKFPQDG TLEFSDVTVD QTTGSITLRA IFPNPDHTLL
PGMFVRARLE EGLNPNAILV PQQGVTRTPR GDATVLVVGA DDKVETRPIV ASQAIGDKWL
VTEGLKAGDR VVISGLQKVR PGVQVKAQEV TADNNQQAAS GAQPEQSKS