Gene ECH74115_1075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1075 
SymbolmsbA 
ID6971625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1099619 
End bp1101367 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content51% 
IMG OID643385087 
Productlipid transporter ATP-binding/permease protein 
Protein accessionYP_002269586 
Protein GI209398481 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID[TIGR02203] lipid A export permease/ATP-binding protein MsbA 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.220185 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATAACG ACAAAGATCT CTCTACGTGG CAGACATTCC GCCGACTGTG GCCAACCATT 
GCGCCTTTTA AAGCGGGTCT GATCGTGGCG GGCGTAGCGT TAATCCTCAA CGCAGCCAGC
GATACCTTCA TGTTATCGCT CCTTAAGCCA CTTCTTGATG ATGGCTTTGG TAAAACAGAT
CGCTCCGTGC TGGTGTGGAT GCCGCTGGTG GTGATCGGGC TGATGATTTT GCGTGGTATC
ACCAGCTATG TCTCCAGCTA CTGTATCTCC TGGGTATCAG GAAAGGTGGT AATGACCATG
CGTCGCCGCC TGTTTGGTCA CATGATGGGA ATGCCGGTTT CATTCTTTGA CAAACAGTCA
ACGGGTACGC TGTTGTCACG TATCACCTAC GATTCCGAAC AGGTTGCTTC TTCCTCTTCC
GGCGCACTGA TTACTGTTGT GCGTGAAGGT GCGTCGATCA TCGGCCTGTT CATCATGATG
TTCTATTACA GTTGGCAATT GTCGATCATT TTGATTGTGC TGGCACCGAT TGTTTCGATT
GCGATTCGCG TAGTATCGAA GCGTTTTCGC AACATCAGTA AAAACATGCA GAACACCATG
GGGCAGGTGA CCACCAGCGC TGAACAAATG CTGAAAGGCC ATAAAGAAGT ATTGATTTTC
GGTGGTCAGG AAGTGGAAAC GAAACGCTTC GATAAAGTCA GCAACCGAAT GCGTCTTCAG
GGGATGAAAA TGGTTTCAGC CTCTTCCATC TCTGATCCGA TCATTCAGCT GATCGCCTCT
TTGGCGCTGG CGTTTGTTCT GTATGCGGCG AGCTTCCCAA GTGTCATGGA TAGCCTGACT
GCCGGTACGA TTACCGTTGT TTTCTCTTCA ATGATTGCAC TGATGCGTCC GCTGAAATCG
CTGACTAACG TTAACGCCCA GTTCCAGCGC GGTATGGCGG CTTGTCAGAC ACTGTTTACC
ATTCTGGACA GTGAGCAGGA GAAAGACGAA GGTAAGCGCG TGATCGAGCG TGCGACTGGC
GACGTGGAAT TCCGCAACGT CACCTTTACT TATCCGGGAC GTGACGTACC CGCATTGCGT
AACATCAACC TGAAAATTCC GGCAGGGAAA ACGGTCGCTC TGGTTGGACG CTCTGGTTCA
GGTAAATCAA CCATCGCCAG CCTGATCACG CGTTTTTACG ATATTGATGA AGGCGAAATC
CTGATGGATG GTCACGATCT GCGCGAGTAT ACCCTGGCGT CGTTGCGTAA CCAGGTTGCT
CTGGTGTCGC AGAATGTCCA TCTGTTTAAC GATACGGTTG CTAACAACAT TGCTTACGCA
CGAACTGAAC AGTACAGCCG TGAGCAAATT GAAGAAGCGG CGCGTATGGC CTACGCCATG
GACTTCATCA ATAAGATGGA TAACGGTCTC GATACAGTGA TTGGTGAAAA CGGCGTACTG
CTCTCTGGCG GTCAGCGTCA GCGTATTGCT ATCGCTCGAG CCTTGTTGCG TGATAGCCCG
ATTCTGATTC TGGATGAAGC TACCTCGGCT CTGGATACCG AATCCGAACG TGCGATTCAG
GCGGCACTGG ATGAGTTGCA GAAAAACCGT ACCTCTCTGG TGATTGCCCA CCGCTTGTCT
ACCATTGAAA AGGCAGACGA AATCGTGGTC GTCGAGGATG GTGTCATTGT GGAACGCGGT
ACGCATAACG ATTTGCTTGA GCACCGCGGC GTTTACGCGC AACTTCACAA AATGCAGTTT
GGCCAATGA
 
Protein sequence
MHNDKDLSTW QTFRRLWPTI APFKAGLIVA GVALILNAAS DTFMLSLLKP LLDDGFGKTD 
RSVLVWMPLV VIGLMILRGI TSYVSSYCIS WVSGKVVMTM RRRLFGHMMG MPVSFFDKQS
TGTLLSRITY DSEQVASSSS GALITVVREG ASIIGLFIMM FYYSWQLSII LIVLAPIVSI
AIRVVSKRFR NISKNMQNTM GQVTTSAEQM LKGHKEVLIF GGQEVETKRF DKVSNRMRLQ
GMKMVSASSI SDPIIQLIAS LALAFVLYAA SFPSVMDSLT AGTITVVFSS MIALMRPLKS
LTNVNAQFQR GMAACQTLFT ILDSEQEKDE GKRVIERATG DVEFRNVTFT YPGRDVPALR
NINLKIPAGK TVALVGRSGS GKSTIASLIT RFYDIDEGEI LMDGHDLREY TLASLRNQVA
LVSQNVHLFN DTVANNIAYA RTEQYSREQI EEAARMAYAM DFINKMDNGL DTVIGENGVL
LSGGQRQRIA IARALLRDSP ILILDEATSA LDTESERAIQ AALDELQKNR TSLVIAHRLS
TIEKADEIVV VEDGVIVERG THNDLLEHRG VYAQLHKMQF GQ