Gene EcSMS35_2206 details

Gene Information

Locus tag: EcSMS35_2206
Symbol: msbA
ID: 6147458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2218888
End bp: 2220636
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 51%
IMG OID: 641617082
Product: lipid transporter ATP-binding/permease protein
Protein accession: YP_001744256
Protein GI: 170680104
COG category: [V] Defense mechanisms
COG ID: [COG1132] ABC-type multidrug transport system, ATPase and permease components
TIGRFAM ID: [TIGR02203] lipid A export permease/ATP-binding protein MsbA
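
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function name `expected_lengths` is hypothetical.

```python
def expected_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the stop codon is counted in the gene length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, which does not encode an amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(expected_lengths(2218888, 2220636))  # (1749, 582)
```

Under these assumptions the result agrees with the listed Gene Length (1749 bp) and Protein Length (582 aa).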


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0719842
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.317534
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATAACG ACAAAGATCT CTCTACGTGG CAGACGTTCC GCCGACTGTG GCCAACCATT 
GCGCCTTTTA AAGCGGGTCT GATCGTGGCG GGCGTAGCGT TAATCCTCAA CGCAGCCAGC
GATACCTTCA TGTTATCGCT CCTTAAGCCA CTTCTTGATG ATGGCTTTGG TAAAACAGAT
CGCTCCGTGC TGGTGTGGAT GCCGCTGGTG GTGATCGGGC TGATGATTTT GCGTGGTATC
ACCAGCTATG TCTCCAGCTA CTGTATCTCC TGGGTATCAG GAAAGGTGGT AATGACCATG
CGTCGCCGCC TGTTTGGTCA CATGATGGGA ATGCCGGTTT CATTCTTTGA CAAACAGTCA
ACGGGTACGC TGTTGTCACG TATCACCTAC GATTCCGAAC AGGTTGCTTC TTCCTCTTCC
GGCGCACTGA TTACTGTTGT GCGTGAAGGT GCGTCGATCA TCGGCCTGTT CATCATGATG
TTCTATTACA GTTGGCAACT GTCGATCATT TTGATTGTGC TGGCACCGAT TGTTTCGATT
GCGATTCGCG TTGTATCGAA GCGTTTTCGC AATATCAGTA AAAACATGCA GAACACCATG
GGGCAGGTGA CCACCAGCGC CGAACAAATG CTGAAAGGCC ATAAAGAAGT ATTGATTTTC
GGTGGTCAGG AAGTGGAAAC GAAACGCTTC GATAAAGTCA GCAACCGAAT GCGTCTTCAG
GGGATGAAAA TGGTTTCAGC CTCTTCCATC TCTGATCCGA TCATTCAGCT GATCGCCTCT
TTGGCGCTGG CGTTTGTTCT GTATGCGGCA AGCTTCCCAA GTGTCATGGA TAGCCTGACT
GCCGGTACGA TTACCGTTGT TTTCTCTTCA ATGATTGCAC TTATGCGTCC GCTGAAATCG
CTGACCAACG TTAACGCCCA GTTCCAGCGC GGTATGGCGG CTTGTCAGAC GCTGTTTACC
ATTCTGGACA GTGAGCAGGA GAAAGACGAA GGTAAGCGCG TGATCGAGCG TGCGACTGGC
GACGTGGAAT TCCGCAATGT CACCTTTACT TATCCGGGAC GTGACGTACC CGCATTGCGT
AACATCAACC TGAAAATTCC GGCAGGGAAG ACGGTTGCTC TAGTTGGACG CTCTGGTTCA
GGTAAATCAA CCATCGCCAG CCTGATCACG CGTTTTTACG ATATTGATGA AGGCGAAATC
CTGATGGATG GTCACGATCT GCGCGAGTAT ACCCTGGCGT CGTTACGTAA CCAGGTTGCT
CTGGTGTCGC AGAATGTCCA TCTGTTTAAC GATACGGTTG CCAACAACAT TGCTTACGCA
CGGACTGAAC AGTACAGCCG TGAGCAAATT GAAGAAGCGG CGCGTATGGC CTACGCCATG
GACTTCATCA ATAAGATGGA TAACGGTCTC GATACAGTGA TTGGTGAAAA CGGCGTGCTG
CTCTCTGGCG GTCAGCGTCA GCGTATTGCT ATCGCTCGAG CCTTGTTGCG TGATAGCCCG
ATTCTGATTC TGGACGAAGC TACCTCGGCG TTGGATACCG AATCCGAACG TGCGATTCAG
GCGGCACTGG ATGAGTTGCA GAAAAACCGT ACCTCTCTGG TGATTGCCCA CCGCTTGTCT
ACCATTGAAA AGGCAGACGA AATCGTGGTC GTCGAGGATG GTGTCATTGT GGAACGCGGT
ACGCATAACG ATTTGCTTGA GCACCGTGGC GTTTACGCGC AACTTCACAA AATGCAGTTT
GGCCAATGA
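
The gene sequence is printed in space-separated blocks of 10 bases, so the grouping has to be removed before computing quantities such as the length or the GC content. The following is a minimal Python sketch under that assumption. The function name `gc_percent` is hypothetical, and no output is shown for the full sequence because it has not been computed here.

```python
def gc_percent(seq_text):
    """Return (length in bp, GC percentage) for a whitespace-grouped sequence."""
    # Drop the spaces and line breaks used for display grouping.
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return len(seq), 100.0 * gc / len(seq)


# First display line of the gene sequence in this record.
first_line = "ATGCATAACG ACAAAGATCT CTCTACGTGG CAGACGTTCC GCCGACTGTG GCCAACCATT"
length, gc = gc_percent(first_line)
print(f"{length} bp, GC {gc:.1f}%")
```

Passing the complete gene sequence to `gc_percent` should give a length and percentage comparable to the Gene Length and GC content fields, assuming those fields were derived from the same sequence.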
 
Protein sequence
MHNDKDLSTW QTFRRLWPTI APFKAGLIVA GVALILNAAS DTFMLSLLKP LLDDGFGKTD 
RSVLVWMPLV VIGLMILRGI TSYVSSYCIS WVSGKVVMTM RRRLFGHMMG MPVSFFDKQS
TGTLLSRITY DSEQVASSSS GALITVVREG ASIIGLFIMM FYYSWQLSII LIVLAPIVSI
AIRVVSKRFR NISKNMQNTM GQVTTSAEQM LKGHKEVLIF GGQEVETKRF DKVSNRMRLQ
GMKMVSASSI SDPIIQLIAS LALAFVLYAA SFPSVMDSLT AGTITVVFSS MIALMRPLKS
LTNVNAQFQR GMAACQTLFT ILDSEQEKDE GKRVIERATG DVEFRNVTFT YPGRDVPALR
NINLKIPAGK TVALVGRSGS GKSTIASLIT RFYDIDEGEI LMDGHDLREY TLASLRNQVA
LVSQNVHLFN DTVANNIAYA RTEQYSREQI EEAARMAYAM DFINKMDNGL DTVIGENGVL
LSGGQRQRIA IARALLRDSP ILILDEATSA LDTESERAIQ AALDELQKNR TSLVIAHRLS
TIEKADEIVV VEDGVIVERG THNDLLEHRG VYAQLHKMQF GQ
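
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal, dependency-free Python sketch of that translation. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Alternative start codons are not handled, which is acceptable here because this gene begins with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(seq_text):
    """Translate a whitespace-grouped coding sequence up to the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    seq = "".join(seq_text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First display line of the gene sequence in this record.
first_line = "ATGCATAACG ACAAAGATCT CTCTACGTGG CAGACGTTCC GCCGACTGTG GCCAACCATT"
print(translate_cds(first_line))  # MHNDKDLSTWQTFRRLWPTI
```

The 20 residues printed match the first two blocks of the protein sequence above (MHNDKDLSTW QTFRRLWPTI).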