Gene EcHS_A1021 details

Gene Information

Locus tag: EcHS_A1021
Symbol: msbA
ID: 5592454
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1026959
End bp: 1028707
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 51%
IMG OID: 640920188
Product: lipid transporter ATP-binding/permease protein
Protein accession: YP_001457753
Protein GI: 157160435
COG category: [V] Defense mechanisms
COG ID: [COG1132] ABC-type multidrug transport system, ATPase and permease components
TIGRFAM ID: [TIGR02203] lipid A export permease/ATP-binding protein MsbA
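
The start, end, and length fields above agree with each other if the coordinates are read as 1-based and inclusive. The Python sketch below shows that arithmetic. The function names are illustrative only and do not belong to any database API. It assumes an unspliced CDS whose last codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(1026959, 1028707)
    print(length)                  # 1749
    print(protein_length(length))  # 582
```

Both values match the Gene Length and Protein Length fields in this record.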


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.0261521
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATAACG ACAAAGATCT CTCTACGTGG CAGACATTCC GCCGACTGTG GCCAACCATT 
GCGCCTTTCA AAGCGGGTCT GATCGTGGCG GGCGTAGCGT TAATCCTCAA CGCAGCCAGC
GATACCTTCA TGTTATCGCT CCTTAAGCCA CTTCTTGATG ATGGCTTTGG TAAAACAGAT
CGCTCCGTGC TGGTGTGGAT GCCGCTGGTG GTGATCGGGC TGATGATTTT ACGTGGTATC
ACCAGCTATG TCTCCAGCTA CTGTATCTCC TGGGTATCAG GAAAGGTGGT AATGACCATG
CGTCGCCGCC TGTTTGGTCA CATGATGGGA ATGCCAGTTT CATTCTTTGA CAAACAGTCA
ACGGGTACGC TGTTGTCACG TATTACCTAC GATTCCGAAC AGGTTGCTTC TTCTTCTTCC
GGCGCACTGA TTACTGTTGT GCGTGAAGGT GCGTCGATCA TCGGCCTGTT CATCATGATG
TTCTATTACA GTTGGCAACT GTCGATCATT TTGATTGTGC TGGCACCGAT TGTTTCGATT
GCGATTCGCG TTGTATCGAA GCGTTTTCGC AACATCAGTA AAAACATGCA GAACACCATG
GGGCAGGTGA CCACCAGCGC AGAACAAATG CTGAAGGGCC ACAAAGAAGT ATTGATTTTC
GGTGGTCAGG AAGTGGAAAC GAAACGCTTT GATAAAGTCA GCAACCGAAT GCGTCTTCAG
GGGATGAAAA TGGTTTCAGC CTCTTCCATC TCTGATCCGA TCATTCAGCT GATCGCCTCT
TTGGCGCTGG CGTTTGTTCT GTATGCGGCG AGCTTCCCAA GTGTCATGGA TAGCCTGACT
GCCGGTACGA TTACCGTTGT TTTCTCTTCA ATGATTGCAC TGATGCGTCC GCTGAAATCG
CTGACCAACG TTAACGCCCA GTTCCAGCGC GGTATGGCGG CTTGTCAGAC GCTGTTTACC
ATTCTGGACA GTGAGCAGGA GAAAGATGAA GGTAAGCGCG TGATCGAGCG TGCGACTGGC
GACGTGGAAT TCCGCAATGT CACCTTTACT TATCCGGGAC GTGACGTACC TGCATTGCGT
AACATCAACC TGAAAATTCC GGCAGGGAAG ACGGTTGCTC TGGTTGGACG CTCTGGTTCG
GGTAAATCAA CCATCGCCAG CCTGATCACG CGTTTTTACG ATATTGATGA AGGCGAAATC
CTGATGGATG GTCACGATCT GCGCGAGTAT ACCCTGGCGT CGTTACGTAA CCAGGTTGCT
CTGGTGTCGC AGAATGTCCA TCTGTTTAAC GATACGGTTG CTAACAACAT TGCTTACGCA
CGGACTGAAC AGTACAGCCG TGAGCAAATT GAAGAAGCGG CGCGTATGGC CTACGCCATG
GACTTCATCA ATAAGATGGA TAACGGTCTC GATACAGTGA TTGGTGAAAA CGGCGTGCTG
CTCTCTGGCG GTCAGCGTCA GCGTATTGCT ATCGCTCGAG CCTTGTTGCG TGATAGCCCG
ATTCTGATTC TGGACGAAGC TACCTCGGCT CTGGATACCG AATCCGAACG TGCGATTCAG
GCGGCACTGG ATGAGTTGCA GAAAAACCGT ACCTCTCTGG TGATTGCCCA CCGCTTGTCT
ACCATTGAAA AGGCAGACGA AATCGTGGTC GTCGAGGATG GTGTCATTGT GGAACGCGGT
ACGCATAACG ATTTGCTTGA GCACCGCGGC GTTTACGCGC AACTTCACAA AATGCAGTTT
GGCCAATGA
 
Protein sequence
MHNDKDLSTW QTFRRLWPTI APFKAGLIVA GVALILNAAS DTFMLSLLKP LLDDGFGKTD 
RSVLVWMPLV VIGLMILRGI TSYVSSYCIS WVSGKVVMTM RRRLFGHMMG MPVSFFDKQS
TGTLLSRITY DSEQVASSSS GALITVVREG ASIIGLFIMM FYYSWQLSII LIVLAPIVSI
AIRVVSKRFR NISKNMQNTM GQVTTSAEQM LKGHKEVLIF GGQEVETKRF DKVSNRMRLQ
GMKMVSASSI SDPIIQLIAS LALAFVLYAA SFPSVMDSLT AGTITVVFSS MIALMRPLKS
LTNVNAQFQR GMAACQTLFT ILDSEQEKDE GKRVIERATG DVEFRNVTFT YPGRDVPALR
NINLKIPAGK TVALVGRSGS GKSTIASLIT RFYDIDEGEI LMDGHDLREY TLASLRNQVA
LVSQNVHLFN DTVANNIAYA RTEQYSREQI EEAARMAYAM DFINKMDNGL DTVIGENGVL
LSGGQRQRIA IARALLRDSP ILILDEATSA LDTESERAIQ AALDELQKNR TSLVIAHRLS
TIEKADEIVV VEDGVIVERG THNDLLEHRG VYAQLHKMQF GQ
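
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks must be removed before any analysis. The Python sketch below strips the layout whitespace, computes a GC fraction, and translates a coding sequence. All function names are hypothetical. The codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence listed above.
    fragment = clean("ATGCATAACG ACAAAGATCT")
    print(translate(fragment))  # MHNDKD
```

Applied to the full gene sequence, the same functions can be used to check the record's GC content and protein sequence fields.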