Gene SeD_A1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1049 
SymbolmsbA 
ID6873596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1050408 
End bp1052156 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content52% 
IMG OID642784234 
Productlipid transporter ATP-binding/permease protein 
Protein accessionYP_002214908 
Protein GI198244768 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID[TIGR02203] lipid A export permease/ATP-binding protein MsbA 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value0.183717 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATAACG ATAAAGATCT CTCTACGTGG CAGACCTTCC GCCGACTGTG GCCAACCATA 
GCGCCTTTTA AAGCAGGTCT GATCGTGGCG GGCATAGCGT TAATTCTCAA CGCAGCCAGC
GATACCTTCA TGCTATCGCT CCTCAAGCCA TTACTGGATG ATGGTTTCGG TAAAACGGAT
CGCTCAGTGT TGCTGTGGAT GCCGCTGGTG GTTATTGGGC TGATGATATT ACGAGGCATC
ACTAGCTATA TCTCCAGCTA CTGTATTTCA TGGGTGTCAG GCAAGGTGGT AATGACCATG
CGCCGTCGCC TGTTTGGCCA TATGATGGGA ATGCCCGTCG CTTTCTTTGA TAAACAGTCT
ACCGGTACGC TGCTGTCGCG TATTACATAC GATTCAGAAC AGGTTGCCTC TTCTTCATCT
GGCGCGCTGA TTACCGTGGT GCGTGAAGGG GCATCGATCA TCGGATTGTT TATCATGATG
TTCTATTACA GCTGGCAGCT GTCGATCATC CTGGTTGTTT TAGCGCCGAT TGTGTCGATT
GCGATTCGCG TTGTCTCAAA GCGGTTCCGC AGCATCAGTA AAAATATGCA GAACACGATG
GGACAAGTGA CTACCAGCGC TGAACAAATG CTGAAAGGAC ACAAAGAGGT ACTGATTTTT
GGCGGTCAGG AAGTCGAAAC TAAACGCTTT GATAAAGTCA GCAATAAGAT GCGACTGCAA
GGCATGAAAA TGGTCTCTGC CTCGTCAATT TCCGATCCTA TCATTCAGCT CATTGCCTCG
CTGGCGCTGG CGTTTGTCCT CTATGCTGCG AGCTTCCCAA GCGTAATGGA TAGCCTGACG
GCAGGGACCA TCACCGTGGT GTTCTCCTCC ATGATCGCGC TGATGCGTCC ATTAAAATCG
CTGACAAACG TTAACGCGCA GTTCCAGCGT GGGATGGCGG CTTGTCAGAC GTTGTTTGCG
ATTCTGGACA GCGAACAGGA GAAAGATGAA GGTAAACGTG TCATTGACCG CGCGACCGGC
GATCTCGAAT TCCGCAATGT GACGTTTACT TACCCGGGCC GTGAAGTGCC GGCATTGCGT
AACATCAATT TGAAAATTCC TGCCGGGAAA ACCGTGGCGC TGGTGGGGCG TTCCGGATCG
GGTAAATCAA CTATCGCCAG TCTGATCACC CGTTTCTACG ATATTGATGA AGGACACATC
CTGATGGATG GTCACGATCT ACGCGAATAC ACTCTGGCCT CTCTACGTAA TCAGGTGGCG
CTGGTTTCGC AAAACGTGCA TCTGTTTAAC GACACGGTCG CCAATAACAT TGCTTATGCC
CGGACGGAAG AATACAGCCG CGAGCAGATT GAAGAGGCGG CGCGCATGGC CTATGCCATG
GACTTTATCA ATAAGATGGA TAATGGCCTG GATACCATCA TCGGCGAAAA CGGCGTACTG
CTTTCCGGCG GTCAGCGCCA GCGTATCGCG ATCGCCCGCG CCTTACTGCG TGACAGCCCG
ATTCTGATCC TTGATGAAGC TACGTCCGCG CTGGATACCG AATCTGAACG TGCGATTCAG
GCAGCGTTGG ATGAGCTGCA GAAAAACCGT ACCTCTCTGG TGATTGCGCA CCGTCTCTCC
ACCATCGAAC AGGCGGATGA GATCGTTGTA GTCGAAGACG GTATTATCGT TGAGCGCGGC
ACTCATAGCG AGCTGCTGGC GCAACACGGC GTTTACGCCC AGCTACATAA GATGCAATTT
GGCCAATGA
 
Protein sequence
MHNDKDLSTW QTFRRLWPTI APFKAGLIVA GIALILNAAS DTFMLSLLKP LLDDGFGKTD 
RSVLLWMPLV VIGLMILRGI TSYISSYCIS WVSGKVVMTM RRRLFGHMMG MPVAFFDKQS
TGTLLSRITY DSEQVASSSS GALITVVREG ASIIGLFIMM FYYSWQLSII LVVLAPIVSI
AIRVVSKRFR SISKNMQNTM GQVTTSAEQM LKGHKEVLIF GGQEVETKRF DKVSNKMRLQ
GMKMVSASSI SDPIIQLIAS LALAFVLYAA SFPSVMDSLT AGTITVVFSS MIALMRPLKS
LTNVNAQFQR GMAACQTLFA ILDSEQEKDE GKRVIDRATG DLEFRNVTFT YPGREVPALR
NINLKIPAGK TVALVGRSGS GKSTIASLIT RFYDIDEGHI LMDGHDLREY TLASLRNQVA
LVSQNVHLFN DTVANNIAYA RTEEYSREQI EEAARMAYAM DFINKMDNGL DTIIGENGVL
LSGGQRQRIA IARALLRDSP ILILDEATSA LDTESERAIQ AALDELQKNR TSLVIAHRLS
TIEQADEIVV VEDGIIVERG THSELLAQHG VYAQLHKMQF GQ