Gene EcSMS35_0817 details

Gene Information

Locus tag: EcSMS35_0817
Symbol:
ID: 6144028
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 818941
End bp: 820677
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 56%
IMG OID: 641615705
Product: ABC transporter, ATP-binding protein
Protein accession: YP_001742897
Protein GI: 170680534
COG category: [V] Defense mechanisms
COG ID: [COG1131] ABC-type multidrug transport system, ATPase component
TIGRFAM ID:
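
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; neither is stated in the record, but both agree with the listed values. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(818941, 820677)
print(length_bp, protein_length(length_bp))  # 1737 578
```

Both results match the Gene Length and Protein Length fields of the record.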


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGATG CCGTTATCAC GCTGAACGGC CTGGAAAAAC GCTTTCCGGG CATGGACAAG 
CCCGCCGTCG CGCCGCTCGA TTGTACCATT CACGCCGGTT ATGTGACGGG GTTGGTGGGG
CCGGACGGTG CAGGTAAAAC CACGCTGATG CGGATGTTGG CGGGATTACT GAAACCCGAC
AGCGGCAGTG CCACGGTGAT TGGCTTTGAT CCGATCAAAA ACGACGGCGC GCTGCACGCC
GTGCTCGGCT ATATGCCGCA GAAATTTGGT CTGTATGAAG ATCTCACGGT GATGGAGAAC
CTCAATCTGT ACGCGGATTT GCGCAGCGTC ACCGGCGAGG CACGGAAGCA AACTTTTGCT
CGCCTGCTGG AGTTTACGTC TCTTGGGCCG TTTACCGGAC GCCTGGCGGG CAAGCTCTCC
GGTGGGATGA AACAAAAACT CGGTCTGGCC TGTACCCTGG TGGGCGAACC GAAAGTGTTG
CTGCTCGATG AACCCGGCGT CGGCGTTGAC CCTATCTCAC GGCGCGAACT ATGGCAGATG
GTGCATGAGC TGGCAGGCGA AGGGATGTTA ATCCTCTGGA GTACCTCGTA TCTCGACGAA
GCCGAGCAGT GCCGTGACGT ATTGCTGATG AACGAAGGCG AGCTGCTGTA TCAGGGAGAA
CCGACGGCCC TGACTCAAAC CATGGCCGGA CGCAGCTTTC TGATGACCAG CCCGCACGAG
GGCAACCGCA AACTGTTGCA ACGGGCATTG AAACTGCCGC AGGTCAGCGA CGGCATGATT
CAGGGGAAAT CGGTACGTCT GATCCTCAAA AAAGAGGCTA CACCAGACGA TATTCGCCAT
GCCGACGGGA TGCCGGAAAT CAACATTAAC GAAACTACGC CACGTTTTGA AGATGCGTTT
ATTGATTTGC TGGGCGGTGC CGGAACCTCG GAATCGCCGC TGGGCGCAAT ATTGCATACG
GTGGAAGGTA CACCTGGCGA GACGGTGATC GAAGCGAAAG AACTGACCAA GAAATTTGGT
GATTTTGCCG CCACCGATCA CGTCAACTTT GCCGTTAAAC GCGGGGAGAT TTTTGGTTTG
CTGGGGCCAA ACGGCGCGGG CAAATCGACC ACCTTTAAGA TGATGTGCGG TTTGCTGGTG
CCGACTTCCG GCCAGGCGCT GGTGCTGGGG ATGGATCTGA AAGAGAGTTC CGGTAAAGCG
CGCCAGCATC TCGGCTATAT GGCGCAAAAA TTTTCGCTCT ACGGCAACCT GACGGTCGAA
CAGAATTTAC GCTTTTTCTC TGGTGTGTAT GGCTTACGCG GTCGGGCGCA GAACGAAAAA
ATCTCCCGTA TGAGCGAGGC GTTCGGCCTG AAAAGTATCG CCTCCCACGC GACCGATGAA
CTGCCATTAG GTTTTAAACA GCGGCTGGCG CTGGCCTGTT CGCTGATGCA TGAACCGGAC
ATTCTGTTTC TCGACGAACC GACGTCCGGC GTTGATCCCC TCACCCGCCG TGAATTTTGG
CTACATATCA ACAGTATGGT AGAGAAAGGC GTCACGGTGA TGGTCACCAC CCACTTTATG
GATGAAGCGG AATATTGCGA CCGCATCGGC CTGGTGTACC GCGGGAAATT AATCGCCAGC
GGCACGCCGG ACGATTTGAA AGCGCAGTCG GCCAACGATG AGCAACCCGA TCCCACGATG
GAGCAAGCCT TTATTCAGTT GATCCACGAC TGGGATAAGG AGCATGGCAA TGAGTAA
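
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be stripped before the sequence can be analysed. As a minimal sketch, the helpers below (hypothetical names, standard library only) clean such a block and compute GC content, which is how a figure like the record's GC content field could be reproduced from the full sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence above.
print(gc_percent("ATGAATGATG CCGTTATCAC"))  # 40.0
```

Passing the complete gene sequence instead of the two-block excerpt gives the gene-wide value.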
 
Protein sequence
MNDAVITLNG LEKRFPGMDK PAVAPLDCTI HAGYVTGLVG PDGAGKTTLM RMLAGLLKPD 
SGSATVIGFD PIKNDGALHA VLGYMPQKFG LYEDLTVMEN LNLYADLRSV TGEARKQTFA
RLLEFTSLGP FTGRLAGKLS GGMKQKLGLA CTLVGEPKVL LLDEPGVGVD PISRRELWQM
VHELAGEGML ILWSTSYLDE AEQCRDVLLM NEGELLYQGE PTALTQTMAG RSFLMTSPHE
GNRKLLQRAL KLPQVSDGMI QGKSVRLILK KEATPDDIRH ADGMPEININ ETTPRFEDAF
IDLLGGAGTS ESPLGAILHT VEGTPGETVI EAKELTKKFG DFAATDHVNF AVKRGEIFGL
LGPNGAGKST TFKMMCGLLV PTSGQALVLG MDLKESSGKA RQHLGYMAQK FSLYGNLTVE
QNLRFFSGVY GLRGRAQNEK ISRMSEAFGL KSIASHATDE LPLGFKQRLA LACSLMHEPD
ILFLDEPTSG VDPLTRREFW LHINSMVEKG VTVMVTTHFM DEAEYCDRIG LVYRGKLIAS
GTPDDLKAQS ANDEQPDPTM EQAFIQLIHD WDKEHGNE
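
The protein sequence follows from the gene sequence by translation with table 11. Table 11 assigns the same amino acid to each codon as the standard genetic code and differs only in which codons may act as start codons, so a standard codon table is enough for this gene, which begins with ATG. The sketch below is a hand-rolled translator under that assumption, not the database's own pipeline; `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the gene sequence above.
print(translate("ATGAATGATG CCGTTATCAC G"))  # MNDAVIT
print(translate("GGCAATGAGTAA"))             # GNE
```

The two outputs match the start (MNDAVIT...) and end (...GNE) of the protein sequence listed above.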