Gene ECH74115_3318 details


Gene Information

Locus tag: ECH74115_3318
Symbol:
ID: 6969196
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3052592
End bp: 3054181
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 52%
IMG OID: 643387130
Product: ABC transporter, ATP-binding protein
Protein accession: YP_002271594
Protein GI: 209396920
COG category: [R] General function prediction only
COG ID: [COG4172] ABC-type uncharacterized transport system, duplicated ATPase component
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
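
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3052592
end_bp = 3054181

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1590
print(protein_length)  # 529
```

Under these assumptions the computed values agree with the listed Gene Length (1590 bp) and Protein Length (529 aa).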


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.561111
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.65051
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCAAA CTCTGTTAGC GATTGAAGAT TTGTCGGTGG GTTTTCGCCA TCAGCAAACC 
GTACGTACAG TAGTCAATGA TGTTTCACTA CAGATTGAGG CTGGCGAAAC GCTGGCGCTG
GTGGGTGAGT CAGGTTCAGG CAAAAGCGTT ACCGCGCTGT CAATTTTACG CCTGCTCCCT
TCCCCGCCGG TTGAATATCT CTCCGGCGAT ATTCGTTTTC ATGGCGAATC GCTGCTTCAT
GCCAGCGACC AAACGTTGCG CGGTGTACGC GGTAATAAGA TCGCCATGAT TTTTCAAGAG
CCGATGGTGT CGTTAAATCC ATTGCATACC CTGGAAAAAC AGCTTTATGA AGTGCTTTCA
CTCCACCGCG GGATGCGTCG GGAAGCGGCT CGTGGCGAAA TTCTTAACTG CCTTGATCGC
GTTGGTATCC GCCAGGCGGC AAAACGGCTG ACAGATTATC CGCATCAGCT CTCCGGCGGC
GAACGCCAGC GGGTGATGAT TGCGATGGCG CTGTTAACGC GACCGGAATT ATTAATTGCC
GATGAACCGA CCACCGCACT GGACGTCTCT GTCCAGGCGC AGATTTTACA GCTGTTGCGC
GAACTGCAAG GCGAGTTGAA TATGGGCATG CTGTTTATTA CTCATAACCT CAGCATTGTC
AGAAAACTGG CCCACCGCGT GGCGGTAATG CAAAACGGTC GCTGTGTCGA GCAAAATTAC
GCCGCTACGC TATTTGCCTC ACCCACTCAT CCTTACACAC AAAAGCTACT CAACAGTGAA
CCGTCAGGCG ATCCAGTGCC GTTGCCAGAA CCTGCCTCAA CGTTGCTGGA TGTTGAACAG
CTTCAGGTTG CCTTCCCCAT TCGCAAAGGG ATTTTGAAGC GCATTGTGGA TCATAATGTG
GTGGTGAAAA ACATCAGTTT TACGCTACGA GCGGGTGAAA CACTGGGTTT AGTGGGCGAG
TCCGGTTCCG GGAAAAGCAC GACGGGACTG GCGCTGCTGC GACTGATTAA TTCTCAGGGC
AGCATCATCT TTGACGGTCA GCCACTGCAA AATTTAAATC GCCGCCAGCT GTTACCTATT
CGTCATCGCA TTCAGGTGGT ATTTCAGGAT CCAAACTCCT CGCTCAACCC ACGACTCAAC
GTTTTGCAGA TTATTGAGGA AGGCTTACGG GTTCACCAGC CGACGCTTTC TGCCGCACAA
CGCGAACAAC AAGTGATAGA CGTGATGCAT GAAGTGGGAT TAGATCCTGA AACACGCCAC
CGTTATCCGG CGGAGTTCTC TGGTGGTCAG CGACAACGTA TTGCGATTGC CAGGGCATTA
ATTCTTAAGC CCTCGCTGAT CATACTTGAT GAACCGACAT CATCACTCGA CAAAACGGTA
CAGGCGCAAA TATTGACGCT ATTGAAATCA TTGCAACAAA AGCATCAACT GGCCTATTTG
TTTATCAGCC ACGATTTGCA CGTTGTCCGC GCGTTATGTC ATCAGGTTAT CGTACTGCGA
CAAGGGGAAG TAGTGGAACA AGGACCGTGC GCGCGCGTGT TTGCCACACC GCAGCAGGAG
TATACGCGTC AGCTACTGGC GTTGAGCTGA
 
Protein sequence
MTQTLLAIED LSVGFRHQQT VRTVVNDVSL QIEAGETLAL VGESGSGKSV TALSILRLLP 
SPPVEYLSGD IRFHGESLLH ASDQTLRGVR GNKIAMIFQE PMVSLNPLHT LEKQLYEVLS
LHRGMRREAA RGEILNCLDR VGIRQAAKRL TDYPHQLSGG ERQRVMIAMA LLTRPELLIA
DEPTTALDVS VQAQILQLLR ELQGELNMGM LFITHNLSIV RKLAHRVAVM QNGRCVEQNY
AATLFASPTH PYTQKLLNSE PSGDPVPLPE PASTLLDVEQ LQVAFPIRKG ILKRIVDHNV
VVKNISFTLR AGETLGLVGE SGSGKSTTGL ALLRLINSQG SIIFDGQPLQ NLNRRQLLPI
RHRIQVVFQD PNSSLNPRLN VLQIIEEGLR VHQPTLSAAQ REQQVIDVMH EVGLDPETRH
RYPAEFSGGQ RQRIAIARAL ILKPSLIILD EPTSSLDKTV QAQILTLLKS LQQKHQLAYL
FISHDLHVVR ALCHQVIVLR QGEVVEQGPC ARVFATPQQE YTRQLLALS
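
The relationship between the two sequence blocks can be illustrated with a short script. This is a sketch only, not the method used to produce the record: it assumes the space-separated 10-character groups are purely a display convention, and it uses the amino-acid assignments shared by the standard genetic code and translation table 11 (the tables differ in permitted start codons, which this sketch does not model). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino-acid
# assignments are the same for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(block.split()).upper()


def gc_fraction(block: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(block)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(block: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(block)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 30 bases of the gene sequence, as displayed.
first_bases = "ATGACGCAAA CTCTGTTAGC GATTGAAGAT"
last_bases = "TATACGCGTC AGCTACTGGC GTTGAGCTGA"

print(translate(first_bases))  # MTQTLLAIED
print(translate(last_bases))   # YTRQLLALS
```

The two outputs match the first and last residues of the protein sequence above. Passing the full gene sequence block to `gc_fraction` would give the value to compare against the listed GC content.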