Gene ECH74115_5906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5906 
Symbol 
ID6966645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5559278 
End bp5560945 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content55% 
IMG OID643389521 
Productputative ABC transporter ATP-binding protein 
Protein accessionYP_002273912 
Protein GI209400314 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTCAAT TCGTTTATAC CATGCATCGT GTCGGCAAAG TTGTTCCGCC GAAACGTCAT 
ATTTTGAAAA ACATCTCTCT GAGTTTCTTC CCTGGGGCAA AAATTGGTGT CCTGGGTCTG
AACGGCGCGG GTAAGTCCAC CCTGCTGCGC ATTATGGCGG GCATTGATAA AGACATCGAA
GGTGAAGCGC GTCCGCAGCC AGACATCAAG ATTGGTTATC TGCCGCAGGA ACCGCAGCTG
AACCCGGAAC ACACCGTACG TGAGTCCATT GAAGAAGCGG TTTCAGAAGT GGTTAACGCC
CTGAAACGCC TGGATGAAGT GTATGCGCTG TACGCCGATC CGGATGCCGA TTTTGACAAG
CTGGCCGCTG AACAAGGCCG TCTGGAAGAG ATCATTCAGG CTCACGACGG TCATAATCTG
AACGTACAGC TGGAGCGTGC GGCGGATGCG CTACGTCTGC CGGACTGGGA CGCGAAAATC
GCTAACCTCT CCGGTGGTGA GCGTCGTCGC GTGGCGTTGT GCCGCCTGCT GCTGGAAAAA
CCAGACATGC TGCTGCTCGA CGAACCGACC AACCACCTGG ATGCCGAATC CGTGGCCTGG
CTGGAACGCT TCCTGCACGA CTTCGAAGGC ACCGTGGTGG CGATTACCCA CGACCGTTAC
TTCCTCGATA ACGTTGCGGG CTGGATCCTC GAACTTGACC GCGGTGAAGG TATTCCGTGG
GAAGGTAACT ACTCCTCCTG GCTGGAGCAG AAAGATCAGC GTCTGGCGCA GGAAGCTTCA
CAAGAAGCGG CGCGTCGTAA GTCGATCGAG AAAGAGCTGG AGTGGGTACG TCAGGGAACT
AAAGGCCGCC AGTCGAAAGG TAAAGCACGT CTGGCACGCT TTGAAGAACT GAACAGCACC
GAATATCAGA AACGTAACGA AACCAACGAA CTGTTTATTC CACCTGGACC GCGTCTGGGC
GATAAAGTGC TGGAAGTCAG CAACCTGCGT AAATCCTATG GCGATCGTCT GCTGATTGAT
GACCTGAGCT TCTCGATCCC GAAAGGAGCG ATCGTCGGGA TCATCGGTCC GAACGGTGCG
GGTAAATCGA CCCTGTTCCG TATGATCTCT GGTCAGGAAC AGCCGGACAG CGGCACCATC
ACTTTGGGTG AAACGGTGAA ACTGGCGTCG GTTGATCAGT TCCGTGACTC AATGGATAAC
AGCAAAACCG TTTGGGAAGA AGTTTCCGGC GGGCTGGATA TCATGAAGAT CGGCAACACC
GAGATGCCAA GCCGCGCCTA CGTTGGCCGC TTTAACTTTA AAGGGGTTGA TCAGGGTAAA
CGCGTTGGTG AACTCTCCGG TGGTGAGCGC GGTCGTCTGC ATCTGGCGAA GCTGCTGCAG
GTTGGCGGCA ACATGCTGCT GCTCGACGAA CCAACCAACG ACCTGGATAT CGAAACCCTG
CGCGCGCTGG AAAACGCCCT GCTGGAGTTC CCGGGCTGTG CGATGGTTAT CTCGCACGAC
CGTTGGTTCC TCGACCGTAT CGCCACGCAC ATTCTGGATT ACCAGGATGA AGGTAAAGTT
GAGTTCTTCG AAGGTAACTT TACCGAGTAC GAAGAGTACA AGAAACGCAC GCTGGGCGCA
GACGCGCTGG AGCCGAAGCG TATCAAGTAC AAGCGTATTG CGAAGTAA
 
Protein sequence
MAQFVYTMHR VGKVVPPKRH ILKNISLSFF PGAKIGVLGL NGAGKSTLLR IMAGIDKDIE 
GEARPQPDIK IGYLPQEPQL NPEHTVRESI EEAVSEVVNA LKRLDEVYAL YADPDADFDK
LAAEQGRLEE IIQAHDGHNL NVQLERAADA LRLPDWDAKI ANLSGGERRR VALCRLLLEK
PDMLLLDEPT NHLDAESVAW LERFLHDFEG TVVAITHDRY FLDNVAGWIL ELDRGEGIPW
EGNYSSWLEQ KDQRLAQEAS QEAARRKSIE KELEWVRQGT KGRQSKGKAR LARFEELNST
EYQKRNETNE LFIPPGPRLG DKVLEVSNLR KSYGDRLLID DLSFSIPKGA IVGIIGPNGA
GKSTLFRMIS GQEQPDSGTI TLGETVKLAS VDQFRDSMDN SKTVWEEVSG GLDIMKIGNT
EMPSRAYVGR FNFKGVDQGK RVGELSGGER GRLHLAKLLQ VGGNMLLLDE PTNDLDIETL
RALENALLEF PGCAMVISHD RWFLDRIATH ILDYQDEGKV EFFEGNFTEY EEYKKRTLGA
DALEPKRIKY KRIAK