Gene ECH74115_0980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0980 
SymbolgsiA 
ID6967896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp995199 
End bp997088 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content54% 
IMG OID643384996 
Productglutathione transporter ATP-binding protein 
Protein accessionYP_002269496 
Protein GI209400349 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAAAGG GGACACCGTT GCCACACAGT GATGAACTTG ATGCCGGTAA TGTGCTGGCG 
GTTGAAAATC TTAATATTGC CTTTATGCAG GACCAGCAGA AAATAGCTGC GGTCCGCAAT
CTCTCTTTTA GTCTGCAACG CGGTGAGACG CTGGCAATTG TTGGCGAATC CGGCTCCGGT
AAGTCAGTGA CTGCGCTGGC ATTGATGCGT CTGTTGGAAC AGGCGGGCGG TTTAGTGCAG
TGCGATAAAA TGCTGTTGCG GCGGCGCAGT CGTGATGTGA TTGAACTTAG CGAGCAGAGC
GCTGCACAAA TGCGCCATGT GCGCGGTGCG GATATGGCGA TGATATTTCA GGAGCCGATG
ACATCGCTGA ACCCAGTATT TACTGTGGGT GAACAGATTG CCGAATCAAT TCGTCTGCAT
CAGAACGCCA GTCGTGAAGA AGCGATGGTC GAGGCGAAGC GGATGCTGGA TCAGGTACGC
ATTCCGGAGG CACAAACCAT TCTTTCACGT TATCCGCATC AACTCTCTGG CGGGATGCGC
CAGCGAGTGA TGATTGCGAT GGCGCTGTCA TGCCGCCCGG CAGTGCTGAT AGCCGATGAG
CCAACCACCG CGCTGGATGT CACTATTCAG GCGCAGATCC TGCAATTAAT CAAAGTATTG
CAAAAAGAGA TGTCGATGGG CGTTATCTTT ATCACTCACG ATATGGGCGT GGTGGCAGAG
ATTGCCGATC GGGTACTGGT GATGTATCAG GGCGAGGCGG TGGAAACGGG TACCGTCGAA
CAGATTTTTC ATGCACCGCA ACATCCCTAC ACCCGTGCGC TGTTAGCTGC TGTTCCGCAA
CTTGGTGCGA TGAAAGGGTT AGATTATCCC CGACGTTTCC CATTGATATC GCTTGAACAT
CCAGCGAAAC AGGCCCCCCC CATCGAGCAG AAAACGGTGG TGGATGGCGA ACCTGTTTTA
CGGGTGCGTA ATCTGGTCAC CCGTTTCCCT TTGCGCAGCG GTTTGTTGAA TCGCGTAACG
CGGGAAGTGC ATGCCGTTGA GAAAGTCAGT TTTGATCTCT GGCCTGGCGA AACGCTATCG
CTGGTGGGCG AGTCTGGCAG CGGTAAATCC ACTACCGGGC GGGCGTTGCT GCGCCTGGTC
GAATCGCAGG GCGGCGAAAT TATCTTTAAC GGTCAGCGAA TCGATACCTT GTCACCCGGC
AAACTTCAGG CATTGCGCCG CGATATTCAG TTTATTTTTC AGGACCCTTA CGCTTCGCTG
GACCCACGTC AGACCATCGG TGATTCGATT ATCGAACCGC TGCGCGTACA CGGTTTATTG
CCAGGTAAAG AAGCGGTTGC ACGCGTTGCG TGGTTGCTGG AGCGCGTGGG CCTGTTACCT
GAACATGCCT GGCGTTACCC GCATGAGTTT TCCGGCGGTC AGCGCCAGCG CATCTGCATT
GCTCGCGCGT TGGCATTGAA TCCAAAAGTG ATCATTGCCG ACGAAGCCGT TTCGGCGCTG
GATGTTTCAA TTCGCGGGCA GATTATCAAC TTGTTGCTCG ATCTCCAGCG TGATTTCGGC
ATTGCGTATC TGTTTATCTC CCACGATATG GCTGTGGTAG AGCGGATTAG TCATCGTGTG
GCGGTGATGT ATCTCGGGCA AATTGTTGAA ATTGGTCCAC GGCGCGCGGT CTTCGAAAAC
CCGCAGCATC CTTATACGCG TAAATTACTG GCGGCAGTTC CGGTCGCTGA ACCGTCCCGA
CAACGACCGC AGCGTGTACT GCTGTCGGAC GATCTTCCCA GCAATATTCA TCTGCGTGGC
GAAGAGGTGG CAGCCGTCTC GTTGCAATGC GTCGGGCCGG GGCATTACGT CGCACAACCA
CAATCAGAAT ACGCATTCAT GCGTAGATAA
 
Protein sequence
MKKGTPLPHS DELDAGNVLA VENLNIAFMQ DQQKIAAVRN LSFSLQRGET LAIVGESGSG 
KSVTALALMR LLEQAGGLVQ CDKMLLRRRS RDVIELSEQS AAQMRHVRGA DMAMIFQEPM
TSLNPVFTVG EQIAESIRLH QNASREEAMV EAKRMLDQVR IPEAQTILSR YPHQLSGGMR
QRVMIAMALS CRPAVLIADE PTTALDVTIQ AQILQLIKVL QKEMSMGVIF ITHDMGVVAE
IADRVLVMYQ GEAVETGTVE QIFHAPQHPY TRALLAAVPQ LGAMKGLDYP RRFPLISLEH
PAKQAPPIEQ KTVVDGEPVL RVRNLVTRFP LRSGLLNRVT REVHAVEKVS FDLWPGETLS
LVGESGSGKS TTGRALLRLV ESQGGEIIFN GQRIDTLSPG KLQALRRDIQ FIFQDPYASL
DPRQTIGDSI IEPLRVHGLL PGKEAVARVA WLLERVGLLP EHAWRYPHEF SGGQRQRICI
ARALALNPKV IIADEAVSAL DVSIRGQIIN LLLDLQRDFG IAYLFISHDM AVVERISHRV
AVMYLGQIVE IGPRRAVFEN PQHPYTRKLL AAVPVAEPSR QRPQRVLLSD DLPSNIHLRG
EEVAAVSLQC VGPGHYVAQP QSEYAFMRR