Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0980 |
Symbol | gsiA |
ID | 6967896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 995199 |
End bp | 997088 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643384996 |
Product | glutathione transporter ATP-binding protein |
Protein accession | YP_002269496 |
Protein GI | 209400349 |
COG category | [R] General function prediction only |
COG ID | [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase |
TIGRFAM ID | [TIGR02323] phosphonate C-P lyase system protein PhnK |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAAAGG GGACACCGTT GCCACACAGT GATGAACTTG ATGCCGGTAA TGTGCTGGCG GTTGAAAATC TTAATATTGC CTTTATGCAG GACCAGCAGA AAATAGCTGC GGTCCGCAAT CTCTCTTTTA GTCTGCAACG CGGTGAGACG CTGGCAATTG TTGGCGAATC CGGCTCCGGT AAGTCAGTGA CTGCGCTGGC ATTGATGCGT CTGTTGGAAC AGGCGGGCGG TTTAGTGCAG TGCGATAAAA TGCTGTTGCG GCGGCGCAGT CGTGATGTGA TTGAACTTAG CGAGCAGAGC GCTGCACAAA TGCGCCATGT GCGCGGTGCG GATATGGCGA TGATATTTCA GGAGCCGATG ACATCGCTGA ACCCAGTATT TACTGTGGGT GAACAGATTG CCGAATCAAT TCGTCTGCAT CAGAACGCCA GTCGTGAAGA AGCGATGGTC GAGGCGAAGC GGATGCTGGA TCAGGTACGC ATTCCGGAGG CACAAACCAT TCTTTCACGT TATCCGCATC AACTCTCTGG CGGGATGCGC CAGCGAGTGA TGATTGCGAT GGCGCTGTCA TGCCGCCCGG CAGTGCTGAT AGCCGATGAG CCAACCACCG CGCTGGATGT CACTATTCAG GCGCAGATCC TGCAATTAAT CAAAGTATTG CAAAAAGAGA TGTCGATGGG CGTTATCTTT ATCACTCACG ATATGGGCGT GGTGGCAGAG ATTGCCGATC GGGTACTGGT GATGTATCAG GGCGAGGCGG TGGAAACGGG TACCGTCGAA CAGATTTTTC ATGCACCGCA ACATCCCTAC ACCCGTGCGC TGTTAGCTGC TGTTCCGCAA CTTGGTGCGA TGAAAGGGTT AGATTATCCC CGACGTTTCC CATTGATATC GCTTGAACAT CCAGCGAAAC AGGCCCCCCC CATCGAGCAG AAAACGGTGG TGGATGGCGA ACCTGTTTTA CGGGTGCGTA ATCTGGTCAC CCGTTTCCCT TTGCGCAGCG GTTTGTTGAA TCGCGTAACG CGGGAAGTGC ATGCCGTTGA GAAAGTCAGT TTTGATCTCT GGCCTGGCGA AACGCTATCG CTGGTGGGCG AGTCTGGCAG CGGTAAATCC ACTACCGGGC GGGCGTTGCT GCGCCTGGTC GAATCGCAGG GCGGCGAAAT TATCTTTAAC GGTCAGCGAA TCGATACCTT GTCACCCGGC AAACTTCAGG CATTGCGCCG CGATATTCAG TTTATTTTTC AGGACCCTTA CGCTTCGCTG GACCCACGTC AGACCATCGG TGATTCGATT ATCGAACCGC TGCGCGTACA CGGTTTATTG CCAGGTAAAG AAGCGGTTGC ACGCGTTGCG TGGTTGCTGG AGCGCGTGGG CCTGTTACCT GAACATGCCT GGCGTTACCC GCATGAGTTT TCCGGCGGTC AGCGCCAGCG CATCTGCATT GCTCGCGCGT TGGCATTGAA TCCAAAAGTG ATCATTGCCG ACGAAGCCGT TTCGGCGCTG GATGTTTCAA TTCGCGGGCA GATTATCAAC TTGTTGCTCG ATCTCCAGCG TGATTTCGGC ATTGCGTATC TGTTTATCTC CCACGATATG GCTGTGGTAG AGCGGATTAG TCATCGTGTG GCGGTGATGT ATCTCGGGCA AATTGTTGAA ATTGGTCCAC GGCGCGCGGT CTTCGAAAAC CCGCAGCATC CTTATACGCG TAAATTACTG GCGGCAGTTC CGGTCGCTGA ACCGTCCCGA CAACGACCGC AGCGTGTACT GCTGTCGGAC GATCTTCCCA GCAATATTCA TCTGCGTGGC GAAGAGGTGG CAGCCGTCTC GTTGCAATGC GTCGGGCCGG GGCATTACGT CGCACAACCA CAATCAGAAT ACGCATTCAT GCGTAGATAA
|
Protein sequence | MKKGTPLPHS DELDAGNVLA VENLNIAFMQ DQQKIAAVRN LSFSLQRGET LAIVGESGSG KSVTALALMR LLEQAGGLVQ CDKMLLRRRS RDVIELSEQS AAQMRHVRGA DMAMIFQEPM TSLNPVFTVG EQIAESIRLH QNASREEAMV EAKRMLDQVR IPEAQTILSR YPHQLSGGMR QRVMIAMALS CRPAVLIADE PTTALDVTIQ AQILQLIKVL QKEMSMGVIF ITHDMGVVAE IADRVLVMYQ GEAVETGTVE QIFHAPQHPY TRALLAAVPQ LGAMKGLDYP RRFPLISLEH PAKQAPPIEQ KTVVDGEPVL RVRNLVTRFP LRSGLLNRVT REVHAVEKVS FDLWPGETLS LVGESGSGKS TTGRALLRLV ESQGGEIIFN GQRIDTLSPG KLQALRRDIQ FIFQDPYASL DPRQTIGDSI IEPLRVHGLL PGKEAVARVA WLLERVGLLP EHAWRYPHEF SGGQRQRICI ARALALNPKV IIADEAVSAL DVSIRGQIIN LLLDLQRDFG IAYLFISHDM AVVERISHRV AVMYLGQIVE IGPRRAVFEN PQHPYTRKLL AAVPVAEPSR QRPQRVLLSD DLPSNIHLRG EEVAAVSLQC VGPGHYVAQP QSEYAFMRR
|
| |