Gene Information |
Locus tag | EcSMS35_0854 |
Symbol | gsiA |
ID | 6145406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 859199 |
End bp | 861070 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615742 |
Product | glutathione transporter ATP-binding protein |
Protein accession | YP_001742934 |
Protein GI | 170680445 |
COG category | [R] General function prediction only |
COG ID | [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase |
TIGRFAM ID | [TIGR02323] phosphonate C-P lyase system protein PhnK |
|
|
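The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


# Values taken from the record above.
start_bp, end_bp = 859199, 861070
length = gene_length(start_bp, end_bp)
print(length)                  # 1872, matching "Gene Length"
print(protein_length(length))  # 623, matching "Protein Length"
```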
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCACACA GTGATGAACT TGATGCCGGT GATGTGCTGG CGGTTGAAAA TCTGAATATT GCCTTTATGC AGGACCAGCA GAAAATAGCT GCGGTCCGCA ATCTCTCTTT TAGCCTGCAA CGCGGTGAGA CGCTGGCAAT TGTTGGCGAA TCCGGCTCCG GTAAGTCAGT GACTGCGCTG GCATTGATGC GTCTGTTGGA ACAGGCGGGC GGTTTAGTAC AGTGCGATAA AATGCTGTTG CGGCGGCGCA GTCGCGAAGT GATTGAACTT AGCGAGCAGA GCGCTGCACA AATGCGCCAT GTGCGCGGTG CGGATATGGC GATGATATTT CAGGAACCGA TGACATCGCT GAACCCGGTA TTTACTGTGG GTGAACAGAT TGCCGAATCA ATTCGTCTGC ATCAGAACGC CAGTCGTGAA GAAGCGATGG TCGAGGCGAA GCGGATGCTG GATCAGGTAC GCATTCCGGA GGCACAAACC ATTCTTTCAC GTTATCCGCA TCAACTCTCT GGCGGGATGC GCCAGCGAGT GATGATTGCG ATGGCGCTGT CATGCTGCCC GGCGGTGCTG ATTGCCGATG AGCCAACCAC CGCGCTGGAT GTCACTATTC AGGCGCAGAT CCTGCAATTA ATCAAAGTAT TGCAAAAAGA GATGTCGATG GGCGTTATCT TTATCACTCA CGATATGGGC GTGGTGGCAG AGATTGCCGA TCGGGTTCTG GTGATGTATC AGGGCGAGGC GGTGGAAACG GGTAGCGTCG AACAGATTTT TCATGCACCG CAACATCCTT ATACCCGTGC GCTGTTAGCT GCTGTTCCGC AACTTGGTGC GATGAAAGGG TTAGATTATC CCCGACGTTT CCCGTTGATA TCGCTTGAAC ATCCAGCGAA ACAGGAGCCA CCCATCGAGC AGAAAACGGT GGTGGATGGC GAACCTGTTT TACGGGTGCG TAATCTGGTC ACCCGTTTCC CTTTGCGCAG CGGTTTGTTG AATCGCGTAA CGCGGGAAGT GCATGCCGTT GAGAAAGTCA GTTTTGATCT CTGGCCTGGT GAAACGCTAT CGCTGGTGGG CGAGTCTGGC AGCGGTAAAT CCACTACCGG GCGGGCGTTG CTGCGCCTGG TCGAATCGCA GGGCGGCGAA ATTATCTTTA ACGGTCAGCG AATCGATACC TTGTCATCCG GTAAACTTCA GGCATTGCGC CGCGATATTC AGTTTATTTT TCAGGACCCT TACGCTTCGC TGGACCCACG TCAGACCATC GGTGATTCGA TTATCGAACC GCTGCGCGTA CACGGTTTAT TGCCAGGTAA AGAAGCGGCT GCACGTGTTG CGTGGTTGCT GGAGCGCGTG GGCCTGTTAC CTGAACATGC CTGGCGTTAC CCGCATGAGT TTTCCGGCGG TCAGCGCCAG CGCATCTGCA TTGCTCGCGC GTTGGCATTG AATCCAAAAG TGATCATTGC CGACGAAGCC GTCTCGGCGC TGGATGTTTC TATTCGCGGG CAGATTATCA ACTTGTTGCT CGATCTCCAG CGTGATTTCG GCATTGCGTA TCTGTTTATC TCCCACGATA TGGCCGTGGT AGAGCGGATT AGTCATCGTG TGGCGGTGAT GTATCTCGGG CAAATTGTTG AAATTGGTCC ACGGCGCGCG GTCTTCGAAA ACCCGCAGCA TCCTTATACG CGTAAATTAC TGGCGGCAGT TCCGGTCGCT GAACCGTCCC GACAACGACC GCAGCGTGTA CTGCTGTCGG ACGATCTTCC CAGCAATATT CATCTGCGTG GCGAAGAGGT TGCAGCCGTC 
TCGTTGCAAT GCGTCGGGCC GGGGCATTAC GTCGCACAAC CACAATCAGA ATACGCATTC ATGCGTAGAT AA |
Protein sequence | MPHSDELDAG DVLAVENLNI AFMQDQQKIA AVRNLSFSLQ RGETLAIVGE SGSGKSVTAL ALMRLLEQAG GLVQCDKMLL RRRSREVIEL SEQSAAQMRH VRGADMAMIF QEPMTSLNPV FTVGEQIAES IRLHQNASRE EAMVEAKRML DQVRIPEAQT ILSRYPHQLS GGMRQRVMIA MALSCCPAVL IADEPTTALD VTIQAQILQL IKVLQKEMSM GVIFITHDMG VVAEIADRVL VMYQGEAVET GSVEQIFHAP QHPYTRALLA AVPQLGAMKG LDYPRRFPLI SLEHPAKQEP PIEQKTVVDG EPVLRVRNLV TRFPLRSGLL NRVTREVHAV EKVSFDLWPG ETLSLVGESG SGKSTTGRAL LRLVESQGGE IIFNGQRIDT LSSGKLQALR RDIQFIFQDP YASLDPRQTI GDSIIEPLRV HGLLPGKEAA ARVAWLLERV GLLPEHAWRY PHEFSGGQRQ RICIARALAL NPKVIIADEA VSALDVSIRG QIINLLLDLQ RDFGIAYLFI SHDMAVVERI SHRVAVMYLG QIVEIGPRRA VFENPQHPYT RKLLAAVPVA EPSRQRPQRV LLSDDLPSNI HLRGEEVAAV SLQCVGPGHY VAQPQSEYAF MRR |
| |
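The sequences in this record are printed in space-separated blocks of ten, so they need light cleanup before any computation. The sketch below shows one possible way to strip the grouping, compute a GC percentage comparable to the "GC content" field, and check that the first codon is a permitted initiator under translation table 11 (the gene sequence begins with TTG while the protein begins with M). The helper names are hypothetical, and the start-codon set is the one listed for NCBI translation table 11; only a short fragment of the gene sequence is used here for illustration.

```python
# Initiation codons listed for NCBI translation table 11
# (bacterial, archaeal and plant plastid code).
TABLE_11_START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean_sequence(raw: str) -> str:
    """Remove the spaces/newlines used for ten-base grouping and upper-case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def has_table11_start(seq: str) -> bool:
    """True if the first codon is an initiator under table 11."""
    return seq[:3] in TABLE_11_START_CODONS


# First three ten-base blocks of the gene sequence above (illustration only).
fragment = clean_sequence("TTGCCACACA GTGATGAACT TGATGCCGGT")
print(len(fragment))
print(round(gc_percent(fragment), 1))
print(has_table11_start(fragment))
```

Pasting the full "Gene sequence" value into `clean_sequence` would allow the same functions to be compared against the reported gene length and GC content.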