Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2519 |
Symbol | gsiA |
ID | 6271818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2317321 |
End bp | 2319192 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641726503 |
Product | glutathione transporter ATP-binding protein |
Protein accession | YP_001880983 |
Protein GI | 187730108 |
COG category | [R] General function prediction only |
COG ID | [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase |
TIGRFAM ID | [TIGR02323] phosphonate C-P lyase system protein PhnK |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.503965 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCCACACA GTGATGAACT TGATGCCGGT AATGTGCTGG CGGTTGAAAA TCTGAATATT GCCTTTATGC AGGACCAGCA GAAAATAGCT GCGGTCCGCA ATCTCTCTTT TAGTCTGCAA CGCGGTGAGA CGCTGGCAAT TGTTGGCGAA TCCGGCTCCG GTAAGTCAGT GACTGCGTTG GCATTGATGC GCCTGTTGGA ACAGGCGGGC GGTTTAGTAC AGTGCGATAA AATGCTGTTG CAGCGGCGCA GTCGCGAAGT GATTGAACTT AGCGAGCAGA GCGCTGCACA AATGCGCCAT GTTCGCGGTG CGGATATGGC GATGATATTT CAGGAGCCGA TGACATCGCT GAACCCGGTA TTTACTGTGG GTGAACAGAT TGCCGAATCA ATTCGTCTGC ATCAGAACGC CAGTCGTGAA GAAGCGATGG TCGAGGCGAA GCGGATGCTG GATCAGGTAC GCATTCCTGA GGCACAAACC ATTCTTTCAC GTTATCCGCA TCAACTCTCT GGCGGGATGC GCCAGCGAGT GATGATTGCG ATGGCGCTGT CATGCCGCCC GGCGGTGCTG ATTGCCGATG AGCCAACCAC CGCGCTGGAT GTCACTATTC AGGCGCAGAT CCTGCAATTA ATCAAAGTAT TGCAAAAAGA GATGTCGATG GGCGTTATCT TTATCACGCA CGATATGGGC GTGGTGGCAG AGATAGCCGA TCGGGTACTG GTGATGTATC AGGGCGAGGC GGTGGAAACG GGTACCGTCG AACAGATTTT TCATGCACCG CAACATCCTT ACACCCGTGC GCTGTTAGCT GCTGTTCCGC AACTTGGTGC GATGAAAGGG TTAGATTATC CCCGACGTTT CCCATTGATA TCGCTTGAAC ATCCAGCGAA ACAGGACCTC CCCATCGAGC AGAAAACGGT GGTGGATGGC GAACCTGTTT TACGAGTGCG TAATCTTGTC ACCCGTTTCC CTTTGCGCAG CGGTTTGTTG AATCGCGTAA CGCGGGAAGT GCATGCCGTT GAGAAAGTCA GTTTTGATCT CTGGCCTGGC GAAACGCTAT CGCTGGTGGG CGAGTCTGGC AGCGGTAAAT CCACTACCGG GCGGGCGTTG CTGCGCCTGG TCGAATCGCA GGGCGGCGAA ATTATCTTTA ACGGTCAGCG AATCGATACC TTGTCACCTG GCAAACTTCA GGCATTGCGC CGGGATATTC AGTTTATTTT TCAGGACCCT TACGCTTCGC TGGACCCACG TCAGACCATC GGTGATTCGA TTATCGAACC GCTGCGTGTA CACGGTTTAT TGCCAGGTAA AGACGCGGCT GCACGCGTTG CGTGGTTGCT GGAGCGCGTG GGCCTGTTAC CTGAACATGC CTGGCGTTAC CCGCATGAGT TTTCCGGCGG TCAGCGCCAG CGCATCTGCA TTGCTCGCGC GTTGGCATTG AATCCAAAAG TGATCATTGC CGACGAAGCC GTTTCGGCGC TGGATGTTTC TATTCGCGGG CAGATTATCA ACTTGTTGCT CGATCTCCAG CGTGATTTCG GCATTGCGTA TCTGTTTATC TCCCACGATA TGGCCGTGGT AGAGCGGATT AGTCATCGTG TGGCGGTGAT GTATCTCGGG CAAATTGTTG AAATTGGTCC ACGGCGCGCG GTCTTCGAAA ACCCGCAGCA TCCTTATACG CGTAAATTAC TGGCGGCAGT TCCGGTCGCT GAACCGTCCC GACAACGACC GCAGCGTGTA CTGCTGTCGG ACGATCTTCC CAGCAATATT CATCTGCGTG GCGAAGAGGT GGCAGCCGTC TCGTTGCAAT GCGTCGGGCC GGGGCATTAC GTCGCACAAC CACAATCAGA ATACGCATTC ATGCGTAGAT AA
|
Protein sequence | MPHSDELDAG NVLAVENLNI AFMQDQQKIA AVRNLSFSLQ RGETLAIVGE SGSGKSVTAL ALMRLLEQAG GLVQCDKMLL QRRSREVIEL SEQSAAQMRH VRGADMAMIF QEPMTSLNPV FTVGEQIAES IRLHQNASRE EAMVEAKRML DQVRIPEAQT ILSRYPHQLS GGMRQRVMIA MALSCRPAVL IADEPTTALD VTIQAQILQL IKVLQKEMSM GVIFITHDMG VVAEIADRVL VMYQGEAVET GTVEQIFHAP QHPYTRALLA AVPQLGAMKG LDYPRRFPLI SLEHPAKQDL PIEQKTVVDG EPVLRVRNLV TRFPLRSGLL NRVTREVHAV EKVSFDLWPG ETLSLVGESG SGKSTTGRAL LRLVESQGGE IIFNGQRIDT LSPGKLQALR RDIQFIFQDP YASLDPRQTI GDSIIEPLRV HGLLPGKDAA ARVAWLLERV GLLPEHAWRY PHEFSGGQRQ RICIARALAL NPKVIIADEA VSALDVSIRG QIINLLLDLQ RDFGIAYLFI SHDMAVVERI SHRVAVMYLG QIVEIGPRRA VFENPQHPYT RKLLAAVPVA EPSRQRPQRV LLSDDLPSNI HLRGEEVAAV SLQCVGPGHY VAQPQSEYAF MRR
|
| |