Gene SbBS512_E2519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2519 
SymbolgsiA 
ID6271818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2317321 
End bp2319192 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content54% 
IMG OID641726503 
Productglutathione transporter ATP-binding protein 
Protein accessionYP_001880983 
Protein GI187730108 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.503965 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCCACACA GTGATGAACT TGATGCCGGT AATGTGCTGG CGGTTGAAAA TCTGAATATT 
GCCTTTATGC AGGACCAGCA GAAAATAGCT GCGGTCCGCA ATCTCTCTTT TAGTCTGCAA
CGCGGTGAGA CGCTGGCAAT TGTTGGCGAA TCCGGCTCCG GTAAGTCAGT GACTGCGTTG
GCATTGATGC GCCTGTTGGA ACAGGCGGGC GGTTTAGTAC AGTGCGATAA AATGCTGTTG
CAGCGGCGCA GTCGCGAAGT GATTGAACTT AGCGAGCAGA GCGCTGCACA AATGCGCCAT
GTTCGCGGTG CGGATATGGC GATGATATTT CAGGAGCCGA TGACATCGCT GAACCCGGTA
TTTACTGTGG GTGAACAGAT TGCCGAATCA ATTCGTCTGC ATCAGAACGC CAGTCGTGAA
GAAGCGATGG TCGAGGCGAA GCGGATGCTG GATCAGGTAC GCATTCCTGA GGCACAAACC
ATTCTTTCAC GTTATCCGCA TCAACTCTCT GGCGGGATGC GCCAGCGAGT GATGATTGCG
ATGGCGCTGT CATGCCGCCC GGCGGTGCTG ATTGCCGATG AGCCAACCAC CGCGCTGGAT
GTCACTATTC AGGCGCAGAT CCTGCAATTA ATCAAAGTAT TGCAAAAAGA GATGTCGATG
GGCGTTATCT TTATCACGCA CGATATGGGC GTGGTGGCAG AGATAGCCGA TCGGGTACTG
GTGATGTATC AGGGCGAGGC GGTGGAAACG GGTACCGTCG AACAGATTTT TCATGCACCG
CAACATCCTT ACACCCGTGC GCTGTTAGCT GCTGTTCCGC AACTTGGTGC GATGAAAGGG
TTAGATTATC CCCGACGTTT CCCATTGATA TCGCTTGAAC ATCCAGCGAA ACAGGACCTC
CCCATCGAGC AGAAAACGGT GGTGGATGGC GAACCTGTTT TACGAGTGCG TAATCTTGTC
ACCCGTTTCC CTTTGCGCAG CGGTTTGTTG AATCGCGTAA CGCGGGAAGT GCATGCCGTT
GAGAAAGTCA GTTTTGATCT CTGGCCTGGC GAAACGCTAT CGCTGGTGGG CGAGTCTGGC
AGCGGTAAAT CCACTACCGG GCGGGCGTTG CTGCGCCTGG TCGAATCGCA GGGCGGCGAA
ATTATCTTTA ACGGTCAGCG AATCGATACC TTGTCACCTG GCAAACTTCA GGCATTGCGC
CGGGATATTC AGTTTATTTT TCAGGACCCT TACGCTTCGC TGGACCCACG TCAGACCATC
GGTGATTCGA TTATCGAACC GCTGCGTGTA CACGGTTTAT TGCCAGGTAA AGACGCGGCT
GCACGCGTTG CGTGGTTGCT GGAGCGCGTG GGCCTGTTAC CTGAACATGC CTGGCGTTAC
CCGCATGAGT TTTCCGGCGG TCAGCGCCAG CGCATCTGCA TTGCTCGCGC GTTGGCATTG
AATCCAAAAG TGATCATTGC CGACGAAGCC GTTTCGGCGC TGGATGTTTC TATTCGCGGG
CAGATTATCA ACTTGTTGCT CGATCTCCAG CGTGATTTCG GCATTGCGTA TCTGTTTATC
TCCCACGATA TGGCCGTGGT AGAGCGGATT AGTCATCGTG TGGCGGTGAT GTATCTCGGG
CAAATTGTTG AAATTGGTCC ACGGCGCGCG GTCTTCGAAA ACCCGCAGCA TCCTTATACG
CGTAAATTAC TGGCGGCAGT TCCGGTCGCT GAACCGTCCC GACAACGACC GCAGCGTGTA
CTGCTGTCGG ACGATCTTCC CAGCAATATT CATCTGCGTG GCGAAGAGGT GGCAGCCGTC
TCGTTGCAAT GCGTCGGGCC GGGGCATTAC GTCGCACAAC CACAATCAGA ATACGCATTC
ATGCGTAGAT AA
 
Protein sequence
MPHSDELDAG NVLAVENLNI AFMQDQQKIA AVRNLSFSLQ RGETLAIVGE SGSGKSVTAL 
ALMRLLEQAG GLVQCDKMLL QRRSREVIEL SEQSAAQMRH VRGADMAMIF QEPMTSLNPV
FTVGEQIAES IRLHQNASRE EAMVEAKRML DQVRIPEAQT ILSRYPHQLS GGMRQRVMIA
MALSCRPAVL IADEPTTALD VTIQAQILQL IKVLQKEMSM GVIFITHDMG VVAEIADRVL
VMYQGEAVET GTVEQIFHAP QHPYTRALLA AVPQLGAMKG LDYPRRFPLI SLEHPAKQDL
PIEQKTVVDG EPVLRVRNLV TRFPLRSGLL NRVTREVHAV EKVSFDLWPG ETLSLVGESG
SGKSTTGRAL LRLVESQGGE IIFNGQRIDT LSPGKLQALR RDIQFIFQDP YASLDPRQTI
GDSIIEPLRV HGLLPGKDAA ARVAWLLERV GLLPEHAWRY PHEFSGGQRQ RICIARALAL
NPKVIIADEA VSALDVSIRG QIINLLLDLQ RDFGIAYLFI SHDMAVVERI SHRVAVMYLG
QIVEIGPRRA VFENPQHPYT RKLLAAVPVA EPSRQRPQRV LLSDDLPSNI HLRGEEVAAV
SLQCVGPGHY VAQPQSEYAF MRR