Gene EcSMS35_0854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0854 
SymbolgsiA 
ID6145406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp859199 
End bp861070 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content54% 
IMG OID641615742 
Productglutathione transporter ATP-binding protein 
Protein accessionYP_001742934 
Protein GI170680445 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCACACA GTGATGAACT TGATGCCGGT GATGTGCTGG CGGTTGAAAA TCTGAATATT 
GCCTTTATGC AGGACCAGCA GAAAATAGCT GCGGTCCGCA ATCTCTCTTT TAGCCTGCAA
CGCGGTGAGA CGCTGGCAAT TGTTGGCGAA TCCGGCTCCG GTAAGTCAGT GACTGCGCTG
GCATTGATGC GTCTGTTGGA ACAGGCGGGC GGTTTAGTAC AGTGCGATAA AATGCTGTTG
CGGCGGCGCA GTCGCGAAGT GATTGAACTT AGCGAGCAGA GCGCTGCACA AATGCGCCAT
GTGCGCGGTG CGGATATGGC GATGATATTT CAGGAACCGA TGACATCGCT GAACCCGGTA
TTTACTGTGG GTGAACAGAT TGCCGAATCA ATTCGTCTGC ATCAGAACGC CAGTCGTGAA
GAAGCGATGG TCGAGGCGAA GCGGATGCTG GATCAGGTAC GCATTCCGGA GGCACAAACC
ATTCTTTCAC GTTATCCGCA TCAACTCTCT GGCGGGATGC GCCAGCGAGT GATGATTGCG
ATGGCGCTGT CATGCTGCCC GGCGGTGCTG ATTGCCGATG AGCCAACCAC CGCGCTGGAT
GTCACTATTC AGGCGCAGAT CCTGCAATTA ATCAAAGTAT TGCAAAAAGA GATGTCGATG
GGCGTTATCT TTATCACTCA CGATATGGGC GTGGTGGCAG AGATTGCCGA TCGGGTTCTG
GTGATGTATC AGGGCGAGGC GGTGGAAACG GGTAGCGTCG AACAGATTTT TCATGCACCG
CAACATCCTT ATACCCGTGC GCTGTTAGCT GCTGTTCCGC AACTTGGTGC GATGAAAGGG
TTAGATTATC CCCGACGTTT CCCGTTGATA TCGCTTGAAC ATCCAGCGAA ACAGGAGCCA
CCCATCGAGC AGAAAACGGT GGTGGATGGC GAACCTGTTT TACGGGTGCG TAATCTGGTC
ACCCGTTTCC CTTTGCGCAG CGGTTTGTTG AATCGCGTAA CGCGGGAAGT GCATGCCGTT
GAGAAAGTCA GTTTTGATCT CTGGCCTGGT GAAACGCTAT CGCTGGTGGG CGAGTCTGGC
AGCGGTAAAT CCACTACCGG GCGGGCGTTG CTGCGCCTGG TCGAATCGCA GGGCGGCGAA
ATTATCTTTA ACGGTCAGCG AATCGATACC TTGTCATCCG GTAAACTTCA GGCATTGCGC
CGCGATATTC AGTTTATTTT TCAGGACCCT TACGCTTCGC TGGACCCACG TCAGACCATC
GGTGATTCGA TTATCGAACC GCTGCGCGTA CACGGTTTAT TGCCAGGTAA AGAAGCGGCT
GCACGTGTTG CGTGGTTGCT GGAGCGCGTG GGCCTGTTAC CTGAACATGC CTGGCGTTAC
CCGCATGAGT TTTCCGGCGG TCAGCGCCAG CGCATCTGCA TTGCTCGCGC GTTGGCATTG
AATCCAAAAG TGATCATTGC CGACGAAGCC GTCTCGGCGC TGGATGTTTC TATTCGCGGG
CAGATTATCA ACTTGTTGCT CGATCTCCAG CGTGATTTCG GCATTGCGTA TCTGTTTATC
TCCCACGATA TGGCCGTGGT AGAGCGGATT AGTCATCGTG TGGCGGTGAT GTATCTCGGG
CAAATTGTTG AAATTGGTCC ACGGCGCGCG GTCTTCGAAA ACCCGCAGCA TCCTTATACG
CGTAAATTAC TGGCGGCAGT TCCGGTCGCT GAACCGTCCC GACAACGACC GCAGCGTGTA
CTGCTGTCGG ACGATCTTCC CAGCAATATT CATCTGCGTG GCGAAGAGGT TGCAGCCGTC
TCGTTGCAAT GCGTCGGGCC GGGGCATTAC GTCGCACAAC CACAATCAGA ATACGCATTC
ATGCGTAGAT AA
 
Protein sequence
MPHSDELDAG DVLAVENLNI AFMQDQQKIA AVRNLSFSLQ RGETLAIVGE SGSGKSVTAL 
ALMRLLEQAG GLVQCDKMLL RRRSREVIEL SEQSAAQMRH VRGADMAMIF QEPMTSLNPV
FTVGEQIAES IRLHQNASRE EAMVEAKRML DQVRIPEAQT ILSRYPHQLS GGMRQRVMIA
MALSCCPAVL IADEPTTALD VTIQAQILQL IKVLQKEMSM GVIFITHDMG VVAEIADRVL
VMYQGEAVET GSVEQIFHAP QHPYTRALLA AVPQLGAMKG LDYPRRFPLI SLEHPAKQEP
PIEQKTVVDG EPVLRVRNLV TRFPLRSGLL NRVTREVHAV EKVSFDLWPG ETLSLVGESG
SGKSTTGRAL LRLVESQGGE IIFNGQRIDT LSSGKLQALR RDIQFIFQDP YASLDPRQTI
GDSIIEPLRV HGLLPGKEAA ARVAWLLERV GLLPEHAWRY PHEFSGGQRQ RICIARALAL
NPKVIIADEA VSALDVSIRG QIINLLLDLQ RDFGIAYLFI SHDMAVVERI SHRVAVMYLG
QIVEIGPRRA VFENPQHPYT RKLLAAVPVA EPSRQRPQRV LLSDDLPSNI HLRGEEVAAV
SLQCVGPGHY VAQPQSEYAF MRR