Gene EcHS_A0887 details

Gene Information

Locus tag: EcHS_A0887
Symbol: gsiA
ID: 5594813
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 895402
End bp: 897273
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 54%
IMG OID: 640920059
Product: glutathione transporter ATP-binding protein
Protein accession: YP_001457626
Protein GI: 157160308
COG category: [R] General function prediction only
COG ID: [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
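
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 895402
end_bp = 897273
gene_length_bp = 1872
protein_length_aa = 623

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# One codon (3 bp) per residue, plus one stop codon that is not translated.
expected_protein_aa = span_bp // 3 - 1

print(span_bp == gene_length_bp)                  # True
print(expected_protein_aa == protein_length_aa)   # True
```

Both checks pass for this record: 897273 - 895402 + 1 = 1872 bp, and 1872 / 3 - 1 = 623 aa.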


Plasmid Coverage information

Num covering plasmid clones: 48
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCCACACA GTGATGAACT TGATGCCGGT GATGTGCTGG CGGTTGAAAA TCTGAAAATT 
GCCTTTATGC AGGACCAGCA GAAAATAGCT GCGGTCCGCA ATCTCTCTTT TAGCCTTCAA
CGCGGTGAGA CGCTGGCCAT TGTTGGCGAA TCCGGCTCCG GTAAGTCAGT GACTGCGCTG
GCATTGATGC GTCTGTTGGA ACAGGCGGGC GGTTTAGTAC AGTGCGATAA AATGCTGTTG
CGGCGGCGCA GTCGTGAAGT GATTGAACTT AGCGAGCAGA GCGCTGCACA AATGCGCCAT
GTGCGCGGTG CGGATATGGC GATGATATTT CAGGAGCCGA TGACATCGCT GAACCCGGTA
TTTACTGTGG GTGAACAGAT TGCCGAATCA ATTCGTCTGC ATCAGAACGC CAGTCGTGAA
GAAGCGATGG TCGAGGCGAA GCGGATGCTG GATCAGGTAC GCATTCCGGA GGCACAAACC
ATTCTTTCAC GTTATCCGCA TCAACTCTCT GGCGGGATGC GCCAGCGAGT GATGATTGCG
ATGGCGCTGT CATGCCGCCC GGCGGTGCTG ATTGCCGATG AGCCAACCAC CGCGCTGGAT
GTCACTATTC AGGCGCAGAT CCTGCAATTA ATCAAAGTAT TGCAAAAAGA GATGTCGATG
GGCGTTATCT TTATCACTCA CGATATGGGC GTGGTGGCAG AGATTGCCGA TCGGGTACTG
GTGATGTATC AGGGCGAGGC GGTGGAAACG GGTACCGTCG AACAGATTTT TCATGCACCG
CAACATCCTT ACACCCGTGC GCTGTTAGCT GCTGTTCCGC AACTTGGTGC GATGAAAGGG
TTAGATTATC CCCGACGTTT CCCATTGATA TCGCTTGAAC ATCCAGCGAA ACAGGAACCC
CCCATCGAGC AGAAAACGGT GGTGGATGGC GAACCTGTTT TACGGGTGCG TAATCTTGTC
ACCCGTTTCC CTTTGCGCAG CGGTTTGTTG AATCGCGTAA CGCGGGAAGT GCATGCCGTT
GAGAAAGTCA GTTTTGATCT CTGGCCTGGC GAAACGCTAT CGCTGGTGGG CGAGTCTGGC
AGCGGTAAAT CCACTACCGG GCGGGCGTTG CTGCGCCTGG TCGAATCGCA GGGCGGCGAA
ATTATCTTTA ACGGTCAGCG AATCGATACC TTGTCACCCG GCAAACTTCA GGCATTACGC
CGGGATATTC AGTTTATTTT TCAGGACCCT TACGCTTCGC TGGACCCACG TCAGACCATC
GGTGATTCGA TTATCGAACC GCTGCGCGTA CACGGTTTAT TGCCAGGTAA AGACGCGGCT
GCACGCGTTG CGTGGTTGCT GGAGCGCGTG GGCCTGTTAC CTGAACATGC CTGGCGTTAC
CCGCATGAGT TTTCCGGCGG TCAGCGCCAG CGCATCTGCA TTGCTCGCGC GTTGGCATTG
AATCCAAAAG TGATCATTGC CGACGAAGCC GTTTCGGCGC TGGATGTTTC TATTCGCGGG
CAGATTATCA ACTTGTTGCT CGATCTCCAG CGTGATTTCG GCATTGCGTA TCTGTTTATC
TCCCACGATA TGGCCGTGGT AGAGCGGATT AGTCATCGTG TGGCGGTGAT GTATCTCGGG
CAAATTGTTG AAATTGGCCC ACGGCGCGCG GTCTTCGAAA ACCCGCAGCA TCCTTATACG
CGTAAATTAC TGGCGGCAGT TCCGGTCGCT GAACCGTCCC GACAACGACC GCAGCGTGTA
CTGCTGTCGG ACGATCTTCC CAGCAATATT CATCTGCGTG GCGAAGAGGT GGCAGGCGTC
TCGTTGCAAT GCGTCGGGCC GGGGCATTAC GTCGCACAAC CACAATCAGA ATACGCATTC
ATGCGTAGAT AA
 
Protein sequence
MPHSDELDAG DVLAVENLKI AFMQDQQKIA AVRNLSFSLQ RGETLAIVGE SGSGKSVTAL 
ALMRLLEQAG GLVQCDKMLL RRRSREVIEL SEQSAAQMRH VRGADMAMIF QEPMTSLNPV
FTVGEQIAES IRLHQNASRE EAMVEAKRML DQVRIPEAQT ILSRYPHQLS GGMRQRVMIA
MALSCRPAVL IADEPTTALD VTIQAQILQL IKVLQKEMSM GVIFITHDMG VVAEIADRVL
VMYQGEAVET GTVEQIFHAP QHPYTRALLA AVPQLGAMKG LDYPRRFPLI SLEHPAKQEP
PIEQKTVVDG EPVLRVRNLV TRFPLRSGLL NRVTREVHAV EKVSFDLWPG ETLSLVGESG
SGKSTTGRAL LRLVESQGGE IIFNGQRIDT LSPGKLQALR RDIQFIFQDP YASLDPRQTI
GDSIIEPLRV HGLLPGKDAA ARVAWLLERV GLLPEHAWRY PHEFSGGQRQ RICIARALAL
NPKVIIADEA VSALDVSIRG QIINLLLDLQ RDFGIAYLFI SHDMAVVERI SHRVAVMYLG
QIVEIGPRRA VFENPQHPYT RKLLAAVPVA EPSRQRPQRV LLSDDLPSNI HLRGEEVAGV
SLQCVGPGHY VAQPQSEYAF MRR
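
The sequences above are printed in groups of 10 residues, 60 per line, so the grouping whitespace has to be removed before any analysis. The sketch below shows one way to do that in Python, along with a GC-fraction helper and a basic check that a nucleotide sequence looks like a complete CDS under translation table 11. The function names (`clean_sequence`, `gc_fraction`, `looks_like_cds`) are hypothetical helpers, not part of any database tool. The start-codon set is the one listed for NCBI translation table 11, which includes the TTG that begins this gene.

```python
# Start and stop codons of NCBI translation table 11 (bacterial, archaeal, plant plastid).
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE11_STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group residues, and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(seq):
    """True if the length is a multiple of 3 and the sequence begins with a
    table 11 start codon and ends with a table 11 stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in TABLE11_STARTS
        and seq[-3:] in TABLE11_STOPS
    )


# First line of the gene sequence above, including its trailing space.
first_line = clean_sequence(
    "TTGCCACACA GTGATGAACT TGATGCCGGT GATGTGCTGG CGGTTGAAAA TCTGAAAATT "
)
print(len(first_line))  # 60

# Toy example (not a real gene): TTG start, two codons, TAA stop.
toy = clean_sequence("TTGCGTAGAT AA")
print(looks_like_cds(toy))  # True
```

Passing the full gene sequence block to `clean_sequence` should give a string of 1872 bases, which can then be fed to `gc_fraction` and compared with the listed GC content.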