Gene ECH74115_4250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4250 
SymbolgshB 
ID6967300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3937774 
End bp3938721 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content53% 
IMG OID643387988 
Productglutathione synthetase 
Protein accessionYP_002272427 
Protein GI209397305 
COG category[H] Coenzyme transport and metabolism
[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0189] Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase) 
TIGRFAM ID[TIGR01380] glutathione synthetase, prokaryotic 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.528078 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.275432 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAGC TCGGCATCGT GATGGACCCC ATCGCAAACA TCAACATCAA GAAAGATTCC 
AGTTTCGCTA TGTTGCTGGA AGCACAGCGT CGTGGTTACG AACTTCACTA TATGGAGATG
GGCGATCTGT ATCTGATCAA TGGCGAAGCC CGCGCCCATA CCCGCACGCT GAACGTGAAA
CAGAACTACG AAGAGTGGTT TTCATTCGTC GGTGAACAGG ATCTGCCGCT GGCCGATCTC
GATGTGATCC TGATGCGTAA AGACCCGCCG TTTGATACCG AGTTTATCTA CGCGACCTAT
ATTCTGGAAC GTGCCGAAGA GAAAGGGACG CTGATCGTTA ACAAGCCGCA GAGCCTGCGC
GACTGTAACG AGAAACTGTT TACCGCCTGG TTCTCTGACT TAACGCCAGA AACGCTGGTT
ACGCGCAATA AAGCGCAGCT AAAAGCGTTC TGGGAGAAAC ACAGCGACAT CATTCTTAAG
CCGCTGGACG GTATGGGCGG CGCGTCGATT TTCCGCGTGA AAGAAGGCGA TCCAAACCTC
GGCGTCATTG CCGAAACCTT GACCGAACAC GGCACTCGCT ACTGCATGGC ACAAAATTAT
CTGCCAGCCA TTAAAGATGG CGACAAACGC GTGCTGGTGG TGGACGGCGA ACCAGTACCG
TACTGCCTGG CGCGTATTCC GCAGGGGGGC GAAACCCGTG GCAATCTGGC TGCCGGTGGT
CGCGGTGAAC CTCGTCCGCT GACGGAAAGT GACTGGAAAA TCGCCCGTCA GATCGGGCCG
ACGCTGAAAG AAAAAGGGCT GATTTTTGTT GGTCTGGATA TCATCGGCGA CCGTCTGACT
GAAATTAACG TCACCAGCCC AACCTGTATT CGTGAGATTG AAGCAGAGTT TCCGGTGTCG
ATCACCGGAA TGTTAATGGA TGCCATCGAA GCACGTTTAC AGCAGTAG
 
Protein sequence
MIKLGIVMDP IANINIKKDS SFAMLLEAQR RGYELHYMEM GDLYLINGEA RAHTRTLNVK 
QNYEEWFSFV GEQDLPLADL DVILMRKDPP FDTEFIYATY ILERAEEKGT LIVNKPQSLR
DCNEKLFTAW FSDLTPETLV TRNKAQLKAF WEKHSDIILK PLDGMGGASI FRVKEGDPNL
GVIAETLTEH GTRYCMAQNY LPAIKDGDKR VLVVDGEPVP YCLARIPQGG ETRGNLAAGG
RGEPRPLTES DWKIARQIGP TLKEKGLIFV GLDIIGDRLT EINVTSPTCI REIEAEFPVS
ITGMLMDAIE ARLQQ