Gene EcSMS35_3089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3089 
SymbolgshB 
ID6146371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3178776 
End bp3179726 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content54% 
IMG OID641617957 
Productglutathione synthetase 
Protein accessionYP_001745108 
Protein GI170683849 
COG category[H] Coenzyme transport and metabolism
[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0189] Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase) 
TIGRFAM ID[TIGR01380] glutathione synthetase, prokaryotic 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00631084 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0000073665 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCAAGC TCGGCATCGT GATGGACCCC ATCGCAAACA TCAACATCAA GAAAGATTCC 
AGTTTCGCTA TGTTGCTGGA AGCACAGCGT CGTGGTTACG AACTTCACTA TATGGAGATG
GGCGATCTGT ATCTGATCAA TGGCGAAGCC CGCGCCCATA CCCGCACGCT GAACGTGAAG
CAGAACTACG AAGAGTGGTT TTCGTTCGTC GGTGAACAGG ATCTGCCGCT GGCCGATCTC
GATGTGATCC TGATGCGTAA AGACCCGCCG TTTGATACCG AGTTTATCTA CGCGACCTAT
ATTCTGGAAC GTGCCGAAGA GAAAGGGACG CTGATCGTTA ACAAGCCGCA GAGCCTGCGC
GACTGTAACG AGAAACTGTT TACCGCCTGG TTCTCTGACT TAACGCCAGA AACGCTGGTT
ACGCGCAATA AAGCACAGCT GAAAGCGTTC TGGGAGAAAC ACAGCGACAT CATTCTTAAG
CCGCTGGACG GTATGGGCGG CGCGTCGATT TTCCGCGTGA AAGAAGGCGA TCCAAACCTC
GGCGTGATTG CCGAAACCCT GACTGAGCAT GGCACTCGCT ACTGCATGGC GCAAAATTAC
CTGCCAGCCA TTAAAGATGG CGACAAACGC GTGCTGGTGG TGGATGGCGA GCCGGTTCCG
TACTGCCTGG CGCGTATTCC GCAGGGGGGC GAAACCCGTG GCAATCTGGC TGCCGGTGGT
CGCGGTGAAC CTCGTCCGCT GACGGAAAGT GACTGGAAAA TCGCCCGTCA GATCGGGCCG
ACGCTGAAAG AAAAAGGGCT GATTTTTGTT GGTCTGGATA TCATTGGCGA CCGTCTGACT
GAAATTAACG TCACCAGCCC AACCTGTATT CGTGAGATTG AAGCAGAGTT TCCGGTGTCG
ATCACCGGAA TGTTAATGGA TGCCATCGAA GCACGTTTAC AGCAGCAGTA A
 
Protein sequence
MIKLGIVMDP IANINIKKDS SFAMLLEAQR RGYELHYMEM GDLYLINGEA RAHTRTLNVK 
QNYEEWFSFV GEQDLPLADL DVILMRKDPP FDTEFIYATY ILERAEEKGT LIVNKPQSLR
DCNEKLFTAW FSDLTPETLV TRNKAQLKAF WEKHSDIILK PLDGMGGASI FRVKEGDPNL
GVIAETLTEH GTRYCMAQNY LPAIKDGDKR VLVVDGEPVP YCLARIPQGG ETRGNLAAGG
RGEPRPLTES DWKIARQIGP TLKEKGLIFV GLDIIGDRLT EINVTSPTCI REIEAEFPVS
ITGMLMDAIE ARLQQQ