Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3089 |
Symbol | gshB |
ID | 6146371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3178776 |
End bp | 3179726 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617957 |
Product | glutathione synthetase |
Protein accession | YP_001745108 |
Protein GI | 170683849 |
COG category | [H] Coenzyme transport and metabolism [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0189] Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase) |
TIGRFAM ID | [TIGR01380] glutathione synthetase, prokaryotic |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00631084 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0000073665 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCAAGC TCGGCATCGT GATGGACCCC ATCGCAAACA TCAACATCAA GAAAGATTCC AGTTTCGCTA TGTTGCTGGA AGCACAGCGT CGTGGTTACG AACTTCACTA TATGGAGATG GGCGATCTGT ATCTGATCAA TGGCGAAGCC CGCGCCCATA CCCGCACGCT GAACGTGAAG CAGAACTACG AAGAGTGGTT TTCGTTCGTC GGTGAACAGG ATCTGCCGCT GGCCGATCTC GATGTGATCC TGATGCGTAA AGACCCGCCG TTTGATACCG AGTTTATCTA CGCGACCTAT ATTCTGGAAC GTGCCGAAGA GAAAGGGACG CTGATCGTTA ACAAGCCGCA GAGCCTGCGC GACTGTAACG AGAAACTGTT TACCGCCTGG TTCTCTGACT TAACGCCAGA AACGCTGGTT ACGCGCAATA AAGCACAGCT GAAAGCGTTC TGGGAGAAAC ACAGCGACAT CATTCTTAAG CCGCTGGACG GTATGGGCGG CGCGTCGATT TTCCGCGTGA AAGAAGGCGA TCCAAACCTC GGCGTGATTG CCGAAACCCT GACTGAGCAT GGCACTCGCT ACTGCATGGC GCAAAATTAC CTGCCAGCCA TTAAAGATGG CGACAAACGC GTGCTGGTGG TGGATGGCGA GCCGGTTCCG TACTGCCTGG CGCGTATTCC GCAGGGGGGC GAAACCCGTG GCAATCTGGC TGCCGGTGGT CGCGGTGAAC CTCGTCCGCT GACGGAAAGT GACTGGAAAA TCGCCCGTCA GATCGGGCCG ACGCTGAAAG AAAAAGGGCT GATTTTTGTT GGTCTGGATA TCATTGGCGA CCGTCTGACT GAAATTAACG TCACCAGCCC AACCTGTATT CGTGAGATTG AAGCAGAGTT TCCGGTGTCG ATCACCGGAA TGTTAATGGA TGCCATCGAA GCACGTTTAC AGCAGCAGTA A
|
Protein sequence | MIKLGIVMDP IANINIKKDS SFAMLLEAQR RGYELHYMEM GDLYLINGEA RAHTRTLNVK QNYEEWFSFV GEQDLPLADL DVILMRKDPP FDTEFIYATY ILERAEEKGT LIVNKPQSLR DCNEKLFTAW FSDLTPETLV TRNKAQLKAF WEKHSDIILK PLDGMGGASI FRVKEGDPNL GVIAETLTEH GTRYCMAQNY LPAIKDGDKR VLVVDGEPVP YCLARIPQGG ETRGNLAAGG RGEPRPLTES DWKIARQIGP TLKEKGLIFV GLDIIGDRLT EINVTSPTCI REIEAEFPVS ITGMLMDAIE ARLQQQ
|
| |