Gene Information |
Locus tag | EcSMS35_2810 |
Symbol | gshA |
ID | 6143033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2890377 |
End bp | 2891933 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617679 |
Product | glutamate--cysteine ligase |
Protein accession | YP_001744839 |
Protein GI | 170683222 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2918] Gamma-glutamylcysteine synthetase |
TIGRFAM ID | [TIGR01434] glutamate--cysteine ligase; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
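The coordinate, gene length, and protein length fields above are related by simple arithmetic. The sketch below is a minimal consistency check, assuming the Start bp and End bp values are 1-based and inclusive (an assumption; the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2890377
end_bp = 2891933

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1557 518
```

Both values agree with the Gene Length (1557 bp) and Protein Length (518 aa) fields in the table.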
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000302956 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.00165416 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGATCCCGG ACGTATCACA GGCGCTGGCC TGGCTGGAAA AACATCCTCA GGCGTTAAAG GGGATACAGC GTGGGCTGGA GCGCGAAACT TTGCGTGTTA ATGCTGATGG CACATTGGCA ACAACAGGTC ATCCTGAAGC ATTAGGTTCC GCACTGACGC ACAAATGGAT TACTACCGAT TTTGCGGAAG CATTGCTGGA ATTCATTACA CCAGTGGATG GTGATATTGA ACATATGTTG ACCTTTATGC GCGATCTGCA TCGTTATACG GCGCGCAATA TGGGCGATGA GCGGATGTGG CCGTTAAGTA TGCCATGCTA CATCGCAGAA GGTCAGGACA TCGAACTGGC ACAGTACGGC ACTTCTAACA CTGGACGCTT TAAAACGCTG TACCGTGAAG GGCTGAAAAA TCGCTACGGC GCGCTGATGC AAACCATTTC CGGCGTGCAC TACAATTTCT CTTTGCCAAT GGCATTCTGG CAAGCGAAGT GTGGTGATAT CGCGGGCGCT GATGCCAAAG AGAAAATTTC TGCGGGCTAT TTCCGCGTTA TCCGCAATTA CTATCGTTTC GGTTGGGTCA TTCCTTATCT GTTTGGCGCG TCTCCGGCGA TTTGTTCTTC TTTCCTGCAA GGAAAACCAA CATCGCTGCC GTTTGAGAAA ACCGAGTGCG GTATGTATTA CCTGCCGTAT GCGACCTCTC TTCGTTTGAG TGATCTCGGC TATACCAATA AATCGCAAAG CAATCTTGGT ATTACCTTCA ACGATCTTTA CGAGTACGTA GCGGGCCTTA AACAGGCAAT CAAAACGCCA TCGGAAGAGT ACGCGAAGAT TGGCATTGAG AAAGATGGTA AGAGGCTGCA AATCAACAGC AACGTGCTGC AGATTGAAAA CGAACTGTAC GCGCCGATTC GTCCAAAACG CGTTACCCGC AGCGGCGAGT CGCCTTCTGA TGCGCTGTTA CGTGGCGGCA TTGAATATAT TGAAGTGCGT TCGCTGGACA TCAACCCATT CTCGCCGATT GGTGTAGATG AACAGCAGGT GCGATTCCTC GACCTGTTTA TGGTCTGGTG TGCGCTGGCT GATGCACCGG AAATGAGCAG TAGCGAACTT GCCTGTACAC GCGTTAACTG GAACCGGGTG ATCCTCGAAG GTCGCAAACC GGGCCTGACG CTGGGTATCG GCTGCGAAAC CGCGCAGTTC CCGTTACCGC AGGTGGGTAA AGATCTGTTC CGCGATCTGA AACGCGTCGC GCAAACGCTG GATAGCATTA ACGGCGGCGA AGCGTATCAG AAAGTGTGTG ATGAACTGGT CGCCTGCTTC GATAATCCCG ATCTGACTTT CTCTGCCCGT ATCTTAAGGT CTATGATTGA TACTGGTATT GGCGGAACAG GCAAAGCGTT TGCTGAAGCG TACCGTAATC TGCTGCGTGA AGAGCCGCTG GAAATTCTGC GCGAAGAGGA TTTTGTCGCC GAGCGCGAGG CGTCTGAACG CCGTCAGCAG GAAATGGAAG CCGCTGATAC CGAACCGTTT GCGGTGTGGC TGGAAAAACA CGCCTGA
|
Protein sequence | MIPDVSQALA WLEKHPQALK GIQRGLERET LRVNADGTLA TTGHPEALGS ALTHKWITTD FAEALLEFIT PVDGDIEHML TFMRDLHRYT ARNMGDERMW PLSMPCYIAE GQDIELAQYG TSNTGRFKTL YREGLKNRYG ALMQTISGVH YNFSLPMAFW QAKCGDIAGA DAKEKISAGY FRVIRNYYRF GWVIPYLFGA SPAICSSFLQ GKPTSLPFEK TECGMYYLPY ATSLRLSDLG YTNKSQSNLG ITFNDLYEYV AGLKQAIKTP SEEYAKIGIE KDGKRLQINS NVLQIENELY APIRPKRVTR SGESPSDALL RGGIEYIEVR SLDINPFSPI GVDEQQVRFL DLFMVWCALA DAPEMSSSEL ACTRVNWNRV ILEGRKPGLT LGIGCETAQF PLPQVGKDLF RDLKRVAQTL DSINGGEAYQ KVCDELVACF DNPDLTFSAR ILRSMIDTGI GGTGKAFAEA YRNLLREEPL EILREEDFVA EREASERRQQ EMEAADTEPF AVWLEKHA
|
| |
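The gene sequence above begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG can serve as an alternative start codon that is read as methionine at the first position. The sketch below illustrates this on the first 30 bases of the gene sequence. The function names `translate_cds` and `gc_percent` are hypothetical helpers written for this example, and the set of start codons is given as understood from NCBI's description of table 11; treat both as assumptions rather than as the database's own method.

```python
# Codon assignments in NCBI's compact layout (base order T, C, A, G).
# Table 11 uses the same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence given 5'->3' on the coding strand.

    Whitespace is ignored, so the space-separated blocks of ten bases
    used in the record can be pasted in directly. A recognised start
    codon in first position is read as M. Translation stops at the
    first stop codon. Ambiguous bases such as N raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence shown above.
fragment = "TTGATCCCGG ACGTATCACA GGCGCTGGCC"
print(translate_cds(fragment))  # prints: MIPDVSQALA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.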