Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3651 |
Symbol | cysM |
ID | 6968733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3366220 |
End bp | 3367131 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643387445 |
Product | cysteine synthase B |
Protein accession | YP_002271898 |
Protein GI | 209397165 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01138] cysteine synthase B |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.370065 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTACAT TAGAACAAAC AATAGGCAAT ACGCCTCTGG TGAAGTTGCA GCGAATGGGG CCGAATAACG GCAGTGAAGT GTGGTTAAAA CTGGAAGGCA ATAACCCGGC AGGTTCGGTA AAAGATCGTG CGGCGCTTTC GATGATCGTC GAGGCGGAAA AGCGCGGGGA AATTAAACCG GGTGATGTGT TAATCGAAGC CACCAGTGGT AACACCGGCA TTGCGCTGGC AATGATTGCC GCGCTTAAAG GCTATCGCAT GAAATTGCTG ATGCCCGACA ACATGAGCCA GGAACGCCGT GCGGCGATGC GTGCTTATGG TGCGGAACTG ATTCTGGTCA CCAAAGAGCA GGGCATGGAA GGTGCGCGCG ATCTGGCGCT GGAGATGGCG GATCGTGGTG AAGGAAAGCT GCTCGATCAG TTCAATAATC CCGATAACCC TTACGCCCAT TACACCACTA CCGGGCCGGA AATCTGGCAG CAGACTGGCG GGCGCATTAC CTATTTTGTC TCCAGCATGG GAACAACCGG CACTATCACC GGCGTTTCAC GCTTTATGCG CGAACAATCC AAACCGGTGA CCATTGTTGG CCTGCAGCCG GAAGAGGGCA GTAGCATTCC CGGCATTCGC CGCTGGCCTG CGGAATATCT GCCGGGGATT TTCAACGCTT CTCTGGTAGA TGAGGTGCTG GATATTCATC AGCGCGATGC GGAAAACACC ATGCGCGAAC TGGCGGTGCG GGAAGGAATA TTCTGTGGCG TCAGCTCCGG CGGCGCGGTT GCCGGAGCAC TGCGGGTGGC AAAAGCTAAC CCTGGTGCGG TGGTGGTAGC GATCATCTGC GATCGTGGCG ATCGCTACCT TTCCACCGGG GTGTTTGGGG AAGAGCATTT TAGTCAGGGG GCGGGGATTT AA
|
Protein sequence | MSTLEQTIGN TPLVKLQRMG PNNGSEVWLK LEGNNPAGSV KDRAALSMIV EAEKRGEIKP GDVLIEATSG NTGIALAMIA ALKGYRMKLL MPDNMSQERR AAMRAYGAEL ILVTKEQGME GARDLALEMA DRGEGKLLDQ FNNPDNPYAH YTTTGPEIWQ QTGGRITYFV SSMGTTGTIT GVSRFMREQS KPVTIVGLQP EEGSSIPGIR RWPAEYLPGI FNASLVDEVL DIHQRDAENT MRELAVREGI FCGVSSGGAV AGALRVAKAN PGAVVVAIIC DRGDRYLSTG VFGEEHFSQG AGI
|
| |