Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2576 |
Symbol | cysM |
ID | 6143180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2630141 |
End bp | 2631052 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641617447 |
Product | cysteine synthase B |
Protein accession | YP_001744612 |
Protein GI | 170684234 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01138] cysteine synthase B |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.717552 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTACAT TAGAACAAAC AATTGGCAAT ACGCCTCTGG TGAAGTTGCA GCGAATGGGG CCGAATAACG GCAGTGAAGT CTGGTTGAAA CTGGAAGGCA ATAACCCGGC AGGTTCGGTA AAAGATCGTG CGGCACTTTC GATGATCGTC GAGGCGGAAA AGCGCGGCGA GATTAAACCG GGTGATGTGT TGATCGAAGC CACCAGTGGT AACACCGGCA TTGCGCTGGC AATGATTGCC GCGCTTAAAG GCTATCGCAT GAAATTGCTG ATGCCCGACA ACATGAGCCA GGAACGCCGT GCGGCAATGC GTGCCTACGG TGCGGAACTG ATTCTGGTCA CCAAAGAGCA GGGCATGGAA GGTGCGCGCG ATCTGGCGCT GGAGATGGCC AATCGTGGTG AAGGAAAGCT GCTCGATCAG TTCAATAATC CTGATAATCC CTTCGCCCAT TACACCACCA CCGGGCCGGA AATCTGGCAG CAGACTGGCG GGCGCATTAC CCATTTTGTC TCCAGCATGG GAACAACCGG CACTATCACC GGCGTTTCAC GCTTTATGCG CGAACAATCC AAACCGGTGA CCATTGTTGG CCTGCAGCCG GAAGAAGGCA GCAGTATTCC GGGCATTCGC CGCTGGCCTG CGGAATATCT GCCGGGGATT TTCAACGCTT CGCTGGTGGA TGAGGTGCTG GATATTCATC AGCGCGATGC GGAAAACACC ATGCGCGAAC TGGCAGTGCG GGAAGGAATA TTCTGTGGCG TCAGCTCCGG CGGCGCGGTT GCCGGAGCAC TGCGGGTGGC AAAAGCTAAC CCCGGCGCGG TGGTTGTGGC GATCATCTGC GATCGCGGCG ATCGTTACCT TTCTACCGGG GTGTTTGGGG AAGAGCATTT TAGCCAGGGG GCGGGGATTT AA
|
Protein sequence | MSTLEQTIGN TPLVKLQRMG PNNGSEVWLK LEGNNPAGSV KDRAALSMIV EAEKRGEIKP GDVLIEATSG NTGIALAMIA ALKGYRMKLL MPDNMSQERR AAMRAYGAEL ILVTKEQGME GARDLALEMA NRGEGKLLDQ FNNPDNPFAH YTTTGPEIWQ QTGGRITHFV SSMGTTGTIT GVSRFMREQS KPVTIVGLQP EEGSSIPGIR RWPAEYLPGI FNASLVDEVL DIHQRDAENT MRELAVREGI FCGVSSGGAV AGALRVAKAN PGAVVVAIIC DRGDRYLSTG VFGEEHFSQG AGI
|
| |