Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2683 |
Symbol | iscS |
ID | 6147223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2755832 |
End bp | 2757046 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617554 |
Product | cysteine desulfurase |
Protein accession | YP_001744719 |
Protein GI | 170682464 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR02006] cysteine desulfurase IscS [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.980146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.976252 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAC CGATTTATCT CGACTACTCC GCAACCACGC CGGTGGACCC GCGTGTTGCC GAGAAAATGA TGCAGTTTAT GACGATGGAC GGAACCTTTG GTAACCCGGC CTCCCGTTCT CACCGTTTCG GCTGGCAGGC TGAAGAAGCG GTTGATATCG CCCGTAATCA GATTGCAGAT CTGGTCGGCG CTGACCCGCG TGAAATCGTC TTTACCTCTG GTGCAACCGA ATCTGACAAC CTGGCGATCA AAGGTGCAGC CAACTTTTAT CAGAAAAAAG GCAAGCACAT CATCACCAGC AAAACCGAAC ACAAAGCGGT ACTGGATACC TGCCGTCAGC TGGAGCGCGA AGGTTTCGAA GTCACCTACC TGGCACCGCA GCGTAACGGC ATTATCGACC TGAAAGAACT TGAAGCAGCG ATGCGTGACG ACACCATCCT CGTTTCTATC ATGCACGTGA ATAACGAAAT CGGCGTGGTG CAGGATATCG CGGCTATCGG CGAAATGTGC CGTGCTCGCG GCATTATCTA TCACGTTGAT GCAACCCAGA GCGTGGGCAA ACTGCCTATC GACCTGAGCC AGTTGAAAGT TGACCTGATG TCATTCTCCG GTCACAAAAT CTATGGCCCG AAAGGTATCG GTGCGCTGTA TGTGCGTCGT AAACCGCGCG TACGCATCGA AGCGCAAATG CACGGTGGCG GTCACGAGCG CGGTATGCGT TCCGGCACTC TGCCTGTTCA CCAGATCGTT GGTATGGGCG AAGCCTATCG CATCGCAAAA GAAGAGATGG CGACCGAGAT GGAACGTCTG CGCGGCCTGC GTAACCGTCT GTGGAACGGC ATCAAAGATA TCGAAGAAGT TTACCTGAAC GGTGACCTGG AACACGGTGC GCCGAACATT CTCAACGTCA GCTTCAACTA CGTTGAAGGT GAGTCGCTGA TTATGGCGCT GAAAGACCTC GCGGTTTCTT CAGGTTCCGC CTGTACGTCA GCAAGCCTCG AACCGTCCTA CGTGCTGCGC GCGCTGGGGC TGAACGACGA GCTGGCACAT AGCTCTATCC GTTTCTCTTT AGGTCGTTTT ACTACTGAAG AAGAGATCGA CTACACCATC GAGTTAGTTC GTAAATCCAT CGGTCGTCTG CGTGACCTTT CTCCGCTGTG GGAAATGTAC AAGCAGGGCG TGGATCTGAA CAGCATCGAA TGGGCACATC ATTAA
|
Protein sequence | MKLPIYLDYS ATTPVDPRVA EKMMQFMTMD GTFGNPASRS HRFGWQAEEA VDIARNQIAD LVGADPREIV FTSGATESDN LAIKGAANFY QKKGKHIITS KTEHKAVLDT CRQLEREGFE VTYLAPQRNG IIDLKELEAA MRDDTILVSI MHVNNEIGVV QDIAAIGEMC RARGIIYHVD ATQSVGKLPI DLSQLKVDLM SFSGHKIYGP KGIGALYVRR KPRVRIEAQM HGGGHERGMR SGTLPVHQIV GMGEAYRIAK EEMATEMERL RGLRNRLWNG IKDIEEVYLN GDLEHGAPNI LNVSFNYVEG ESLIMALKDL AVSSGSACTS ASLEPSYVLR ALGLNDELAH SSIRFSLGRF TTEEEIDYTI ELVRKSIGRL RDLSPLWEMY KQGVDLNSIE WAHH
|
| |