Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1147 |
Symbol | |
ID | 6068059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1251339 |
End bp | 1252553 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641600563 |
Product | cysteine desulfurase |
Protein accession | YP_001724141 |
Protein GI | 170019187 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR02006] cysteine desulfurase IscS [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0105086 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAC CGATTTATCT CGACTACTCC GCAACCACGC CGGTGGACCC GCGTGTTGCC GAGAAAATGA TGCAGTTTAT GACGATGGAC GGAACCTTTG GTAACCCGGC CTCCCGTTCT CACCGTTTCG GCTGGCAGGC TGAAGAAGCG GTAGATATCG CCCGTAATCA GATTGCCGAT CTGGTCGGCG CTGATCCGCG TGAAATCGTC TTTACCTCTG GTGCAACCGA ATCTGACAAC CTGGCGATCA AAGGTGCAGC CAACTTTTAT CAGAAAAAAG GCAAGCACAT CATCACCAGC AAAACTGAAC ACAAAGCGGT ACTGGATACC TGCCGTCAGC TGGAGCGCGA AGGTTTTGAA GTCACCTACC TGGCACCGCA GCGTAACGGC ATTATCGACC TGAAAGAACT TGAAGCAGCG ATGCGTGACG ACACCATCCT CGTTTCTATC ATGCACGTGA ATAACGAAAT CGGCGTGGTA CAGGATATCG CGGCTATCGG CGAAATGTGC CGTGCTCGTG GCATTATCTA TCACGTTGAT GCAACCCAGA GCGTGGGCAA ACTGCCTATC GATCTGAGCC AGTTGAAAGT TGACCTGATG TCTTTCTCCG GTCACAAAAT CTATGGCCCG AAAGGTATTG GTGCGCTGTA TGTGCGTCGT AAACCGCGCG TACGCATCGA AGCGCAAATG CACGGCGGCG GTCACGAACG CGGTATGCGT TCCGGCACTC TGCCTGTTCA CCAGATCGTC GGAATGGGCG AAGCCTATCG CATCGCAAAA GAAGAGATGG CGACCGAGAT GGAACGTCTG CGCGGACTGC GTAACCGTCT GTGGAACGGC ATCAAAGATA TCGAAGAAGT TTACCTGAAC GGTGACCTGG AACACGGTGC GCCGAACATT CTCAACGTCA GCTTCAACTA CGTTGAAGGT GAGTCGCTGA TTATGGCGCT GAAAGACCTC GCGGTTTCTT CCGGTTCCGC CTGTACGTCA GCAAGCCTCG AACCGTCCTA CGTGCTGCGC GCGCTGGGGC TGAACGACGA GCTGGCACAT AGCTCTATCC GTTTCTCTTT AGGTCGTTTT ACTACTGAAG AAGAGATCGA CTACACCATC GAGTTAGTTC GTAAATCCAT CGGTCGTCTG CGTGACCTTT CTCCGCTGTG GGAAATGTAC AAGCAGGGCG TGGATCTGAA CAGCATCGAA TGGGCTCATC ATTAA
|
Protein sequence | MKLPIYLDYS ATTPVDPRVA EKMMQFMTMD GTFGNPASRS HRFGWQAEEA VDIARNQIAD LVGADPREIV FTSGATESDN LAIKGAANFY QKKGKHIITS KTEHKAVLDT CRQLEREGFE VTYLAPQRNG IIDLKELEAA MRDDTILVSI MHVNNEIGVV QDIAAIGEMC RARGIIYHVD ATQSVGKLPI DLSQLKVDLM SFSGHKIYGP KGIGALYVRR KPRVRIEAQM HGGGHERGMR SGTLPVHQIV GMGEAYRIAK EEMATEMERL RGLRNRLWNG IKDIEEVYLN GDLEHGAPNI LNVSFNYVEG ESLIMALKDL AVSSGSACTS ASLEPSYVLR ALGLNDELAH SSIRFSLGRF TTEEEIDYTI ELVRKSIGRL RDLSPLWEMY KQGVDLNSIE WAHH
|
| |