Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4017 |
Symbol | cysI |
ID | 6970800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3712902 |
End bp | 3714614 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643387784 |
Product | sulfite reductase subunit beta |
Protein accession | YP_002272227 |
Protein GI | 209400070 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0155] Sulfite reductase, beta subunit (hemoprotein) |
TIGRFAM ID | [TIGR02041] sulfite reductase (NADPH) hemoprotein, beta-component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.96721 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAA AACATCCAGG GCCTTTAGTG GTCGAAGGAA AACTGACAGA CGCCGAGCGC ATGAAGCTTG AAAGCAACTA CCTGCGCGGC ACCATTGCGG AAGATTTAAA CGACGGCCTG ACCGGCGGCT TTAAGGGCGA TAACTTCCTG CTGATCCGCT TCCACGGCAT GTATCAGCAG GATGACCGCG ACATCCGCGC CGAACGTGCT GAACAGAAGC TGGAGCCGCG CCATGCGATG CTGTTGCGCT GCCGTCTGCC GGGTGGAGTT ATCACCACCA AACAGTGGCA GGCGATCGAT AAATTTGCCG GTGAAAACAC CATCTATGGC AGCATTCGCC TGACCAACCG CCAGACATTC CAGTTCCACG GCATTCTGAA AAAGAACGTC AAACCGGTGC ACCAGATGCT GCACTCGGTC GGTCTTGATG CGCTGGCGAC CGCTAACGAC ATGAACCGTA ACGTACTCTG CACCTCGAAC CCTTACGAGT CGCAGCTGCA CGCAGAAGCG TACGAGTGGG CGAAGAAAAT CTCTGAACAT CTGCTGCCTC GTACCCGCGC GTATGCGGAG ATCTGGCTCG ATCAGGAAAA AGTCGCCACC ACCGATGAAG AACCGATCCT CGGCCAGACC TACCTGCCGC GTAAATTCAA AACCACGGTA GTGATCCCGC CACAGAACGA TATCGATCTG CACGCCAACG ACATGAACTT CGTGGCAATC GCCGAAAACG GCAAGCTGGT GGGCTTTAAC CTGTTGGTGG GCGGTGGACT TTCTATCGAA CACGGCAACA AGAAAACCTA CGCCCGCACG GCGAGTGAGT TTGGCTATCT GCCGCTGGAG CATACGCTGG CAGTGGCGGA AGCCGTCGTG ACAACTCAGC GTGACTGGGG TAACCGAACC GATCGTAAAA ATGCCAAAAC CAAATACACG CTGGAGCGCG TGGGGGTTGA GACGTTTAAA GCGGAAGTGG AGCGTCGCGC GGGGATCAAA TTTGAACCGA TCCGTCCGTA TGAGTTCACC GGACGCGGAG ATCGTATTGG CTGGGTTAAG GGCATTGATG ATAACTGGCA CCTGACGCTG TTTATCGAAA ATGGTCGCAT CCTTGATTAT CCGGGGCGTC CGCTGAAAAC CGGCCTGCTG GAGATCGCGA AGATCCACAA AGGTGATTTC CGCATTACGG CGAACCAGAA TCTGATCATC GCCGGTGTGC CGGAAAGCGA GAAAGCGAAG ATCGAGAAGA TCGCCAAAGA GAGCGGGTTA ATGAATGCCG TCACGCCGCA GCGTGAAAAC TCAATGGCCT GCGTCTCGTT CCCGACTTGC CCGCTGGCGA TGGCGGAAGC GGAGCGTTTC CTGCCGTCTT TTATCGACAA CATCGATAAT TTAATGGCGA AACATGGTGT CAGCGATGAG CATATCGTGA TGCGTGTAAC AGGCTGCCCG AACGGTTGTG GTCGCGCGAT GCTGGCGGAA GTGGGCCTGG TGGGTAAAGC GCCGGGTCGC TACAACCTGC ATCTTGGCGG CAACCGCATT GGGACACGTA TCCCACGGAT GTATAAAGAA AACATCACCG AGCCGGAAAT CCTGGCGTCG CTTGATGAAC TGATAGGGCG CTGGGCGAAA GAGCGCGAAG CGGGTGAAGG CTTCGGCGAC TTTACGGTGC GTGCGGGCAT CATTCGCCCG GTGCTCGATC CGGCGCGCGA TTTGTGGGAT TAA
|
Protein sequence | MSEKHPGPLV VEGKLTDAER MKLESNYLRG TIAEDLNDGL TGGFKGDNFL LIRFHGMYQQ DDRDIRAERA EQKLEPRHAM LLRCRLPGGV ITTKQWQAID KFAGENTIYG SIRLTNRQTF QFHGILKKNV KPVHQMLHSV GLDALATAND MNRNVLCTSN PYESQLHAEA YEWAKKISEH LLPRTRAYAE IWLDQEKVAT TDEEPILGQT YLPRKFKTTV VIPPQNDIDL HANDMNFVAI AENGKLVGFN LLVGGGLSIE HGNKKTYART ASEFGYLPLE HTLAVAEAVV TTQRDWGNRT DRKNAKTKYT LERVGVETFK AEVERRAGIK FEPIRPYEFT GRGDRIGWVK GIDDNWHLTL FIENGRILDY PGRPLKTGLL EIAKIHKGDF RITANQNLII AGVPESEKAK IEKIAKESGL MNAVTPQREN SMACVSFPTC PLAMAEAERF LPSFIDNIDN LMAKHGVSDE HIVMRVTGCP NGCGRAMLAE VGLVGKAPGR YNLHLGGNRI GTRIPRMYKE NITEPEILAS LDELIGRWAK EREAGEGFGD FTVRAGIIRP VLDPARDLWD
|
| |