Gene Information |
Locus tag | ECH74115_4018 |
Symbol | cysJ |
ID | 6971249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3714614 |
End bp | 3716413 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643387785 |
Product | sulfite reductase subunit alpha |
Protein accession | YP_002272228 |
Protein GI | 209400763 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0369] Sulfite reductase, alpha subunit (flavoprotein) |
TIGRFAM ID | [TIGR01931] sulfite reductase [NADPH] flavoprotein, alpha-component |
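The coordinate and length fields above agree with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, excluding one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(3714614, 3716413)
print(length_bp, protein_length(length_bp))  # 1800 599
```

The strand field does not affect this calculation, since the length depends only on the span between the two coordinates.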
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.104123 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACAC AGGTCCCACC TTCCGCGTTG CTTCCGTTGA ACCCGGAGCA ACTGGCACGC CTTCAGGCGG CCACGACCGA TTTAACTCCC ACCCAGCTTG CCTGGGTTTC TGGCTATTTC TGGGGCGTGC TCAATCAGCA GCCTGCTGCG CTTGCAGCGA CGCCAGCGCC AGCCGCAGAA ATGCCGGGTA TAACTATTAT CTCTGCTTCG CAAACCGGCA ATGCGCGCCG GGTTGCTGAA GCATTACGCG ATGATTTATT GACGGCAAAA CTGAACGTTA AGCTGGTGAA CGCGGGCGAC TATAAATTCA AACAAATCGC CAGCGAAAAA CTGCTTATCG TGGTGACGTC AACGCAAGGA GAAGGGGAAC CGCCGGAAGA AGCCGTCGCG CTGCATAAGT TCCTGTTCTC CAAAAAAGCG CCGAAGCTGG AAAATACCGC TTTTGCCGTG TTTAGCCTCG GCGATAGTTC TTATGAATTT TTCTGCCAGT CCGGGAAAGA TTTCGACAGC AAGCTGGCGG AACTGGGCGG TGAACGCCTG CTCGACCGTG TCGACGCCGA CGTTGAATAC CAGGCTGCTG CCAGCGAGTG GCGCGCCCGC GTGGTTGATG CGCTTAAATC GCGTGCGCCT GTCGCGGCAC CTTCGCAATC CGTCGCTACT GGCGTGGTAA ATGAAATCCA CACCAGCCCG TACAGCAAAG ACGCGCCGCT GGTAGCGAGC CTTTCGGTTA ACCAGAAAAT TACCGGGCGT AACTCTGAAA AAGACGTTCG CCATATCGAA ATTGACTTAG GTGACTCGGG CCTGCGTTAC CAGCCGGGTG ACGCGCTGGG TGTCTGGTAT CAGAACGATC CGGCACTGGT GAAAGAACTT GTCGAACTGC TGTGGCTGAA AGGCGATGAA CCTGTCACCG TCGAGGGCAA AACGTTGCCT CTGAACGAAG CGCTACAGTG GCACTTCGAA CTGACCGTCA ACACCGCCAA TATTGTTGAG AACTATGCCA CGCTTACCCG CAGCGAAACG CTGTTGCCGC TGGTGGGCGA TAAAGCGAAG TTACAGCATT ACGCCGCGAC TACGCCGATT GTCGACATGG TGCGCTTCTC TCCGGCGCAA CTGGACGCCG AAGCGCTGAT TAATCTGCTG CGCCCGCTGA CGCCGCGTCT GTATTCCATC GCCTCCTCGC AGGAGGAAGT CGAGAACGAA GTACACGTCA CCGTTGGTGT GGTGCGTTAC GACGTGGAAG GCCGCGCCCG TGCCGGTGGT GCCTCCAGCT TCCTCGCGGA CCGCGTGGAA GAAGAGGGCG AAGTTCGCGT ATTTATCGAA CATAACGATA ACTTCCGCCT GCCCGCTAAC CCGGAAACCC CGGTGATTAT GATTGGCCCA GGCACCGGCA TCGCGCCGTT CCGCGCCTTT ATGCAGCAGC GCGCCGCTGA CGAAGCGCCG GGTAAAAACT GGCTGTTCTT TGGCAACCCG CACTTTACGG AAGATTTCCT CTACCAGGTG GAGTGGCAGC GTTACGTCAA AGAGGGCGTG CTGACGCGTA TCGATCTTGC CTGGTCGCGC GATCAAAAAG AAAAAGTTTA CGTACAAGAC AAACTGCGCG AACAGGGCGC GGAGCTGTGG CGCTGGATCA ATGACGGTGC CCACATTTAT GTCTGCGGCG ACGCTAATCG CATGGCGAAA GACGTTGAGC AGGCACTTCT GGAAGTGATT GCCGAATTTG GTGGCATGGA CACCGAAGCG GCGGATGAAT TTTTAAGTGA GCTGCGCGTA GAGCGCCGTT ATCAGCGAGA TGTCTACTAA
|
Protein sequence | MTTQVPPSAL LPLNPEQLAR LQAATTDLTP TQLAWVSGYF WGVLNQQPAA LAATPAPAAE MPGITIISAS QTGNARRVAE ALRDDLLTAK LNVKLVNAGD YKFKQIASEK LLIVVTSTQG EGEPPEEAVA LHKFLFSKKA PKLENTAFAV FSLGDSSYEF FCQSGKDFDS KLAELGGERL LDRVDADVEY QAAASEWRAR VVDALKSRAP VAAPSQSVAT GVVNEIHTSP YSKDAPLVAS LSVNQKITGR NSEKDVRHIE IDLGDSGLRY QPGDALGVWY QNDPALVKEL VELLWLKGDE PVTVEGKTLP LNEALQWHFE LTVNTANIVE NYATLTRSET LLPLVGDKAK LQHYAATTPI VDMVRFSPAQ LDAEALINLL RPLTPRLYSI ASSQEEVENE VHVTVGVVRY DVEGRARAGG ASSFLADRVE EEGEVRVFIE HNDNFRLPAN PETPVIMIGP GTGIAPFRAF MQQRAADEAP GKNWLFFGNP HFTEDFLYQV EWQRYVKEGV LTRIDLAWSR DQKEKVYVQD KLREQGAELW RWINDGAHIY VCGDANRMAK DVEQALLEVI AEFGGMDTEA ADEFLSELRV ERRYQRDVY
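The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC percentage calculation of the kind summarized in the GC content field. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence shown above.
first_blocks = "ATGACGACAC AGGTCCCACC"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_percent(first_blocks))           # 60.0
```

Passing the complete gene sequence string to `gc_percent` would give the value to compare against the rounded figure reported in the record.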
|
| |