Gene ECH74115_4018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4018 
SymbolcysJ 
ID6971249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3714614 
End bp3716413 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content57% 
IMG OID643387785 
Productsulfite reductase subunit alpha 
Protein accessionYP_002272228 
Protein GI209400763 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0369] Sulfite reductase, alpha subunit (flavoprotein) 
TIGRFAM ID[TIGR01931] sulfite reductase [NADPH] flavoprotein, alpha-component 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.104123 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAC AGGTCCCACC TTCCGCGTTG CTTCCGTTGA ACCCGGAGCA ACTGGCACGC 
CTTCAGGCGG CCACGACCGA TTTAACTCCC ACCCAGCTTG CCTGGGTTTC TGGCTATTTC
TGGGGCGTGC TCAATCAGCA GCCTGCTGCG CTTGCAGCGA CGCCAGCGCC AGCCGCAGAA
ATGCCGGGTA TAACTATTAT CTCTGCTTCG CAAACCGGCA ATGCGCGCCG GGTTGCTGAA
GCATTACGCG ATGATTTATT GACGGCAAAA CTGAACGTTA AGCTGGTGAA CGCGGGCGAC
TATAAATTCA AACAAATCGC CAGCGAAAAA CTGCTTATCG TGGTGACGTC AACGCAAGGA
GAAGGGGAAC CGCCGGAAGA AGCCGTCGCG CTGCATAAGT TCCTGTTCTC CAAAAAAGCG
CCGAAGCTGG AAAATACCGC TTTTGCCGTG TTTAGCCTCG GCGATAGTTC TTATGAATTT
TTCTGCCAGT CCGGGAAAGA TTTCGACAGC AAGCTGGCGG AACTGGGCGG TGAACGCCTG
CTCGACCGTG TCGACGCCGA CGTTGAATAC CAGGCTGCTG CCAGCGAGTG GCGCGCCCGC
GTGGTTGATG CGCTTAAATC GCGTGCGCCT GTCGCGGCAC CTTCGCAATC CGTCGCTACT
GGCGTGGTAA ATGAAATCCA CACCAGCCCG TACAGCAAAG ACGCGCCGCT GGTAGCGAGC
CTTTCGGTTA ACCAGAAAAT TACCGGGCGT AACTCTGAAA AAGACGTTCG CCATATCGAA
ATTGACTTAG GTGACTCGGG CCTGCGTTAC CAGCCGGGTG ACGCGCTGGG TGTCTGGTAT
CAGAACGATC CGGCACTGGT GAAAGAACTT GTCGAACTGC TGTGGCTGAA AGGCGATGAA
CCTGTCACCG TCGAGGGCAA AACGTTGCCT CTGAACGAAG CGCTACAGTG GCACTTCGAA
CTGACCGTCA ACACCGCCAA TATTGTTGAG AACTATGCCA CGCTTACCCG CAGCGAAACG
CTGTTGCCGC TGGTGGGCGA TAAAGCGAAG TTACAGCATT ACGCCGCGAC TACGCCGATT
GTCGACATGG TGCGCTTCTC TCCGGCGCAA CTGGACGCCG AAGCGCTGAT TAATCTGCTG
CGCCCGCTGA CGCCGCGTCT GTATTCCATC GCCTCCTCGC AGGAGGAAGT CGAGAACGAA
GTACACGTCA CCGTTGGTGT GGTGCGTTAC GACGTGGAAG GCCGCGCCCG TGCCGGTGGT
GCCTCCAGCT TCCTCGCGGA CCGCGTGGAA GAAGAGGGCG AAGTTCGCGT ATTTATCGAA
CATAACGATA ACTTCCGCCT GCCCGCTAAC CCGGAAACCC CGGTGATTAT GATTGGCCCA
GGCACCGGCA TCGCGCCGTT CCGCGCCTTT ATGCAGCAGC GCGCCGCTGA CGAAGCGCCG
GGTAAAAACT GGCTGTTCTT TGGCAACCCG CACTTTACGG AAGATTTCCT CTACCAGGTG
GAGTGGCAGC GTTACGTCAA AGAGGGCGTG CTGACGCGTA TCGATCTTGC CTGGTCGCGC
GATCAAAAAG AAAAAGTTTA CGTACAAGAC AAACTGCGCG AACAGGGCGC GGAGCTGTGG
CGCTGGATCA ATGACGGTGC CCACATTTAT GTCTGCGGCG ACGCTAATCG CATGGCGAAA
GACGTTGAGC AGGCACTTCT GGAAGTGATT GCCGAATTTG GTGGCATGGA CACCGAAGCG
GCGGATGAAT TTTTAAGTGA GCTGCGCGTA GAGCGCCGTT ATCAGCGAGA TGTCTACTAA
 
Protein sequence
MTTQVPPSAL LPLNPEQLAR LQAATTDLTP TQLAWVSGYF WGVLNQQPAA LAATPAPAAE 
MPGITIISAS QTGNARRVAE ALRDDLLTAK LNVKLVNAGD YKFKQIASEK LLIVVTSTQG
EGEPPEEAVA LHKFLFSKKA PKLENTAFAV FSLGDSSYEF FCQSGKDFDS KLAELGGERL
LDRVDADVEY QAAASEWRAR VVDALKSRAP VAAPSQSVAT GVVNEIHTSP YSKDAPLVAS
LSVNQKITGR NSEKDVRHIE IDLGDSGLRY QPGDALGVWY QNDPALVKEL VELLWLKGDE
PVTVEGKTLP LNEALQWHFE LTVNTANIVE NYATLTRSET LLPLVGDKAK LQHYAATTPI
VDMVRFSPAQ LDAEALINLL RPLTPRLYSI ASSQEEVENE VHVTVGVVRY DVEGRARAGG
ASSFLADRVE EEGEVRVFIE HNDNFRLPAN PETPVIMIGP GTGIAPFRAF MQQRAADEAP
GKNWLFFGNP HFTEDFLYQV EWQRYVKEGV LTRIDLAWSR DQKEKVYVQD KLREQGAELW
RWINDGAHIY VCGDANRMAK DVEQALLEVI AEFGGMDTEA ADEFLSELRV ERRYQRDVY