Gene EcHS_A2904 details


Gene Information

Locus tag: EcHS_A2904
Symbol: cysJ
ID: 5592258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2907331
End bp: 2909130
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 56%
IMG OID: 640922021
Product: sulfite reductase subunit alpha
Protein accession: YP_001459532
Protein GI: 157162214
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0369] Sulfite reductase, alpha subunit (flavoprotein)
TIGRFAM ID: [TIGR01931] sulfite reductase [NADPH] flavoprotein, alpha-component
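
The length fields agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the stop codon is not counted in the protein length. Both readings are assumptions; the record does not state them. A minimal sketch of that arithmetic, with illustrative variable names:

```python
# Coordinates copied from the record above.
start_bp = 2907331
end_bp = 2909130

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1800
print(protein_length)  # 599
```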


Plasmid Coverage information

Num covering plasmid clones: 69
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGACAC AGGTCCCACC TTCCGCGTTG CTTCCGTTGA ACCCGGAGCA ACTGGCACGC 
CTTCAGGCGG CCACGACCGA TTTAACTCCC ACCCAGCTTG CCTGGGTTTC TGGCTATTTC
TGGGGCGTGC TCAATCAGCA GCCTGCTGCG CTTGCAGCGA CGCCAGCGCC AGCCGCAGAA
ATGCCGGGTA TAACTATTAT CTCCGCCTCG CAAACCGGCA ATGCGCGCCG GGTTGCTGAA
GCATTACGCG ATGATTTATT GACGGCAAAA CTGAACGTTA AACTGGTGAA CGCGGGCGAC
TATAAATTCA AACAAATCGC CAGCGAAAAA CTGCTTATCG TGGTGACGTC AACGCAAGGA
GAAGGGGAAC CGCCGGAAGA AGCCGTCGCG CTGCATAAGT TCCTGTTCTC CAAAAAAGCG
CCGAAGCTGG AAAACACCGC ATTTGCCGTG TTTAGCCTCG GCGATAGCTC TTATGAATTT
TTCTGTCAGT CCGGGAAAGA TTTTGACAGC AAGCTGGCGG AACTGGGCGG TGAACGCCTG
CTCGACCGTG TCGATGCCGA CGTTGAATAC CAGACCGCTG CCAGCGAGTG GCGCGCTCGC
GTGGTTGATG CGCTTAAATC GCGTGCGCCT GTCGCGGCAC CTTCGCAATC CGTCGCTACT
GGCGCGGTAA ATGAAATCCA CACCAGCCCG TACAGCAAAG ACTCGCCGCT GGTGGCGAGC
CTCTCTGTTA ACCAGAAAAT TACCGGGCGT AACTCTGAAA AAGACGTTCG CCATATCGAA
ATTGACTTAG GTGACTCGGG CCTGCGTTAC CAGCCGGGTG ACGCGCTGGG TGTCTGGTAT
CAGAACGATC CGGCACTGGT GAAAGAACTT GTCGAACTGC TGTGGCTGAA AGGCGATGAA
CCTGTCACCG TCGAGGGCAA AACGTTGCCT CTGAACGAAG CGCTACAGTG GCACTTCGAA
CTGACCGTCA ACACCGCCAA CATTGTTGAG AATTACGCCA CGCTTACCCG CAGCGAAACA
CTGCTGCCGC TGGTGGGCGA TAAAGCGAAG TTACAGCATT ACGCCGCGAC GACGCCGATT
GTTGACATGG TGCGTTTCTC CCCGGCACAG CTTGATGCCG AAGCGCTAAT TAATCTGCTG
CGCCCGCTGA CGCCGCGTCT CTATTCCATT GCCTCCTCGC AGGCGGAAGT CGAGAACGAA
GTACACGTCA CCGTTGGTGT GGTGCGTTAC GACGTGGAAG GCCGCGCCCG TGCCGGTGGT
GCCTCCAGCT TCCTCGCTGA CCGCGTGGAA GAAGAGGGCG AAGTCCGCGT ATTTATCGAA
CATAACGATA ACTTCCGCCT ACCAGCCAAT CCAGAAACCC CGGTGATTAT GATTGGCCCA
GGCACCGGTA TCGCGCCGTT CCGCGCCTTT ATGCAGCAAC GTGCTGCCGA TGAAGCACCG
GGTAAAAACT GGCTGTTCTT TGGCAACCCG CATTTTACGG AAGATTTCCT CTACCAGGTG
GAGTGGCAGC GCTACGTCAA AGAGGGCGTG CTGACACGTA TCGATCTTGC CTGGTCGCGC
GATCAAAAAG AAAAAGTTTA CGTACAAGAC AAACTGCGCG AACAGGGCGC GGAGCTGTGG
CGCTGGATCA ATGACGGTGC CCACATTTAT GTCTGCGGCG ACGCTAATCG CATGGCGAAA
GACGTTGAGC AGGCACTTCT GGAAGTGATT GCCGAATTTG GTGGCATGGA CACCGAAGCG
GCGGATGAAT TTTTAAGTGA GCTGCGCGTA GAGCGCCGTT ATCAGCGAGA TGTCTACTAA
 
Protein sequence
MTTQVPPSAL LPLNPEQLAR LQAATTDLTP TQLAWVSGYF WGVLNQQPAA LAATPAPAAE 
MPGITIISAS QTGNARRVAE ALRDDLLTAK LNVKLVNAGD YKFKQIASEK LLIVVTSTQG
EGEPPEEAVA LHKFLFSKKA PKLENTAFAV FSLGDSSYEF FCQSGKDFDS KLAELGGERL
LDRVDADVEY QTAASEWRAR VVDALKSRAP VAAPSQSVAT GAVNEIHTSP YSKDSPLVAS
LSVNQKITGR NSEKDVRHIE IDLGDSGLRY QPGDALGVWY QNDPALVKEL VELLWLKGDE
PVTVEGKTLP LNEALQWHFE LTVNTANIVE NYATLTRSET LLPLVGDKAK LQHYAATTPI
VDMVRFSPAQ LDAEALINLL RPLTPRLYSI ASSQAEVENE VHVTVGVVRY DVEGRARAGG
ASSFLADRVE EEGEVRVFIE HNDNFRLPAN PETPVIMIGP GTGIAPFRAF MQQRAADEAP
GKNWLFFGNP HFTEDFLYQV EWQRYVKEGV LTRIDLAWSR DQKEKVYVQD KLREQGAELW
RWINDGAHIY VCGDANRMAK DVEQALLEVI AEFGGMDTEA ADEFLSELRV ERRYQRDVY
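
The sequences above are printed in space-separated blocks of ten, so the whitespace has to be removed before any analysis. The following is a rough sketch, not the database's own method, of how the GC content and the translation could be recomputed from the gene sequence. The helper names `clean`, `gc_percent`, and `translate` are hypothetical. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code, and it ignores the alternative start codons that table 11 allows.

```python
from itertools import product

# Standard codon assignments in NCBI order (T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGACGACAC AGGTCCCACC TTCCGCGTTG"
print(translate(first_blocks))  # MTTQVPPSAL
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way should allow a comparison with the protein sequence and the GC content field listed in the record.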