Gene EcSMS35_2892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2892 
SymbolcysJ 
ID6145502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2963999 
End bp2965798 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content57% 
IMG OID641617761 
Productsulfite reductase subunit alpha 
Protein accessionYP_001744916 
Protein GI170680548 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0369] Sulfite reductase, alpha subunit (flavoprotein) 
TIGRFAM ID[TIGR01931] sulfite reductase [NADPH] flavoprotein, alpha-component 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAC AGGTCTCACC TTCCGCGTTG CTTCCGTTGA ACCCGGAGCA ACTGGCACGC 
CTTCAGGCGG CCACGACCGA TTTAACTCCC ACCCAGCTTG CCTGGGTTTC TGGCTATTTC
TGGGGCGTGC TCAACCAGCA GCCTGCTGCG CTTGCAGCGA CGCCAGCGCC CGCCGCAGAA
ATGCCGGGTA TAACTATTAT CTCCGCCTCG CAAACCGGCA ATGCGCGCCG GGTTGCTGAA
GCGTTACGCG ATGACTTATT AGCGGCAAAA CTAAACGTTA AGCTGGTGAA CGCGGGCGAC
TATAAATTCA AACAAATCGC CAGCGAAAAA CTGCTTATCG TGGTGACGTC AACGCAAGGG
GAAGGGGAAC CGCCGGAAGA AGCCGTCGCG CTGCATAAGT TCCTGTTCTC CAAAAAAGCG
CCGAAGCTGG AAAACACCGC ATTTGCCGTG TTTAGCCTCG GCGATAGCTC TTATGAATTT
TTCTGCCAGT CCGGGAAAGA TTTTGACAGC AAGCTGGCGG AACTGGGCGG TGAACGCCTG
CTCGACCGTG TCGATGCCGA TGTTGAATAT CAGGCCGCTG CCAGCGAGTG GCGCGCCCGC
GTGGTTGATG CGCTTAAATC GCGTGCGCCT GTCGCGGCAC CTTCGCAATC CGTCGCTACT
GGCACGGTAA ATGAAATCCA CACCAGCCCG TATAGCAAAG ACGCGCCGCT GGTGGCGAGC
CTCTCTGTTA ACCAGAAAAT TACCGGGCGT AACTCTGAAA AAGACGTTCG CCATATCGAA
ATTGACTTAG GTGACTCGGG CCTGCGTTAC CAGCCGGGTG ACGCGCTGGG CGTCTGGTAT
CAGAACGATC CGGCACTGGT GAAAGAACTT GTCGAACTGC TGTGGCTGAA AGGCGATGAA
CCTGTCACCG TTGAGGGCAA AACCCTGCCG CTGAATGAGG CGCTACAGTG GCATTTTGAA
CTGACCGTCA ACACCGCCAA CATTGTTGAG AACTATGCCA CGCTTACCCG CAGCGAAACG
CTGCTGCCGC TGGTGGGCGA TAAAGCGAAG TTGCAGCATT ACGCCGCGAC GACGCCGATT
GTTGACATGG TGCGTTTCTC TCCGGCGCAA CTGGATGCCG AAGCGCTGAT TAATCTGCTG
CGCCCGCTGA CGCCGCGCCT GTATTCCATC GCCTCTTCGC AGGCGGAAGT CGAGAACGAA
GTACACGTCA CCGTTGGTGT GGTGCGTTAC GACGTGGAAG GCCGCGCCCG TGCCGGTGGT
GCCTCCAGCT TCCTCGCGGA TCGTGTGGAA GAAGAGGGTG AAGTCCGCGT CTTTATCGAA
CATAACGATA ACTTCCGCCT GCCAGCCAAT CCGGAAACGC CGGTGATTAT GATTGGCCCA
GGCACCGGCA TTGCGCCGTT CCGCGCCTTT ATGCAGCAAC GCGCCGCCGA CGAAGCGCCG
GGGAAAAACT GGCTGTTCTT TGGCAACCCG CACTTTACGG AAGATTTCCT CTACCAGGTG
GAATGGCAGC GCTACGTCAA AGAGGGCGTG CTGACGCGTA TCGATCTTGC CTGGTCGCGC
GATCAAAAAG AAAAAATTTA CGTACAAGAC AAACTGCGCG AACAGGGCGC GGAGCTGTGG
CGCTGGATCA ATGACGGTGC CCACATTTAT GTCTGCGGCG ACGCTAATCG CATGGCGAAA
GACGTTGAAC AGGCACTTCT GGAAGTGATT GCCGAATTTG GTGGCATGGA CACCGAAGCG
GCGGATGAAT TTTTAAGTGA GCTGCGCGTA GAGCGCCGTT ATCAGCGAGA TGTCTACTAA
 
Protein sequence
MTTQVSPSAL LPLNPEQLAR LQAATTDLTP TQLAWVSGYF WGVLNQQPAA LAATPAPAAE 
MPGITIISAS QTGNARRVAE ALRDDLLAAK LNVKLVNAGD YKFKQIASEK LLIVVTSTQG
EGEPPEEAVA LHKFLFSKKA PKLENTAFAV FSLGDSSYEF FCQSGKDFDS KLAELGGERL
LDRVDADVEY QAAASEWRAR VVDALKSRAP VAAPSQSVAT GTVNEIHTSP YSKDAPLVAS
LSVNQKITGR NSEKDVRHIE IDLGDSGLRY QPGDALGVWY QNDPALVKEL VELLWLKGDE
PVTVEGKTLP LNEALQWHFE LTVNTANIVE NYATLTRSET LLPLVGDKAK LQHYAATTPI
VDMVRFSPAQ LDAEALINLL RPLTPRLYSI ASSQAEVENE VHVTVGVVRY DVEGRARAGG
ASSFLADRVE EEGEVRVFIE HNDNFRLPAN PETPVIMIGP GTGIAPFRAF MQQRAADEAP
GKNWLFFGNP HFTEDFLYQV EWQRYVKEGV LTRIDLAWSR DQKEKIYVQD KLREQGAELW
RWINDGAHIY VCGDANRMAK DVEQALLEVI AEFGGMDTEA ADEFLSELRV ERRYQRDVY