Gene EcolC_0948 details


Gene Information

Locus tag: EcolC_0948
Symbol: cysJ
ID: 6068352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1031501
End bp: 1033300
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 56%
IMG OID: 641600356
Product: sulfite reductase subunit alpha
Protein accession: YP_001723944
Protein GI: 170018990
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0369] Sulfite reductase, alpha subunit (flavoprotein)
TIGRFAM ID: [TIGR01931] sulfite reductase [NADPH] flavoprotein, alpha-component
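
The start, end, and length fields in this record agree with each other. The sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record itself does not state either convention, and the function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 1031501
END_BP = 1033300


def gene_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(nt_len: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1800 599
```

Both results match the Gene Length and Protein Length fields.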


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.404353
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACAC AGGTCCCACC TTCCGCGTTG CTTCCGTTGA ACCCGGAGCA ACTGGCACGC 
CTTCAGGCGG CCACGACCGA TTTAACGCCT ACCCAGCTTG CCTGGGTTTC TGGCTATTTC
TGGGGCGTAC TCAATCAGCA GCCTGCTGCG CTTGCAGCGA CGCCAGCGCC AGCCGCAGAA
ATGCCGGGTA TAACTATTAT CTCCGCCTCG CAAACCGGCA ATGCGCGCCG GGTTGCTGAA
GCATTACGCG ATGATTTATT AGCAGCAAAA CTGAACGTTA AGCTGGTGAA CGCGGGCGAC
TATAAATTCA AACAAATCGC CAGCGAAAAA CTGCTCATCG TAGTGACGTC AACGCAAGGG
GAAGGGGAAC CGCCGGAAGA AGCCGTCGCG CTGCATAAGT TCCTGTTCTC CAAAAAAGCG
CCAAAGCTGG AAAACACCGC GTTTGCCGTG TTTAGCCTCG GCGATAGCTC TTATGAATTT
TTCTGCCAGT CCGGGAAAGA TTTCGACAGC AAGCTGGCGG AACTGGGTGG TGAACGCCTG
CTCGACCGTG TCGATGCCGA TGTTGAATAC CAGGCTGCTG CCAGCGAGTG GCGCGCCCGC
GTGGTTGATG CGCTTAAATC GCGTGCGCCT GTCGCGGCAC CTTCGCAATC CGTCGCTACT
GGCGCGGTAA ATGAAATCCA CACCAGCCCG TACAGCAAAG ACGCGCCGCT GGTGGCTAGC
CTCTCTGTTA ACCAGAAAAT TACCGGGCGT AACTCTGAAA AAGACGTTCG CCATATCGAA
ATTGACTTAG GTGACTCGGG CCTGCGTTAC CAGCCGGGTG ACGCGCTGGG CGTCTGGTAT
CAGAACGATC CGGCACTGGT GAAAGAACTT GTCGAACTGC TGTGGCTGAA AGGCGATGAA
CCTGTCACCG TCGAGGGCAA AACGTTGCCT CTGAACGAAG CGCTACAGTG GCACTTCGAA
CTGACGGTTA ACACCGCCAA TATCGTGGAG AACTACGCTA CCTTAACGCG CAGCGAAACG
CTGCTTCCAC TGGTGGGCGA TAAAGCGAAG TTACAGCATT ACGCCGCGAC CACGCCGATT
GTCGACATGG TGCGTTTCTC TCCGGCGCAA CTGGATGCCG AAGCGCTGAT TAATCTGCTG
CGTCCGCTGA CCCCGCGCCT GTATTCCATC GCCTCCTCGC AGGCGGAAGT CGAGAACGAA
GTACACGTCA CCGTTGGTGT GGTGCGTTAC GACGTGGAAG GCCGAGCCCG TGCCGGTGGT
GCCTCCAGCT TCCTCGCGGA TCGCGTGGAA GAAGAGGGCG AAGTCCGCGT ATTTATCGAA
CATAACGATA ACTTTCGCCT GCCCGCTAAC CCGGAAACCC CGGTGATTAT GATTGGCCCA
GGCACCGGTA TTGCGCCGTT CCGCGCCTTT ATGCAGCAAC GCGCCGCCGA CGAAGCGCCA
GGTAAAAACT GGCTGTTCTT TGGTAATCCG CACTTTACGG AAGACTTCCT GTACCAGGTG
GAGTGGCAGC GCTACGTCAA AGATGGCGTG CTGACACGTA TCGATCTTGC CTGGTCGCGC
GATCAAAAAG AAAAAGTTTA CGTACAAGAC AAACTGCGCG AACAGGGCGC GGAGCTGTGG
CGCTGGATCA ATGATGGTGC CCACATTTAT GTCTGCGGCG ACGCTAATCG CATGGCGAAA
GACGTTGAGC AGGCACTTCT GGAAGTGATT GCCGAATTTG GTGGCATGGA CACCGAAGCG
GCGGATGAAT TTTTAAGTGA ACTGCGCGTA GAGCGCCGTT ATCAGCGAGA TGTCTACTAA
 
Protein sequence
MTTQVPPSAL LPLNPEQLAR LQAATTDLTP TQLAWVSGYF WGVLNQQPAA LAATPAPAAE 
MPGITIISAS QTGNARRVAE ALRDDLLAAK LNVKLVNAGD YKFKQIASEK LLIVVTSTQG
EGEPPEEAVA LHKFLFSKKA PKLENTAFAV FSLGDSSYEF FCQSGKDFDS KLAELGGERL
LDRVDADVEY QAAASEWRAR VVDALKSRAP VAAPSQSVAT GAVNEIHTSP YSKDAPLVAS
LSVNQKITGR NSEKDVRHIE IDLGDSGLRY QPGDALGVWY QNDPALVKEL VELLWLKGDE
PVTVEGKTLP LNEALQWHFE LTVNTANIVE NYATLTRSET LLPLVGDKAK LQHYAATTPI
VDMVRFSPAQ LDAEALINLL RPLTPRLYSI ASSQAEVENE VHVTVGVVRY DVEGRARAGG
ASSFLADRVE EEGEVRVFIE HNDNFRLPAN PETPVIMIGP GTGIAPFRAF MQQRAADEAP
GKNWLFFGNP HFTEDFLYQV EWQRYVKDGV LTRIDLAWSR DQKEKVYVQD KLREQGAELW
RWINDGAHIY VCGDANRMAK DVEQALLEVI AEFGGMDTEA ADEFLSELRV ERRYQRDVY
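
The protein sequence can be checked against the gene sequence by translating codons. The sketch below is a minimal example, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to elongation codons as the standard code (the two differ only in permitted start codons, which does not matter here because this gene starts with ATG). The function name `translate` and the two test fragments are chosen for illustration; the fragments are the first three 10-base groups and the last 15 bases of the gene sequence above.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGACGACAC AGGTCCCACC TTCCGCGTTG"))  # prints: MTTQVPPSAL
print(translate("CGAGATGTCTACTAA"))  # prints: RDVY
```

The two outputs match the first ten and last four residues of the protein sequence listed above.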