Gene EcE24377A_3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_3066 
SymbolcysJ 
ID5587243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp3069017 
End bp3070816 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content56% 
IMG OID640926710 
Productsulfite reductase subunit alpha 
Protein accessionYP_001464086 
Protein GI157158512 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0369] Sulfite reductase, alpha subunit (flavoprotein) 
TIGRFAM ID[TIGR01931] sulfite reductase [NADPH] flavoprotein, alpha-component 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.944096 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACAC AGGTCCCACC TTCCGCGTTG CTTCCGTTGA ACCCGGAGCA ACTGGCACGC 
CTTCAGGCGG CCACGACCGA TTTAACTCCC ACCCAGCTTG CCTGGGTTTC TGGCTATTTC
TGGGGCGTGC TCAATCAGCA GCCTGCTGCG CTTGCAGCGA CGCCAGCGCC AGCCGCAGAA
ATGCCGGGTA TAACTATTAT CTCCGCCTCG CAAACCGGCA ATGCGCGCCG GGTTGCTGAA
GCATTACGCG ATGATTTATT AGCAGCAAAA CTGAACGTTA AGCTGGTGAA CGCGGGCGAC
TATAAATTCA AACAAATCGC CAGCGAAAAA CTGCTCATCG TAGTGACGTC AACGCAAGGG
GAAGGGGAAC CGCCGGAGGA AGCCGTCGCG CTGCATAAGT TCCTGTTCTC CAAAAAAGCG
CCAAAGCTGG AAAACACCGC GTTTGCCGTG TTTAGCCTCG GCGATAGCTC TTATGAATTT
TTCTGCCAGT CCGGGAAAGA TTTCGACAGC AAGCTGGCGG AACTGGGTGG TGAACGCCTG
CTCGACCGTG TCGATGCCGA TGTTGAATAC CAGGCTGCTG CCAGCGAGTG GCGCGCCCGC
GTGGTTGATG CGCTTAAATC GCGTGCGCCT GTCGCGGCAC CTTCGCAATC CGTCGCTACT
GGCGCGGTAA ATGAAATCCA CACCAGCCCG TACAGCAAAG ACGCGCCGCT GGTGGCTAGC
CTCTCTGTTA ACCAGAAAAT TACCGGGCGT AACTCTGAAA AAGACGTTCG CCATATCGAA
ATTGACTTAG GTGACTCGGG CCTGCGTTAC CAGCCGGGTG ACGCGCTGGG CGTCTGGTAT
CAGAACGATC CGGCACTGGT GAAAGAACTT GTCGAACTGC TGTGGCTGAA AGGCGATGAA
CCTGTCACCG TCGAGGGCAA AACGTTGCCT CTGAACGAAG CGCTACAGTG GCACTTCGAA
CTGACCGTCA ACACCGCCAA CATTGTTGAG AATTACGCCA CGCTTACCCG CAGCGAAACA
CTGCTGCCGC TGGTGGGCGA TAAAGCGAAG TTACAGCATT ACGCCGCGAC GACGCCGATT
GTCGACATGG TGCGTTTCTC TCCGGCGCAA CTGGATGCCG AAGCGCTGAT TAATCTGCTG
CGCCCGCTGA CCCCGCGCCT GTATTCCATC GCCTCCTCGC AGGCGGAAGT CGAGAACGAA
GTACACGTCA CCGTTGGTGT GGTGCGTTAC GACGTGGAAG GCCGAGCCCG TGCCGGTGGT
GCCTCCAGCT TCCTCGCGGA TCGCGTGGAA GAAGAGGGCG AAGTCCGCGT ATTTATCGAA
CATAACGATA ACTTTCGCCT GCCCGCTAAC CCGGAAACCC CGGTGATTAT GATTGGCCCA
GGCACCGGTA TTGCGCCGTT CCGCGCCTTT ATGCAGCAAC GCGCCGCCGA CGAAGCGCCA
GGTAAAAACT GGCTGTTCTT TGGTAATCCG CACTTTACGG AAGACTTCCT GTATCAGGTG
GAGTGGCAGC GCTACGTCAA AGATGGCGTG CTGACACGTA TCGATCTTGC CTGGTCGCGC
GACCAAAAAG AAAAAGTTTA CGTACAAGAC AAACTGCGCG AACAGGGCGC GGAGCTGTGG
CGCTGGATCA ATGATGGTGC CCACATTTAT GTCTGCGGCG ACGCTAATCG CATGGCGAAA
GACGTTGAGC AGGCACTTCT GGAAGTGATT GCCGAATTTG GTGGCATGGA CACCGAAGCG
GCGGATGAAT TTTTAAGTGA GCTGCGCGTA GAGCGCCGTT ATCAGCGAGA TGTCTACTAA
 
Protein sequence
MTTQVPPSAL LPLNPEQLAR LQAATTDLTP TQLAWVSGYF WGVLNQQPAA LAATPAPAAE 
MPGITIISAS QTGNARRVAE ALRDDLLAAK LNVKLVNAGD YKFKQIASEK LLIVVTSTQG
EGEPPEEAVA LHKFLFSKKA PKLENTAFAV FSLGDSSYEF FCQSGKDFDS KLAELGGERL
LDRVDADVEY QAAASEWRAR VVDALKSRAP VAAPSQSVAT GAVNEIHTSP YSKDAPLVAS
LSVNQKITGR NSEKDVRHIE IDLGDSGLRY QPGDALGVWY QNDPALVKEL VELLWLKGDE
PVTVEGKTLP LNEALQWHFE LTVNTANIVE NYATLTRSET LLPLVGDKAK LQHYAATTPI
VDMVRFSPAQ LDAEALINLL RPLTPRLYSI ASSQAEVENE VHVTVGVVRY DVEGRARAGG
ASSFLADRVE EEGEVRVFIE HNDNFRLPAN PETPVIMIGP GTGIAPFRAF MQQRAADEAP
GKNWLFFGNP HFTEDFLYQV EWQRYVKDGV LTRIDLAWSR DQKEKVYVQD KLREQGAELW
RWINDGAHIY VCGDANRMAK DVEQALLEVI AEFGGMDTEA ADEFLSELRV ERRYQRDVY