Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2904 |
Symbol | cysJ |
ID | 5592258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2907331 |
End bp | 2909130 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640922021 |
Product | sulfite reductase subunit alpha |
Protein accession | YP_001459532 |
Protein GI | 157162214 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0369] Sulfite reductase, alpha subunit (flavoprotein) |
TIGRFAM ID | [TIGR01931] sulfite reductase [NADPH] flavoprotein, alpha-component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 69 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACAC AGGTCCCACC TTCCGCGTTG CTTCCGTTGA ACCCGGAGCA ACTGGCACGC CTTCAGGCGG CCACGACCGA TTTAACTCCC ACCCAGCTTG CCTGGGTTTC TGGCTATTTC TGGGGCGTGC TCAATCAGCA GCCTGCTGCG CTTGCAGCGA CGCCAGCGCC AGCCGCAGAA ATGCCGGGTA TAACTATTAT CTCCGCCTCG CAAACCGGCA ATGCGCGCCG GGTTGCTGAA GCATTACGCG ATGATTTATT GACGGCAAAA CTGAACGTTA AACTGGTGAA CGCGGGCGAC TATAAATTCA AACAAATCGC CAGCGAAAAA CTGCTTATCG TGGTGACGTC AACGCAAGGA GAAGGGGAAC CGCCGGAAGA AGCCGTCGCG CTGCATAAGT TCCTGTTCTC CAAAAAAGCG CCGAAGCTGG AAAACACCGC ATTTGCCGTG TTTAGCCTCG GCGATAGCTC TTATGAATTT TTCTGTCAGT CCGGGAAAGA TTTTGACAGC AAGCTGGCGG AACTGGGCGG TGAACGCCTG CTCGACCGTG TCGATGCCGA CGTTGAATAC CAGACCGCTG CCAGCGAGTG GCGCGCTCGC GTGGTTGATG CGCTTAAATC GCGTGCGCCT GTCGCGGCAC CTTCGCAATC CGTCGCTACT GGCGCGGTAA ATGAAATCCA CACCAGCCCG TACAGCAAAG ACTCGCCGCT GGTGGCGAGC CTCTCTGTTA ACCAGAAAAT TACCGGGCGT AACTCTGAAA AAGACGTTCG CCATATCGAA ATTGACTTAG GTGACTCGGG CCTGCGTTAC CAGCCGGGTG ACGCGCTGGG TGTCTGGTAT CAGAACGATC CGGCACTGGT GAAAGAACTT GTCGAACTGC TGTGGCTGAA AGGCGATGAA CCTGTCACCG TCGAGGGCAA AACGTTGCCT CTGAACGAAG CGCTACAGTG GCACTTCGAA CTGACCGTCA ACACCGCCAA CATTGTTGAG AATTACGCCA CGCTTACCCG CAGCGAAACA CTGCTGCCGC TGGTGGGCGA TAAAGCGAAG TTACAGCATT ACGCCGCGAC GACGCCGATT GTTGACATGG TGCGTTTCTC CCCGGCACAG CTTGATGCCG AAGCGCTAAT TAATCTGCTG CGCCCGCTGA CGCCGCGTCT CTATTCCATT GCCTCCTCGC AGGCGGAAGT CGAGAACGAA GTACACGTCA CCGTTGGTGT GGTGCGTTAC GACGTGGAAG GCCGCGCCCG TGCCGGTGGT GCCTCCAGCT TCCTCGCTGA CCGCGTGGAA GAAGAGGGCG AAGTCCGCGT ATTTATCGAA CATAACGATA ACTTCCGCCT ACCAGCCAAT CCAGAAACCC CGGTGATTAT GATTGGCCCA GGCACCGGTA TCGCGCCGTT CCGCGCCTTT ATGCAGCAAC GTGCTGCCGA TGAAGCACCG GGTAAAAACT GGCTGTTCTT TGGCAACCCG CATTTTACGG AAGATTTCCT CTACCAGGTG GAGTGGCAGC GCTACGTCAA AGAGGGCGTG CTGACACGTA TCGATCTTGC CTGGTCGCGC GATCAAAAAG AAAAAGTTTA CGTACAAGAC AAACTGCGCG AACAGGGCGC GGAGCTGTGG CGCTGGATCA ATGACGGTGC CCACATTTAT GTCTGCGGCG ACGCTAATCG CATGGCGAAA GACGTTGAGC AGGCACTTCT GGAAGTGATT GCCGAATTTG GTGGCATGGA CACCGAAGCG GCGGATGAAT TTTTAAGTGA GCTGCGCGTA GAGCGCCGTT ATCAGCGAGA TGTCTACTAA
|
Protein sequence | MTTQVPPSAL LPLNPEQLAR LQAATTDLTP TQLAWVSGYF WGVLNQQPAA LAATPAPAAE MPGITIISAS QTGNARRVAE ALRDDLLTAK LNVKLVNAGD YKFKQIASEK LLIVVTSTQG EGEPPEEAVA LHKFLFSKKA PKLENTAFAV FSLGDSSYEF FCQSGKDFDS KLAELGGERL LDRVDADVEY QTAASEWRAR VVDALKSRAP VAAPSQSVAT GAVNEIHTSP YSKDSPLVAS LSVNQKITGR NSEKDVRHIE IDLGDSGLRY QPGDALGVWY QNDPALVKEL VELLWLKGDE PVTVEGKTLP LNEALQWHFE LTVNTANIVE NYATLTRSET LLPLVGDKAK LQHYAATTPI VDMVRFSPAQ LDAEALINLL RPLTPRLYSI ASSQAEVENE VHVTVGVVRY DVEGRARAGG ASSFLADRVE EEGEVRVFIE HNDNFRLPAN PETPVIMIGP GTGIAPFRAF MQQRAADEAP GKNWLFFGNP HFTEDFLYQV EWQRYVKEGV LTRIDLAWSR DQKEKVYVQD KLREQGAELW RWINDGAHIY VCGDANRMAK DVEQALLEVI AEFGGMDTEA ADEFLSELRV ERRYQRDVY
|
| |