Gene Information |
Locus tag | EcSMS35_2892 |
Symbol | cysJ |
ID | 6145502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2963999 |
End bp | 2965798 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641617761 |
Product | sulfite reductase subunit alpha |
Protein accession | YP_001744916 |
Protein GI | 170680548 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0369] Sulfite reductase, alpha subunit (flavoprotein) |
TIGRFAM ID | [TIGR01931] sulfite reductase [NADPH] flavoprotein, alpha-component |
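
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2963999
END_BP = 2965798
GENE_LENGTH_BP = 1800
PROTEIN_LENGTH_AA = 599


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1800
print(expected_protein_length(GENE_LENGTH_BP))  # 599
```

Under those assumptions both results agree with the Gene Length and Protein Length fields listed in the record.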
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACAC AGGTCTCACC TTCCGCGTTG CTTCCGTTGA ACCCGGAGCA ACTGGCACGC CTTCAGGCGG CCACGACCGA TTTAACTCCC ACCCAGCTTG CCTGGGTTTC TGGCTATTTC TGGGGCGTGC TCAACCAGCA GCCTGCTGCG CTTGCAGCGA CGCCAGCGCC CGCCGCAGAA ATGCCGGGTA TAACTATTAT CTCCGCCTCG CAAACCGGCA ATGCGCGCCG GGTTGCTGAA GCGTTACGCG ATGACTTATT AGCGGCAAAA CTAAACGTTA AGCTGGTGAA CGCGGGCGAC TATAAATTCA AACAAATCGC CAGCGAAAAA CTGCTTATCG TGGTGACGTC AACGCAAGGG GAAGGGGAAC CGCCGGAAGA AGCCGTCGCG CTGCATAAGT TCCTGTTCTC CAAAAAAGCG CCGAAGCTGG AAAACACCGC ATTTGCCGTG TTTAGCCTCG GCGATAGCTC TTATGAATTT TTCTGCCAGT CCGGGAAAGA TTTTGACAGC AAGCTGGCGG AACTGGGCGG TGAACGCCTG CTCGACCGTG TCGATGCCGA TGTTGAATAT CAGGCCGCTG CCAGCGAGTG GCGCGCCCGC GTGGTTGATG CGCTTAAATC GCGTGCGCCT GTCGCGGCAC CTTCGCAATC CGTCGCTACT GGCACGGTAA ATGAAATCCA CACCAGCCCG TATAGCAAAG ACGCGCCGCT GGTGGCGAGC CTCTCTGTTA ACCAGAAAAT TACCGGGCGT AACTCTGAAA AAGACGTTCG CCATATCGAA ATTGACTTAG GTGACTCGGG CCTGCGTTAC CAGCCGGGTG ACGCGCTGGG CGTCTGGTAT CAGAACGATC CGGCACTGGT GAAAGAACTT GTCGAACTGC TGTGGCTGAA AGGCGATGAA CCTGTCACCG TTGAGGGCAA AACCCTGCCG CTGAATGAGG CGCTACAGTG GCATTTTGAA CTGACCGTCA ACACCGCCAA CATTGTTGAG AACTATGCCA CGCTTACCCG CAGCGAAACG CTGCTGCCGC TGGTGGGCGA TAAAGCGAAG TTGCAGCATT ACGCCGCGAC GACGCCGATT GTTGACATGG TGCGTTTCTC TCCGGCGCAA CTGGATGCCG AAGCGCTGAT TAATCTGCTG CGCCCGCTGA CGCCGCGCCT GTATTCCATC GCCTCTTCGC AGGCGGAAGT CGAGAACGAA GTACACGTCA CCGTTGGTGT GGTGCGTTAC GACGTGGAAG GCCGCGCCCG TGCCGGTGGT GCCTCCAGCT TCCTCGCGGA TCGTGTGGAA GAAGAGGGTG AAGTCCGCGT CTTTATCGAA CATAACGATA ACTTCCGCCT GCCAGCCAAT CCGGAAACGC CGGTGATTAT GATTGGCCCA GGCACCGGCA TTGCGCCGTT CCGCGCCTTT ATGCAGCAAC GCGCCGCCGA CGAAGCGCCG GGGAAAAACT GGCTGTTCTT TGGCAACCCG CACTTTACGG AAGATTTCCT CTACCAGGTG GAATGGCAGC GCTACGTCAA AGAGGGCGTG CTGACGCGTA TCGATCTTGC CTGGTCGCGC GATCAAAAAG AAAAAATTTA CGTACAAGAC AAACTGCGCG AACAGGGCGC GGAGCTGTGG CGCTGGATCA ATGACGGTGC CCACATTTAT GTCTGCGGCG ACGCTAATCG CATGGCGAAA GACGTTGAAC AGGCACTTCT GGAAGTGATT GCCGAATTTG GTGGCATGGA CACCGAAGCG GCGGATGAAT TTTTAAGTGA GCTGCGCGTA GAGCGCCGTT ATCAGCGAGA TGTCTACTAA
|
Protein sequence | MTTQVSPSAL LPLNPEQLAR LQAATTDLTP TQLAWVSGYF WGVLNQQPAA LAATPAPAAE MPGITIISAS QTGNARRVAE ALRDDLLAAK LNVKLVNAGD YKFKQIASEK LLIVVTSTQG EGEPPEEAVA LHKFLFSKKA PKLENTAFAV FSLGDSSYEF FCQSGKDFDS KLAELGGERL LDRVDADVEY QAAASEWRAR VVDALKSRAP VAAPSQSVAT GTVNEIHTSP YSKDAPLVAS LSVNQKITGR NSEKDVRHIE IDLGDSGLRY QPGDALGVWY QNDPALVKEL VELLWLKGDE PVTVEGKTLP LNEALQWHFE LTVNTANIVE NYATLTRSET LLPLVGDKAK LQHYAATTPI VDMVRFSPAQ LDAEALINLL RPLTPRLYSI ASSQAEVENE VHVTVGVVRY DVEGRARAGG ASSFLADRVE EEGEVRVFIE HNDNFRLPAN PETPVIMIGP GTGIAPFRAF MQQRAADEAP GKNWLFFGNP HFTEDFLYQV EWQRYVKEGV LTRIDLAWSR DQKEKIYVQD KLREQGAELW RWINDGAHIY VCGDANRMAK DVEQALLEVI AEFGGMDTEA ADEFLSELRV ERRYQRDVY
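
The sequence fields can be checked against the GC content and Translation table fields with a short script. The following is a minimal sketch using only the standard library. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical, and the codon table is built from the standard base order TCAG. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so this mapping is assumed to be adequate here. The gene sequence as printed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the strand is listed as "-".

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last twelve bases of the gene sequence in this record.
first_blocks = "ATGACGACAC AGGTCTCACC TTCCGCGTTG"
last_codons = "GATGTCTACTAA"

print(translate(first_blocks))  # MTTQVSPSAL
print(translate(last_codons))   # DVY
```

Both fragments translate to the matching start and end of the listed protein sequence. Passing the full gene sequence string to `gc_percent` and `translate` would allow the same comparison against the complete GC content and Protein sequence fields.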