Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2891 |
Symbol | cysI |
ID | 6144259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2962287 |
End bp | 2963999 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641617760 |
Product | sulfite reductase subunit beta |
Protein accession | YP_001744915 |
Protein GI | 170681318 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0155] Sulfite reductase, beta subunit (hemoprotein) |
TIGRFAM ID | [TIGR02041] sulfite reductase (NADPH) hemoprotein, beta-component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAA AACATCCAGG GCCTTTAGTG GTCGAAGGAA AACTGACAGA CGCCGAGCGC ATGAAGCTTG AAAGCAACTA CCTGCGCGGC ACCATTGCGG AAGATTTAAA CGACGGTCTG ACCGGCGGCT TTAAGGGCGA CAACTTCCTG CTGATCCGCT TCCACGGCAT GTATCAGCAG GATGACCGCG ACATCCGCGC CGAACGTGCT GAACAGAAGC TGGAGCCGCG CCACGCGATG CTGTTGCGCT GCCGTCTGCC GGGTGGCGTG ATCACCACCA AACAGTGGCA GGCGATCGAT AAATTTGCCG GTGAAAACAC CATCTATGGC AGCATTCGCC TGACCAACCG CCAGACGTTT CAGTTCCACG GCATTCTGAA AAAGAACGTC AAACCGGTGC ACCAGATGCT GCACTCGGTC GGTCTTGATG CGCTGGCGAC TGCTAACGAC ATGAACCGTA ACGTACTCTG CACCTCGAAC CCTTACGAGT CGCAGCTACA CGCGGAAGCG TATGAATGGG CGAAGAAAAT CTCTGAACAT CTGCTGCCGC GTACCCGCGC GTATGCGGAG ATCTGGCTCG ATCAGGAAAA AGTCGCCACC ACCGATGAAG AACCGATCCT CGGTCAGACT TATTTGCCGC GTAAATTCAA AACCACGGTA GTGATCCCGC CGCAGAACGA TATCGATCTG CATGCCAACG ACATGAACTT CGTGGCAATC GCCGAAAACG GCAAGCTGGT GGGCTTTAAC CTGCTGGTGG GGGGTGGGCT TTCTATCGAA CACGGCAACA AGAAAACCTA CGCCCGCACG GCGAGTGAGT TTGGCTATCT GCCGCTGGAG CATACGCTAG CGGTGGCGGA AGCCGTCGTG ACAACTCAGC GTGACTGGGG TAACCGAACC GATCGTAAAA ATGCCAAAAC CAAATACACG CTGGAGCGCG TGGGGGTCGA GACGTTTAAA GCAGAAGTGG AACGTCGCGC GGGGATCAAG TTCGAACCGA TCCGTCCGTA TGAATTTACC GGGCGCGGCG ATCGTATCGG CTGGGTTAAG GGCATTGATG ATAACTGGCA CCTGACGCTG TTTATCGAAA ATGGTCGCAT CCTTGATTAT CCGGGGCGTC CGCTGAAAAC CGGCTTGTTG GAGATCGCGA AGATCCACAA AGGTGATTTC CGCATTACGG CGAACCAGAA TCTGATCATC GCCGGCGTGC CGGAAAGCGA GAAAGCGAAG ATCGAGAAGA TCGCCAAAGA AAGCGGGTTA ATGAATGCCG TCACGCCGCA GCGTGAAAAC TCAATGGCCT GCGTGTCATT CCCGACTTGC CCGCTGGCGA TGGCGGAAGC GGAGCGTTTC CTGCCGTCTT TTATCGACAA AATCGATAAT TTAATGGCGA AACATGGTGT CAGCGATGAG CATATCGTGA TGCGTGTAAC AGGCTGCCCG AACGGTTGTG GTCGCGCGAT GCTGGCGGAA GTGGGTCTGG TGGGGAAAGC GCCGGGTCGC TACAACCTGC ATCTTGGCGG CAACCGCATT GGGACACGTA TCCCACGGAT GTATAAAGAA AACATCACCG AACCGGAAAT CCTGGCGTCG CTTGATGAAC TGATAGGGCG CTGGGCGAAA GAGCGCGAAG CGGGTGAAGG CTTCGGCGAC TTTACGGTGC GTGCGGGCAT CATTCGCCCG GTGCTCGATC CGGCGCGCGA TTTGTGGGAT TAA
|
Protein sequence | MSEKHPGPLV VEGKLTDAER MKLESNYLRG TIAEDLNDGL TGGFKGDNFL LIRFHGMYQQ DDRDIRAERA EQKLEPRHAM LLRCRLPGGV ITTKQWQAID KFAGENTIYG SIRLTNRQTF QFHGILKKNV KPVHQMLHSV GLDALATAND MNRNVLCTSN PYESQLHAEA YEWAKKISEH LLPRTRAYAE IWLDQEKVAT TDEEPILGQT YLPRKFKTTV VIPPQNDIDL HANDMNFVAI AENGKLVGFN LLVGGGLSIE HGNKKTYART ASEFGYLPLE HTLAVAEAVV TTQRDWGNRT DRKNAKTKYT LERVGVETFK AEVERRAGIK FEPIRPYEFT GRGDRIGWVK GIDDNWHLTL FIENGRILDY PGRPLKTGLL EIAKIHKGDF RITANQNLII AGVPESEKAK IEKIAKESGL MNAVTPQREN SMACVSFPTC PLAMAEAERF LPSFIDKIDN LMAKHGVSDE HIVMRVTGCP NGCGRAMLAE VGLVGKAPGR YNLHLGGNRI GTRIPRMYKE NITEPEILAS LDELIGRWAK EREAGEGFGD FTVRAGIIRP VLDPARDLWD
|
| |