Gene Information |
Locus tag | B21_02571 |
Symbol | cysI |
ID | 8113395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 2719758 |
End bp | 2721470 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644848769 |
Product | hypothetical protein |
Protein accession | YP_003000342 |
Protein GI | 251786038 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0155] Sulfite reductase, beta subunit (hemoprotein) |
TIGRFAM ID | [TIGR02041] sulfite reductase (NADPH) hemoprotein, beta-component |
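
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the stated Gene Length) and that the CDS ends in a single stop codon that is not counted in the Protein Length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2719758
END_BP = 2721470


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1713
print(protein_length(length_bp))  # 570
```

Both results match the Gene Length (1713 bp) and Protein Length (570 aa) rows, so the record is internally consistent under these assumptions.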
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAA AACATCCAGG GCCTTTAGTG GTCGAAGGAA AACTGACAGA CGCCGAGCGC ATGAAGCTTG AAAGCAACTA CCTGCGCGGC ACCATTGCGG AAGATTTAAA CGACGGTCTG ACCGGCGGCT TTAAGGGCGA CAACTTCCTG CTGATCCGCT TCCACGGCAT GTATCAGCAG GATGACCGCG ACATCCGCGC CGAACGTGCT GAACAGAAGC TGGAGCCGCG CCACGCGATG CTGTTGCGCT GCCGTCTGCC GGGTGGGGTG ATTACCACTA AACAGTGGCA GGCGATAGAC AAATTTGCCG GTGAAAACAC CATCTATGGC AGCATTCGCC TGACCAACCG CCAGACGTTT CAGTTCCACG GCATTCTGAA AAAGAACGTC AAACCGGTGC ACCAGATGCT GCACTCGGTT GGTCTTGATG CGCTGGCGAC CGCTAACGAC ATGAACCGTA ACGTACTCTG CACCTCGAAC CCTTACGAGT CGCAGCTACA CGCGGAAGCG TACGAGTGGG CGAAGAAGAT TTCTGAGCAT CTGCTGCCGC GTACCCGCGC GTATGCGGAG ATCTGGCTCG ACCAGGAAAA AGTCGCCACT ACTGATGAAG AACCGATCCT CGGCCAGACC TACCTGCCGC GTAAATTCAA AACCACGGTA GTGATCCCGC CACAGAACGA TATCGATCTG CACGCCAACG ACATGAACTT CGTGGCGATC GCCGAAAACG GCAAGCTGGT GGGCTTTAAC CTGTTGGTGG GCGGTGGGCT TTCCATCGAA CACGGCAACA AGAAAACCTA CGCCCGCACC GCGAGTGAGT TTGGCTATCT GCCGCTGGAG CATACGCTGG CGGTGGCCGA AGCCGTCGTG ACAACTCAGC GTGACTGGGG TAACCGAACC GATCGTAAAA ATGCCAAAAC CAAATACACG CTGGAGCGCG TGGGGGTTGA GACGTTTAAA GCGGAAGTGG AGCGTCGCGC GGGGATCAAA TTTGAACCGA TCCGTCCATA TGAGTTCACC GGACGAGGCG ATCGTATTGG CTGGGTTAAG GGCATTGATG ATAACTGGCA CCTGACGCTG TTTATCGAAA ATGGTCGCAT CCTTGATTAT CCGGCGCGTC CGCTGAAAAC CGGCCTGCTG GAGATCGCGA AGATCCACAA AGGCGATTTC CGCATTACGG CGAACCAGAA TCTGATCATC GCCGGTGTAC CGGAAAGCGA GAAAGCGAAG ATCGAGAAGA TCGCCAAAGA GAGCGGGTTA ATGAATGCCG TCACGCCGCA GCGTGAAAAC TCAATGGCCT GCGTGTCATT CCCGACTTGC CCGCTGGCGA TGGCGGAAGC GGAGCGTTTC CTGCCGTCTT TTATCGACAA CATCGATAAT TTAATGGCGA AACATGGTGT CAGCGATGAG CATATCGTGA TGCGTGTAAC AGGCTGCCCG AACGGTTGTG GTCGCGCGAT GCTGGCGGAA GTGGGCCTGG TGGGTAAAGC GCCGGGTCGC TACAACCTGC ATCTTGGCGG CAACCGCATT GGGACACGTA TCCCTCGGAT GTATAAAGAA AACATCACCG AGCCGGAAAT CCTGGCGTCG CTTGATGAAC TGATAGGGCG CTGGGCGAAA GAGCGCGAAG CGGGTGAAGG CTTCGGCGAC TTTACGGTGC GTGCGGGCAT CATTCGCCCG GTGCTCGATC CGGCGCGCGA TTTGTGGGAT TAA
|
Protein sequence | MSEKHPGPLV VEGKLTDAER MKLESNYLRG TIAEDLNDGL TGGFKGDNFL LIRFHGMYQQ DDRDIRAERA EQKLEPRHAM LLRCRLPGGV ITTKQWQAID KFAGENTIYG SIRLTNRQTF QFHGILKKNV KPVHQMLHSV GLDALATAND MNRNVLCTSN PYESQLHAEA YEWAKKISEH LLPRTRAYAE IWLDQEKVAT TDEEPILGQT YLPRKFKTTV VIPPQNDIDL HANDMNFVAI AENGKLVGFN LLVGGGLSIE HGNKKTYART ASEFGYLPLE HTLAVAEAVV TTQRDWGNRT DRKNAKTKYT LERVGVETFK AEVERRAGIK FEPIRPYEFT GRGDRIGWVK GIDDNWHLTL FIENGRILDY PARPLKTGLL EIAKIHKGDF RITANQNLII AGVPESEKAK IEKIAKESGL MNAVTPQREN SMACVSFPTC PLAMAEAERF LPSFIDNIDN LMAKHGVSDE HIVMRVTGCP NGCGRAMLAE VGLVGKAPGR YNLHLGGNRI GTRIPRMYKE NITEPEILAS LDELIGRWAK EREAGEGFGD FTVRAGIIRP VLDPARDLWD
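
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI base ordering (translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons), strips the spaces that separate the 10-base blocks, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first 30 bases of the gene plus a stop codon are used as sample input.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence, followed by a stop codon.
sample = "ATGAGCGAAA AACATCCAGG GCCTTTAGTG TAA"
print(translate(sample))         # MSEKHPGPLV
print(gc_percent("ATGAGCGAAA"))  # 40.0
```

Passing the full Gene sequence value to `translate` should give the Protein sequence shown above, and `gc_percent` on the full gene can be compared against the GC content row.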
|
| |