Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01657 |
Symbol | ydiS |
ID | 8113827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 1725126 |
End bp | 1726415 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644847881 |
Product | hypothetical protein |
Protein accession | YP_002999454 |
Protein GI | 251785150 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGATG ACAAATTTGA TGCCATTGTG GTCGGTGCGG GCGTTGCTGG TAGCGTTGCC GCACTGGTCA TGGCGCGAGC CGGGCTGGAT GTCCTGGTGA TAGAACGCGG CGACAGTGCC GGATGTAAAA ACATGACCGG CGGGCGTCTT TATGCCCACA CACTTGAAGC AATCATTCCA GGCTTTGCAG CATCAGCGCC GGTAGAACGC AAGGTCACAC GCGAGAAAAT CTCCTTCTTA ACCGAAGAGA GCGCCGTTAC CCTCGATTTT CACCGCGAGC AACCAGATGT TCCGCAACAC GCATCTTATA CCGTATTGCG TAATCGTCTG GACCCGTGGT TGATGGAACA AGCCGAGCAG GCGGGCGCAC AGTTTATCCC GGGCGTTCGC GTCGATGCGC TGGTTCGTGA AGGAAACAAG GTCACTGGCG TCCAGGCTGG GGATGATATT CTCGAAGCGA ATGTGGTGAT TCTGGCTGAT GGCGTTAACT CGATGCTTGG CCGCTCGCTG GGAATGGTTC CCGCTTCCGA TCCGCATCAT TACGCTGTTG GTGTTAAAGA GGTTATTGGC CTCACACCAG AACAGATCAA CGATCGCTTT AATATTACGG GCGAGGAAGG TGCCGCCTGG CTGTTTGCCG GTTCCCCTTC TGACGGCCTG ATGGGTGGCG GATTCCTCTA TACCAATAAG GATTCCATAT CCTTGGGGCT GGTTTGTGGA TTGGGTGATA TCGCCCATGC GCAAAAAAGC GTGCCGCAAA TGCTGGAAGA TTTTAAACAA CACCCCGCCA TTCGCCCGCT GATTAGCGGC GGCAAACTGC TTGAATATTC CGCGCATATG GTGCCGGAAG GCGGTCTGGC AATGGTGCCG CAACTGGTTA ACGAGGGCGT GATGATCGTT GGTGACGCCG CAGGCTTCTG CCTGAATTTG GGTTTTACGG TCCGCGGCAT GGATTTAGCC ATTGCATCGG CTCAGGCTGC CGCCACAACG GTGATCGCCG CCAAAGAACG CGCAGATTTC TCCGCCAGCA GTCTTGCGCA ATACAAACGT GAGCTGGAAC AAAGTTGTGT TATGCGCGAT ATGCAGCATT TTCGCAAGAT CCCGGCGCTG ATGGAAAATC CGCGCCTGTT TAGTCAGTAT CCGCGCATGG TCGCCGACAT CATGAACGAT ATGTTCACCA TTGACGGCAA ACCAAACCAG CCGGTACGCA AAATGATCAT GGGACACGCG AAGAAAATTG GGCTGATCAA CTTGCTGAAA GATGGCATTA AGGGAGCAAC CGCGCTATGA
|
Protein sequence | MSDDKFDAIV VGAGVAGSVA ALVMARAGLD VLVIERGDSA GCKNMTGGRL YAHTLEAIIP GFAASAPVER KVTREKISFL TEESAVTLDF HREQPDVPQH ASYTVLRNRL DPWLMEQAEQ AGAQFIPGVR VDALVREGNK VTGVQAGDDI LEANVVILAD GVNSMLGRSL GMVPASDPHH YAVGVKEVIG LTPEQINDRF NITGEEGAAW LFAGSPSDGL MGGGFLYTNK DSISLGLVCG LGDIAHAQKS VPQMLEDFKQ HPAIRPLISG GKLLEYSAHM VPEGGLAMVP QLVNEGVMIV GDAAGFCLNL GFTVRGMDLA IASAQAAATT VIAAKERADF SASSLAQYKR ELEQSCVMRD MQHFRKIPAL MENPRLFSQY PRMVADIMND MFTIDGKPNQ PVRKMIMGHA KKIGLINLLK DGIKGATAL
|
| |