Gene B21_01657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01657 
SymbolydiS 
ID8113827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp1725126 
End bp1726415 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content54% 
IMG OID644847881 
Producthypothetical protein 
Protein accessionYP_002999454 
Protein GI251785150 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGATG ACAAATTTGA TGCCATTGTG GTCGGTGCGG GCGTTGCTGG TAGCGTTGCC 
GCACTGGTCA TGGCGCGAGC CGGGCTGGAT GTCCTGGTGA TAGAACGCGG CGACAGTGCC
GGATGTAAAA ACATGACCGG CGGGCGTCTT TATGCCCACA CACTTGAAGC AATCATTCCA
GGCTTTGCAG CATCAGCGCC GGTAGAACGC AAGGTCACAC GCGAGAAAAT CTCCTTCTTA
ACCGAAGAGA GCGCCGTTAC CCTCGATTTT CACCGCGAGC AACCAGATGT TCCGCAACAC
GCATCTTATA CCGTATTGCG TAATCGTCTG GACCCGTGGT TGATGGAACA AGCCGAGCAG
GCGGGCGCAC AGTTTATCCC GGGCGTTCGC GTCGATGCGC TGGTTCGTGA AGGAAACAAG
GTCACTGGCG TCCAGGCTGG GGATGATATT CTCGAAGCGA ATGTGGTGAT TCTGGCTGAT
GGCGTTAACT CGATGCTTGG CCGCTCGCTG GGAATGGTTC CCGCTTCCGA TCCGCATCAT
TACGCTGTTG GTGTTAAAGA GGTTATTGGC CTCACACCAG AACAGATCAA CGATCGCTTT
AATATTACGG GCGAGGAAGG TGCCGCCTGG CTGTTTGCCG GTTCCCCTTC TGACGGCCTG
ATGGGTGGCG GATTCCTCTA TACCAATAAG GATTCCATAT CCTTGGGGCT GGTTTGTGGA
TTGGGTGATA TCGCCCATGC GCAAAAAAGC GTGCCGCAAA TGCTGGAAGA TTTTAAACAA
CACCCCGCCA TTCGCCCGCT GATTAGCGGC GGCAAACTGC TTGAATATTC CGCGCATATG
GTGCCGGAAG GCGGTCTGGC AATGGTGCCG CAACTGGTTA ACGAGGGCGT GATGATCGTT
GGTGACGCCG CAGGCTTCTG CCTGAATTTG GGTTTTACGG TCCGCGGCAT GGATTTAGCC
ATTGCATCGG CTCAGGCTGC CGCCACAACG GTGATCGCCG CCAAAGAACG CGCAGATTTC
TCCGCCAGCA GTCTTGCGCA ATACAAACGT GAGCTGGAAC AAAGTTGTGT TATGCGCGAT
ATGCAGCATT TTCGCAAGAT CCCGGCGCTG ATGGAAAATC CGCGCCTGTT TAGTCAGTAT
CCGCGCATGG TCGCCGACAT CATGAACGAT ATGTTCACCA TTGACGGCAA ACCAAACCAG
CCGGTACGCA AAATGATCAT GGGACACGCG AAGAAAATTG GGCTGATCAA CTTGCTGAAA
GATGGCATTA AGGGAGCAAC CGCGCTATGA
 
Protein sequence
MSDDKFDAIV VGAGVAGSVA ALVMARAGLD VLVIERGDSA GCKNMTGGRL YAHTLEAIIP 
GFAASAPVER KVTREKISFL TEESAVTLDF HREQPDVPQH ASYTVLRNRL DPWLMEQAEQ
AGAQFIPGVR VDALVREGNK VTGVQAGDDI LEANVVILAD GVNSMLGRSL GMVPASDPHH
YAVGVKEVIG LTPEQINDRF NITGEEGAAW LFAGSPSDGL MGGGFLYTNK DSISLGLVCG
LGDIAHAQKS VPQMLEDFKQ HPAIRPLISG GKLLEYSAHM VPEGGLAMVP QLVNEGVMIV
GDAAGFCLNL GFTVRGMDLA IASAQAAATT VIAAKERADF SASSLAQYKR ELEQSCVMRD
MQHFRKIPAL MENPRLFSQY PRMVADIMND MFTIDGKPNQ PVRKMIMGHA KKIGLINLLK
DGIKGATAL