Gene B21_00520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00520 
SymbolcusS 
ID8115083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp551828 
End bp553252 
Gene Length1425 bp 
Protein Length475 aa 
Translation table11 
GC content52% 
IMG OID644846799 
Producthypothetical protein 
Protein accessionYP_002998372 
Protein GI251784068 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR01386] heavy metal sensor kinase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCAGTA AGCCATTTCA GCGCCCGTTT TCGCTGGCAA CCCGCCTGAC CTTTTTTATC 
AGCCTGGCCA CCATCGCGGC GTTTTTCGCC TTTGCATGGA TCATGATCCA CTCAGTAAAA
GTTCATTTTG CCGAGCAGGA TATTAATGAT TTAAAAGAGA TTAGCGCCAC CCTTGAACGG
GTACTAAATC ACCCTGACGA AACGCAAGCC CGACGCTTAA TGACGCTGGA AGATATCGTC
AGTGGTTATT CCAATGTGTT GATTTCCCTG GCAGATAGTC AGGGTAAAAC GGTGTATCAC
TCCCCCGGTG CGCCGGATAT CCGCGAGTTT ACGCGTGACG CCATACCCGA TAAAGACGCT
CAGGGTGGCG AGGTGTATCT CCTTTCCGGC CCGACGATGA TGATGCCAGG CCACGGTCAC
GGGCATATGG AACACAGCAA CTGGCGGATG ATTAACTTGC CGGTTGGCCC GTTGGTGGAC
GGCAAACCGA TTTATACGCT CTACATCGCG CTTTCGATCG ATTTTCATCT TCATTACATA
AATGATTTGA TGAATAAACT TATTATGACC GCATCGGTAA TCAGCATCCT GATCGTCTTT
ATCGTACTGT TGGCGGTACA TAAAGGTCAC GCGCCGATCC GCAGCGTCAG CCGTCAAATC
CAGAATATTA CCTCGAAAGA TCTCGACGTT CGCCTCGACC CGCAGACCGT GCCGATTGAG
CTGGAACAGC TGGTACTGTC GTTCAACCAT ATGATCGAGC GTATTGAGGA TGTCTTTACC
CGCCAGTCCA ATTTCTCAGC GGATATCGCT CACGAAATTC GCACGCCGAT TACCAATCTC
ATCACGCAAA CGGAAATCGC CCTCAGCCAG TCTCGCAGCC AGAAGGAGCT GGAAGATGTG
CTCTACTCTA ATCTCGAAGA GCTGACGCGA ATGGCGAAAA TGGTCAGCGA TATGCTGTTT
CTCGCTCAGG CCGATAACAA CCAGCTAATC CCCGAAAAGA AAATGCTCAA CCTGGCGGAT
GAAGTCGGCA AAGTGTTCGA TTTTTTCGAG GCGTTAGCGG AAGATCGCGG CGTGGAGTTG
CGATTTGTTG GCGACAAATG TCAGGTTGCG GGCGATCCGC TGATGCTGCG TCGGGCGTTA
AGCAACCTGC TTTCTAATGC CCTGCGTTAT ACGCCACCCA GTGAGGCTAT TGTAGTGCGC
TGCCAGACGG TCAATCATCA GGTGCAAGTT TCCGTCGAAA ACCCCGGTAC GCCCATTGCG
CCCGAGCACT TACCGCGATT GTTTGACCGT TTCTATCGCG TTGCCCCTTC CCGCCAGCGA
AAAGGCGAAG GTAGCGGCAT TGGGCTGGCG ATAGTGAAAT CGATTGTTGT CGCGCATAAA
GGCACGGTTG CAGTAACGTC AGATGCGCGG GGGACAAGGT TTGTG
 
Protein sequence
MVSKPFQRPF SLATRLTFFI SLATIAAFFA FAWIMIHSVK VHFAEQDIND LKEISATLER 
VLNHPDETQA RRLMTLEDIV SGYSNVLISL ADSQGKTVYH SPGAPDIREF TRDAIPDKDA
QGGEVYLLSG PTMMMPGHGH GHMEHSNWRM INLPVGPLVD GKPIYTLYIA LSIDFHLHYI
NDLMNKLIMT ASVISILIVF IVLLAVHKGH APIRSVSRQI QNITSKDLDV RLDPQTVPIE
LEQLVLSFNH MIERIEDVFT RQSNFSADIA HEIRTPITNL ITQTEIALSQ SRSQKELEDV
LYSNLEELTR MAKMVSDMLF LAQADNNQLI PEKKMLNLAD EVGKVFDFFE ALAEDRGVEL
RFVGDKCQVA GDPLMLRRAL SNLLSNALRY TPPSEAIVVR CQTVNHQVQV SVENPGTPIA
PEHLPRLFDR FYRVAPSRQR KGEGSGIGLA IVKSIVVAHK GTVAVTSDAR GTRFV