Gene ECD_00531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_00531 
SymbolcusS 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp552004 
End bp553452 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content52% 
IMG OID 
Productsensory histidine kinase in two-component regulatory system with CusR, senses copper ions 
Protein accessionACT42411 
Protein GI253976741 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCAGTA AGCCATTTCA GCGCCCGTTT TCGCTGGCAA CCCGCCTGAC CTTTTTTATC 
AGCCTGGCCA CCATCGCGGC GTTTTTCGCC TTTGCATGGA TCATGATCCA CTCAGTAAAA
GTTCATTTTG CCGAGCAGGA TATTAATGAT TTAAAAGAGA TTAGCGCCAC CCTTGAACGG
GTACTAAATC ACCCTGACGA AACGCAAGCC CGACGCTTAA TGACGCTGGA AGATATCGTC
AGTGGTTATT CCAATGTGTT GATTTCCCTG GCAGATAGTC AGGGTAAAAC GGTGTATCAC
TCCCCCGGTG CGCCGGATAT CCGCGAGTTT ACGCGTGACG CCATACCCGA TAAAGACGCT
CAGGGTGGCG AGGTGTATCT CCTTTCCGGC CCGACGATGA TGATGCCAGG CCACGGTCAC
GGGCATATGG AACACAGCAA CTGGCGGATG ATTAACTTGC CGGTTGGCCC GTTGGTGGAC
GGCAAACCGA TTTATACGCT CTACATCGCG CTTTCGATCG ATTTTCATCT TCATTACATA
AATGATTTGA TGAATAAACT TATTATGACC GCATCGGTAA TCAGCATCCT GATCGTCTTT
ATCGTACTGT TGGCGGTACA TAAAGGTCAC GCGCCGATCC GCAGCGTCAG CCGTCAAATC
CAGAATATTA CCTCGAAAGA TCTCGACGTT CGCCTCGACC CGCAGACCGT GCCGATTGAG
CTGGAACAGC TGGTACTGTC GTTCAACCAT ATGATCGAGC GTATTGAGGA TGTCTTTACC
CGCCAGTCCA ATTTCTCAGC GGATATCGCT CACGAAATTC GCACGCCGAT TACCAATCTC
ATCACGCAAA CGGAAATCGC CCTCAGCCAG TCTCGCAGCC AGAAGGAGCT GGAAGATGTG
CTCTACTCTA ATCTCGAAGA GCTGACGCGA ATGGCGAAAA TGGTCAGCGA TATGCTGTTT
CTCGCTCAGG CCGATAACAA CCAGCTAATC CCCGAAAAGA AAATGCTCAA CCTGGCGGAT
GAAGTCGGCA AAGTGTTCGA TTTTTTCGAG GCGTTAGCGG AAGATCGCGG CGTGGAGTTG
CGATTTGTTG GCGACAAATG TCAGGTTGCG GGCGATCCGC TGATGCTGCG TCGGGCGTTA
AGCAACCTGC TTTCTAATGC CCTGCGTTAT ACGCCACCCA GTGAGGCTAT TGTAGTGCGC
TGCCAGACGG TCAATCATCA GGTGCAAGTT TCCGTCGAAA ACCCCGGTAC GCCCATTGCG
CCCGAGCACT TACCGCGATT GTTTGACCGT TTCTATCGCG TTGCCCCTTC CCGCCAGCGA
AAAGGCGAAG GTAGCGGCAT TGGGCTGGCG ATAGTGAAAT CGATTGTTGT CGCGCATAAA
GGCACGGTTG CAGTAACGTC AGATGCGCGG GGGACAAGGT TTGTGATTAT GCTGCCGGAG
AGAGAGTGA
 
Protein sequence
MVSKPFQRPF SLATRLTFFI SLATIAAFFA FAWIMIHSVK VHFAEQDIND LKEISATLER 
VLNHPDETQA RRLMTLEDIV SGYSNVLISL ADSQGKTVYH SPGAPDIREF TRDAIPDKDA
QGGEVYLLSG PTMMMPGHGH GHMEHSNWRM INLPVGPLVD GKPIYTLYIA LSIDFHLHYI
NDLMNKLIMT ASVISILIVF IVLLAVHKGH APIRSVSRQI QNITSKDLDV RLDPQTVPIE
LEQLVLSFNH MIERIEDVFT RQSNFSADIA HEIRTPITNL ITQTEIALSQ SRSQKELEDV
LYSNLEELTR MAKMVSDMLF LAQADNNQLI PEKKMLNLAD EVGKVFDFFE ALAEDRGVEL
RFVGDKCQVA GDPLMLRRAL SNLLSNALRY TPPSEAIVVR CQTVNHQVQV SVENPGTPIA
PEHLPRLFDR FYRVAPSRQR KGEGSGIGLA IVKSIVVAHK GTVAVTSDAR GTRFVIMLPE
RE