Gene SbBS512_E4495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4495 
SymbolzraS 
ID6272252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4202783 
End bp4204159 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content54% 
IMG OID641728287 
Productsensor protein ZraS 
Protein accessionYP_001882689 
Protein GI187732012 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.000791685 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTTTA TGCAACGTTC TAAAGACTCC TTAGCTAAAT GGTTAAGCGC GATCCTCCCC 
GTGGTCATTG TTGGGCTGGT AGGGTTGTTT GCGGTGACGG TGATTCGCGA TTATGGGCGC
GAGACTGCCG CCGCCAGACA AACGCTGCTG GAAAAAGGCA GTGTACTTAT TCGCGCTCTT
GAATCCGGCT CGCGCGTCGG CATGGGGATG CGCATGCATC ATGCGCAGCA GCAGGCCTTA
CTGGAAGAAA TGGCCGGGCA GCCTGGAGTA CGTTGGTTTG CGGTCACGGA TGAACAAGGA
ACAATCGTGA TGCATAGCAA CTCCGGCATG GTGGGAAAAC AGATTTATTC CCCGCAGGAA
ATGCAGCAGT TACATCCGGG AGATGAAGAA GCGTGGCGGC GGATCGATAG CGCAGACGGT
GAGCCTGTTC TGGAAATTTA TCGCCAGTTT CAACCGATGT TTGGTGCTGG AATGCACCGG
ATGCGCCATA TGCAGCAGTA TGCCGCGACA CCACAAGCAA TTTTCATCGC TTTCGACGCC
AGTAATATTG TGAGTGCCGA AGATCGTGAG CAGAGAAACA CCCTGATTAT CCTCTTCGCC
CTGGCGACGG TCTTGCTGGC AAGCGTATTG TCATTCTTCT GGTATCGCCG CTATCTGCGC
TCGCGCCAGC TTCTACAAGA TGAAATGAAG CGCAAAGAGA AGCTGGTGGC ACTGGGGCAT
CTGGCGGCAG GCGTTGCCCA CGAAATCCGT AATCCACTTT CCTCAATTAA AGGGCTGGCG
AAATACTTTG CCGAACGCGC GCCAGCAGGG GGAGAAGCGC ATCAACTGGC GCAGGTGATG
GCGAAAGAAG CCGACCGTTT AAACCGCGTG GTAAGCGAGT TGCTGGAACT GGTTAAGCCA
ACGCATCTGG CTTTGCAGGC GGTGGATCTC AACACGCTGA TTAACCACTC ATTACAGCTG
GTAAGCCAGG ATGCAAACAG CCGGGAGATC CAGTTACGCT TTACCGCCAA CGACACATTA
CCGGAAATTC AGGCCGATCC GGACAGGCTG ACTCAGGTCC TGTTGAATCT CTATCTCAAT
GCTATTCAGG CGATTGTTCA GCATGGCGTG ATTAGCGTGA CGGTCAGCGA AAGCGGCGCG
GGCGTGAAAA TCAGCGTTAC CGACAGCGGT AAGGGAATTG CGGCAGATCA GCTTGAAGCC
ATCTTCACTC CGTACTTCAC CACCAAAGCC GAAGGCACCG GATTGGGGCT GGCGGTCGTG
CATAATATTG TTGAACAACA CGGTGGTACA ATTCAGGTCG CAAGCCAGGA GGGAAAAGGC
TCAACGTTCA CCCTCTGGCT TCCGGTCAAT ATTACGCGTA AGGACCCACA AGGATGA
 
Protein sequence
MRFMQRSKDS LAKWLSAILP VVIVGLVGLF AVTVIRDYGR ETAAARQTLL EKGSVLIRAL 
ESGSRVGMGM RMHHAQQQAL LEEMAGQPGV RWFAVTDEQG TIVMHSNSGM VGKQIYSPQE
MQQLHPGDEE AWRRIDSADG EPVLEIYRQF QPMFGAGMHR MRHMQQYAAT PQAIFIAFDA
SNIVSAEDRE QRNTLIILFA LATVLLASVL SFFWYRRYLR SRQLLQDEMK RKEKLVALGH
LAAGVAHEIR NPLSSIKGLA KYFAERAPAG GEAHQLAQVM AKEADRLNRV VSELLELVKP
THLALQAVDL NTLINHSLQL VSQDANSREI QLRFTANDTL PEIQADPDRL TQVLLNLYLN
AIQAIVQHGV ISVTVSESGA GVKISVTDSG KGIAADQLEA IFTPYFTTKA EGTGLGLAVV
HNIVEQHGGT IQVASQEGKG STFTLWLPVN ITRKDPQG