Gene SbBS512_E1360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1360 
SymboldhaR 
ID6271835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1238520 
End bp1240439 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content51% 
IMG OID641725471 
ProductDNA-binding transcriptional regulator DhaR 
Protein accessionYP_001879981 
Protein GI187732464 
COG category[K] Transcription
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3284] Transcriptional activator of acetoin/glycerol metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGCG CTTTTAACAA CGATGGTCGG GGCATATCTC CCTTAATTGC AACCTCCTGG 
GAGCGATGCA ATAAGCTGAT GAAACGGGAG ACATGGAGCG TACCACATCA GGCCCAGGGC
GTGACATTTG CTTCTATTTA TCGGCGTAAG AAAGCGATGC TGACGCTCGG GCAGGCTGCG
CTGGAAGATG CCTGGGAATA TATGGCACCG CGAGAGTGTG CGCTGTTTAT CCTCGATGAA
ACCGCCTGCA TTCTCAGCCG TAATGGCGAT CCGCAAACCT TGCAGCAGCT AAGTGCACTT
GGATTCCATG ACGGCACGTA TTGCGCCGAG GGAATTATTG GTACTTGTGC GCTATCGTTA
GCGGCTATCT CTGGTCAGGC CGTGAAAACG ATGGCCGATC AACATTTCAA ACAGGCACTC
TGGAACTGGG CCTTTTGTGC AACGCTGTTG TTTGACAGCA AGGGCCGATT GACGGGAACA
ATAGCGCTGG CGTGTCCGGT TGAGCAAACT ACCGCAGCTG ATTTGCCGTT GACGTTGGCA
ATCGCCCGCG AGGTCGGAAA TTTACTGCTG ACGGACAGTT TGCTCACTGA AACTAACCGT
CATTTAAATC AACTTAATGC CCTGTTAGAA AGTATGGATG ATGGCGTGAT TAGCTGGGAC
GAGCAGGGTA ATTTGCAATT TATCAATGCC CAGGCGGCGC GGGTCTTGCG CCTTGACGCG
ACGGCAAGTC AGGGACGGGC AATCACTGAA CTCTTAACGT TACCCGCCGT ATTGCAACAA
GCAATAAAAC AGGCACATCC GCTCAAACAC GTAGAAGCAA CCTTTGAAAG TCAGCATCAG
TTTATTGATG CGGTGATAAC CCTTAAACCG ATAATAGAAA CGCAGGGAAC CAGCTTTATT
TTGTTGCTCC ATCCTGTGGA ACAGATGCGG CAGTTGATGA CCAGTCAATT AGGAAAAGTC
AGCCATACCT TTGCTCATAT GCCACAGGAC GATCCGCAAA CCCGCCGCTT GATTCATTTT
GGTCGCCAGG CGGCGCGCAG TAGCTTTCCT GTCCTGCTTT GTGGAGAAGA GGGCGTGGGC
AAGGCACTGC TAAGTCAGGC AATTCATAAT GAAAGCGAGC GTGCTGCAGG TCCTTATATC
GCCGTCAATT GTGAGTTATA TGGTGATGCG GCACTGGCGG AAGAATTTAT TGGTGGCGAT
CGTACGGACA ATGAAAATGG CCGTCTGAGT CGGCTGGAAC TGGCGCACGG CGGCACGCTG
TTTCTTGAAA AGATTGAATA TCTGGCGGTG GAGTTACAGT CTGCTTTGCT TCAGGTTATC
AAGCAGGGTG TTATCACGCG ACTGGATGCG CGGCGTTTAA TACCAATTGA TGTCAAAGTG
ATTGCAACAA CGACCGCGGA CCTCGCAATG CTGGTGGAAC AAAATCGTTT TAGTCGCCAG
CTGTATTACG CGCTGCATGC ATTTGAAATT ACCATCCCGC CTCTGCGTAT GCGGCGTGGC
AGCATTCCGG CGCTGGTGAA TAACAAATTA CGCAGTCTTG AAAAACGCTT CTCTACGCGG
CTGAAAATTG ATGACGATGC CCTCGCTCGC CTGGTTTCTT GTGCATGGCC AGGCAACGAT
TTTGAACTTT ACAGCGTCAT CGAGAATCTT GCTCTGAGTA GTGATAATGG GCGCATTCGC
GTCAGTGATT TGCCGGAACA TCTGTTTACC GAGCAGGCAA CAGATGATGT CAGCGCCACC
CGCCTTTCCA CCAGTCTGTC ATTTGCGGAA GTTGAAAAAG AGGCAATTAT TAACGCAGCC
CAGGTCACAG GCGGTCGCAT TCAGGAAATG TCGGCTTTAC TTGGAATCGG CCGCACTACG
CTGTGGCGGA AAATGAAGCA ACATGGCATT GATGCAGGGC AGTTTAAGCG CCGGGTATGA
 
Protein sequence
MSGAFNNDGR GISPLIATSW ERCNKLMKRE TWSVPHQAQG VTFASIYRRK KAMLTLGQAA 
LEDAWEYMAP RECALFILDE TACILSRNGD PQTLQQLSAL GFHDGTYCAE GIIGTCALSL
AAISGQAVKT MADQHFKQAL WNWAFCATLL FDSKGRLTGT IALACPVEQT TAADLPLTLA
IAREVGNLLL TDSLLTETNR HLNQLNALLE SMDDGVISWD EQGNLQFINA QAARVLRLDA
TASQGRAITE LLTLPAVLQQ AIKQAHPLKH VEATFESQHQ FIDAVITLKP IIETQGTSFI
LLLHPVEQMR QLMTSQLGKV SHTFAHMPQD DPQTRRLIHF GRQAARSSFP VLLCGEEGVG
KALLSQAIHN ESERAAGPYI AVNCELYGDA ALAEEFIGGD RTDNENGRLS RLELAHGGTL
FLEKIEYLAV ELQSALLQVI KQGVITRLDA RRLIPIDVKV IATTTADLAM LVEQNRFSRQ
LYYALHAFEI TIPPLRMRRG SIPALVNNKL RSLEKRFSTR LKIDDDALAR LVSCAWPGND
FELYSVIENL ALSSDNGRIR VSDLPEHLFT EQATDDVSAT RLSTSLSFAE VEKEAIINAA
QVTGGRIQEM SALLGIGRTT LWRKMKQHGI DAGQFKRRV